Next Generation Sequencing
Support and Educational Content

Target enrichment identifies mutations that confer fitness effects

xGen® Lockdown® Probes facilitate cost-effective capture of defined genomic regions

Researchers in Prof Jeffrey Barrick’s laborato­ry (University of Texas at Austin, Austin, TX, USA) use experiments with microorganisms to study evolution in the laboratory as it happens. The team investigates a variety of biological mechanisms that can influence evolutionary processes ranging from changes in mutation rates to expanding genetic codes with unnatural amino acids.

One line of their research develops methods to track rare mutations in an evolving pop­ulation to assess effects on fitness from sequence alone. Such information would allow the researchers to reduce the many steps involved in identifying mutations, recreating the mutations in strains, and observing how these mutations alter fitness when introduced into an organism. As a result of extensive laboratory evolution studies of E. coli B strain REL606 [1], Prof Barrick’s team has identified several commonly mutated genes within which a number of different mutations can confer fitness benefits.

We spoke to Dr Daniel Deatherage, a postdoctoral fellow in Prof Barrick’s lab, who is investigating the genetic diversity of these adaptive mutations. He and his colleagues work with 18 different bacterial populations, half of which are derived from REL606. The other half are from a similar strain, differing only in a mutation in the mutS gene, which makes them mutate at high frequency. The scientists included this hyper-mutating strain to ensure they would see sufficient mutations for their analyses. While insufficient mutations in REL606 did not turn out to be an issue, use of both strain backgrounds has enabled the group to compare the genetic diversity that arises as mutants compete in each type of popula­tion. Their preliminary observations were as expected—more mutations occurred early and more interesting competition events occurred in the hyper-mutating populations.

The genomic regions being studied by the scientists are not hotspots for mutations. Rather, on examining previously published sequencing data from 12 replicate E. coli REL606 populations evolved for 20,000 generations, Dr Deatherage identified genes that are consistently mutated among all replicates. Thus, in up to 12 independent experiments, a mutation in a specific gene was highly beneficial and able to sweep through the entire population. So the team was confident that under these conditions early mutations in these genes would arise in their next set of evolution experiments. By using target enrichment procedures they hoped to achieve adequate sensitivity to detect many competing mutations in these genes when the mutations were still “new” and very rare within the population.

Sequencing populations to identify rare mutations

To sequence the commonly mutated genes, researchers in the Barrick laboratory performed target enrichment of these gene regions using xGen® Lockdown® Probes from IDT. This allowed them to focus sequencing runs on only the regions of interest, enabling greater depth of coverage, reducing the amount of data collection and analysis, and saving on the cost of sequencing reagents.

The protocol started with growing successive generations of the 18 individual cell populations in liquid culture and transferring 100 μL culture into 10 mL fresh medium each day (approximately 6.7 generations per transfer). The cells were periodically plated during culture to monitor for contamination, and aliquots of the cells were frozen daily. A trick employed by the scientists was to inoculate each population with araA+ and araA– clones, which when grown on tetra­zolium indicator agar, form white and red colonies, respectively. Use of araA+/araA– provided a phenotype that distinguished the 2 cell types in a population and enabled tracking of mutation frequency; e.g., when red colonies were no longer visible because they had been outcompeted by white colonies, the scientists knew that at least one mutation had swept through the population as a whole. The cells were passaged for ~500 generations (~75 passages).

Genomic DNA from the 500th generation of each population was isolated for enrichment and sequencing. An Illumina library prepara­tion was made using custom adapters based on Illumina adapters, but with different bar­coding sequences and other minor modifi­cations. Approximately 120 xGen Lockdown Probes were then used to enrich 8 regions that contain the gene sequences of interest, following the Nimblegen SeqCap® protocol and using appropriate blocking oligos for the modified Illumina adapters. Hybridiza­tion was performed for 72 hours, followed by stringent washes with hard vortexing us­ing reagents heated to 90°C. Dr Deatherage revealed, “The whole time I was purifying the DNA from the pulldown [target capture] I was thinking, “There is no way there is go­ing to be anything left here.” I was vortexing so hard; but I had faith in what other people have done. Sure enough when we got the sequencing back, we had very nice coverage on almost all of our samples.” Subsequently, sequencing was performed on the enriched DNA using an Illumina HiSeq® instrument.

Sequencing outcomes

Dr Deatherage and colleagues identified populations of interest that they believed had higher frequencies of mutation at the endpoints (Figure 1). As a next step, the researchers will trace the progression of those populations using the frozen stocks to examine the frequency of mutations at clos­er intervals and to monitor the mutations as they rose to prominence, and possibly fell to extinction, in the populations over time. Ultimately, the scientists hope to correlate the increase in mutation frequency, when the mutations were new and ultra-rare, with their observed fitness effects. They are looking forward to eliminating competition assays, which are more tedious—it can take several weeks to create a mutant, and then 1 week to perform the assay. 

Figure 1. Mutational frequencies per gene from mixed population sequencing of 500 generation E. Coli populations. “A” populations of E. coli REL606 with typical mutation rates (ancestral rates), and “M” populations founded with cells harboring mutations in the mutS gene making them hypermutators (mutator rates), were indi­vidually grown through approximately 500 generations (see text). Barcoded adapters were added to genomic DNA from these populations to make mixed population sequencing possible. Use of target capture probes (xGen® Lockdown® Probes) from IDT allowed the researchers to focus sequencing efforts on genes commonly found to contain beneficial mutations. (A) Larger cumulative mutational frequencies in the mutS populations show both the possibility of multiple mutations arising within a single gene (for cumulative frequencies exceeding 1), as well as more diverse subpopulations competing (increased cumulative frequency irrespective of gene). (B) Meanwhile, the increased maximum frequency of mutations among mutS populations suggests that these mutations arose earlier giving them more time to reach higher frequencies. Multiple genes reached higher frequencies in mutS populations, further showing how increased mutation rates can more quickly generate multiple beneficial mutations within a single subpopulation.

Dr Deatherage is excited about the idea of being able to get the same answers from sequencing. He stated, “Hopefully, one day we won’t even need competition assays; instead, we will be able to track the frequency of a mutation using targeted capture assays over a given time course and gauge its importance based on the fitness effect.”

Mutations confer fitness benefits

The scientists see evidence of clonal interfer­ence—multiple beneficial mutations within an asexual population competing with each other to take over the population. They also see evidence of multiple mutations within a single gene, and high frequency mutations in multiple genes within a single population, which raises the possibility that secondary mutations (rather than just a single mutation) are contributing to the fixation of mutations in a population. Dr Deatherage believes this process may be causing suboptimal mutations to fix in a population because if, for example, one mu­tation of topA provides a 10% fitness effect and another provides an 11% fitness effect, resulting in a difference of only 1% between the two, these mutations should coexist for many generations. While coexisting, both subpopulations will continue to evolve. If the subpopulation with the 10% fitness increase picks up a mutation in another gene, e.g., spoT, which confers an additional 10% fitness benefit, that cumulative fitness advantage of 20% will drive that subpopu­lation towards fixation markedly faster. The subpopulation with the 11% fitness effect from topA becomes in danger of extinction if it does not pick up another beneficial mutation, despite the topA allele conferring a superior fitness benefit on its own.

Data analysis challenges

Several challenges exist with analyzing the sequencing data. A major issue is the error rate current NGS sequencing methods incur, particularly when coupled with relatively low sequencing read coverage. Illumina’s reported error rate per base per read works out to 1 in 1000 bases. Thus, a base change present in 1 in 100 reads cannot be confi­dently classified as a mutation vs. an error in sequencing because this would be expected to happen by chance at many positions in the reference genome. However, an increased number of reads at this position (e.g., 100 identical errors in 10,000 reads), would allow one to confidently conclude that there was a mutation present at a frequency of ~1%. Deeper coverage can the­oretically allow mutation calls to be made at frequencies approaching the error rate.

The team intends to further improve the confidence of their calls through the use of “duplex sequencing” [2]. Duplex sequencing works by using modified Illumina adapters that have 12 bases of random sequence at the 3’ end. These 12 random bases are sequenced as part of the read and can be used to identify all reads that arise from the same molecule of gDNA. By comparing all reads that correspond to a single molecule of DNA, consensus sequences can be pro­duced confidently, eliminating errors that occur in a single read due to sequencing or PCR errors. Duplex sequencing reports a theoretical error rate of 1 per 1 billion bases of sequence.

xGen® Lockdown® Probes as the tool of choice for target capture

When Dr Deatherage started researching target capture for this project, he imme­diately eliminated chip-based methods where an excess of DNA is used to saturate the limited number of probes on the chip, resulting in recovery of only a portion of the DNA. He was concerned that this could lead to mutated sequences not binding as well as reference (wild type) sequences and, therefore, being selectively excluded from capture. Additionally, as most existing target enrichment panels were focused on human exome–sized enrichment, targeting at least 25 Mb (6X the size of the 4.6 Mb E. coli ge­nome they were studying) was pointless. Dr Deatherage commented, “That is one very nice thing about the IDT probes—we could start from a single biotinylated probe and expand to a whole pool of these [xGen] Lockdown Probes. The targeted range was very nice.”

“Eventual winners, eventual losers” and cancer applications

Some members of Prof Barrick’s lab are in­terested in investigating so-called “eventual winner, eventual loser” situations, where one subpopulation can initially have a fitness advantage over a lesser subpopulation due to different mutations that each has accumulated, but after more time the lesser subpopulation has a better chance of ex­periencing more beneficial mutations and, therefore, reliably outcompeting the other population over the long term [3]. Dr Death­erage described this as a specific, highly beneficial mutation limiting the ability for further evolution in the losers. The research­ers plan to examine additional cases of the eventual winner, eventual loser dynamic to understand the molecular mechanisms that lead to interactions between mutations that result in this “dead-end” effect.

Another application of this research is mon­itoring the evolution of cancer. Considering cancer to be an evolutionary disease with multiple mutations that allow the cells to escape growth restrictions, the team plans to monitor the evolution of tumors. The sci­entists can identify key oncogenes that are mutated in cancers, and then capture them with xGen Lockdown Probes for further study. They should then be able to monitor mutations in these targets at lower frequen­cies than is currently possible due to the large size of the human genome, a prospect Dr Deatherage finds very exciting.


  1. Barrick JE, Yu DS, et al. (2009) Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature, 461:1243–1247.
  2. Schmitt MW, Kennedy SR, et al. (2012) Detec­tion of ultra-rare mutations by next-generation sequencing. Proc Natl Acad Sci USA, 109(36):14508–14513.
  3. Woods RJ, Barrick JE, et al. (2011) Second-order selection for evolvability in a large Escherichia coli population. Science, 331:1433–1436.

Profile: The Barrick Laboratory; University of Texas (UT) at Austin

The Barrick Lab: Prof Jeffrey Barrick (back row, center); Dr Daniel Deatherage (front row, second from left).

Prof Barrick’s lab (www.barrick­ is interested in under­standing evolution as a creative force. The lab is currently using next generation DNA sequenc­ing and high throughput geno­typing to examine mutations within E. coli populations from a 20-year evolution experiment to understand the evolutionary forces at work. In particular, they hope to understand how differ­ences in evolutionary potential can arise in contending lineages in these populations and deter­mine the eventual winners.

Dr Daniel Deatherage is a postdoctoral associate in the Barrick lab. Dr Deatherage did his PhD under Tim Huang at Ohio State University, where he worked on epi­genetic regulation of miRNAs and analyzed ChIP sequencing data on TGFβ for signaling in ovarian cancer. During his PhD he developed an interest in sequencing technology and was intrigued by the information NGS could provide. He was attracted by Prof Barrick’s research at UT Austin because of their shared interest in NGS and the excellent program Prof Barrick had put together for obtaining and analyzing sequencing data. Dr Deatherage looks forward to developing the cancer studies as independent research.

Product focus: Target capture reagents

xGen® Lockdown® Probes—target capture probe pools for NGS

xGen Lockdown Probes are pools of individually synthesized, quality controlled, and normalized hybridization probes. Use them to generate custom capture panels for targeted sequencing to enhance the performance of existing panels. xGen Predesigned Gene Capture Probe Pools are available for any human RefSeq coding gene. Select from predesigned and custom probes that offer:

  • Sensitive detection of SNPs, indels, CNV, LOH, and translocations
  • GMP compliance for clinical and diagnostics research
  • Flexibility to augment existing panels or create completely custom panels
  • Quick delivery

Discover more about xGen Lockdown Probes.

xGen Lockdown Panels

xGen Lockdown Panels are preconfigured, validated, and stocked pools of xGen Lockdown Probes for targeted next generation sequencing of defined gene families:

  • xGen Exome Research Panel
  • xGen Acute Myeloid Leukemia Panel
  • xGen Pan-Cancer Panel
  • xGen Inherited Diseases Panel
  • xGen Human ID Research Panel
  • xGen Human mtDNA Research Panel

Discover more about xGen Lockdown Panels.

xGen Lockdown Reagents—hybridization and wash kit

xGen Lockdown Reagents have been optimized to deliver deep, even coverage of targets captured using xGen Lockdown Probes and Panels. Achieve uniform coverage with hybridization and wash buffers that are optimized for target enrichment using xGen Lockdown Probes and Panels. A short, 4-hour hybridization protocol generates results quickly.

Discover more about xGen Lockdown Reagents.

xGen Blocking Oligos

xGen Blocking Oligos prevent adapter cross hybridization and minimize off-target capture, increasing specificity during targeted sequencing. Universal Blocking Oligos—TS Mix are designed for use with Illumina barcoded adapters. This ready-to-use mix effectively blocks dual- or single-index adapters to significantly improve on-target performance.

  • xGen Standard Blocking Oligos
  • xGen Custom Blocking Oligos
  • xGen Blocking Oligos for diagnostics—contact for information
  • xGen Blocking Oligos for Ion Torrent adapters

Discover more about xGen Blocking Oligos.

Additional reading

Analyzing the exome—focus your NGS analysis with high-performance target capture—Webinar review: Targeted sequencing capture, using probe pools or panels, can increase read depth and the number of samples per run, while decreasing sequencing cost and simplifying data analysis. See how using individually synthesized, quality checked, DNA target capture probes (xGen® Lockdown® Probes) covering the human exome (xGen Exome Research Panel) performs across a variety of metrics; and compares to other available exome panels.

NGS target capture recommendations for FFPE samples—Webinar review: Learn how it is possible to create high quality target capture libraries from formalin-fixed, paraffin-embedded samples. Dr Kristina Giorda presents an FFPE sample workflow with a concise explanation of DNA quality analysis and how quality assessment can be used to guide the amount of DNA input for NGS library preparation.

NGS detection of low frequency genetic variants using novel, molecular sequencing adaptors—Webinar review: Watch this webinar recording to learn about unique molecular adaptors and a high-performance target capture method for NGS analysis of low frequency variants.

Targeting cancer pathways: Sensitive, comprehensive detection of genomic alterations using a custom NGS panel—Citation summary: Learn how researchers use xGen Lockdown Probes to screen cancer samples for key genes related to targeted cancer therapies.

Review other DECODED Online newsletter articles on NGS applications.

You can also browse our DECODED Online newsletter for additional application reviews, lab tips, and citation summaries to facilitate your research.

Author: Nicola Brookman-Amissah, PhD, is a Scientific Writer at IDT.

© 2013, 2017 Integrated DNA Technologies. All rights reserved. Trademarks contained herein are the property of Integrated DNA Technologies, Inc. or their respective owners. For specific trademark and licensing information, see

xGen® Blocking Oligos

Adapter blocking oligos increase the number of on-target reads by preventing non-specific binding during hybridization.

Find blockers for your platform ≫

Related Articles

Target Enrichment Facilitates Focused Next Generation Sequencing

The rationale and benefits of enriching subsets of the genome (target enrichment by hybrid capture) prior to sequencing.

Read more ≫

Improving Uniform Coverage of Targeted Sequences for NGS

The challenges faced in obtaining uniform coverage of NGS data and how IDT xGen® Lockdown® Probes are uniquely positioned to facilitate uniform sequence coverage.

Read more ≫

Towards Providing Personalized Medicine—Considerations for Reliable NGS Data

Geneseeq Technology, Inc. demonstrates how they improved their target capture methods to increase accuracy in clinical diagnostics by using optimized blocking oligos and stringent hybridization conditions.

Read more ≫

Delivering Comprehensive Genomic Profiling for Clinical Cancer Care Using Targeted Sequencing

Read how scientists at Foundation Medicine, Inc use hybrid selection in their FoundationOne® Test to help clinicians select patient specific treatment options.

Read more ≫

Insertion Site Detection and Targeted RNA Capture Using Next Generation Sequencing

Scientists at Cofactor Genomics use in-solution hybridization to focus on regions of interest for next generation sequencing.

Read more ≫