Synthetic Biology
Support and Educational Content

Observing subpopulations within cloned plasmids using NGS analysis

Why IDT is replacing Sanger sequencing with NGS for gene synthesis confirmation

Generating plasmid clones of DNA sequences is one of the most common and important methods in biology research. These methods are indispensable, because, with modest effort, propagation of a single DNA plasmid that was transformed into E. coli provides an indispensable tool capable of producing millions of exact copies, or clones, of that DNA. However, it has also been shown that the fidelity of DNA replication in E. coli is not perfect [1]. Secondary structure, and damage to the DNA through oxidation and other sources can lead to errors in replication [2]. When these errors occur early during colony propagation, they can become a significant proportion of the plasmid sequence population by the time colonies are selected for sequence purification.

Because Sanger sequencing relies on a consensus signal generated from a population of molecules, small variations within the population can be obscured within the signal. In contrast, next generation sequencing (NGS) uses massively parallel analysis of sequence traces for thousands of individual molecules. This makes it possible to detect small variances within populations of molecules.

Examples of sequencing results from samples with contaminating subpopulations can be seen in Figures 1 and 2. In Figure 1, the Sanger trace data for a synthetic gene allows detection of a mixed population of plasmid sequences, visible as strong overlapping peaks of different bases (Figure 1B). A quantitative analysis of the same plasmid preparation using NGS shows that this subpopulation is actually ~17% of the total population (Figure 1A). Figure 2 shows data from a cloned and purified plasmid that contains a subpopulation with a sequence mutation(s) comprising ~7% of the total sequence population. While detectable by NGS analysis (Figure 2A), this contamination is not visible by Sanger sequencing of the same DNA preparation (Figure 2B). While many subpopulations are identifiable by Sanger sequencing, surprisingly large subpopulations go undetected. Minor subpopulations, however, are easily identified in NGS data.

Clonally purified plasmid with a contaminating subpopulation visible in both Sanger sequencing and NGS data

Figure 1. Clonally purified plasmid with a contaminating subpopulation visible in both Sanger sequencing and NGS data. A) A gene sequenced by NGS reveals a subpopulation of approximately 17% incorrect clones that has a 14 bp deletion between bases 431 and 445. B) Bidirectional Sanger sequencing data of the same sample also shows two populations of DNA in the traces. The bottom Sanger trace shows mixed populations beginning at base 431 and continuing in the reverse direction while the top Sanger trace shows the populations are identical from base 445 onwards. The expected sequence is AACAGCAACAACTG.

Clonally purified plasmid with a contaminating subpopulation visible in NGS data but not Sanger sequencing trace

Figure 2. Clonally purified plasmid with a contaminating subpopulation visible in NGS data but not Sanger sequencing trace. A) A Gene sequenced by NGS shows a ~7% error between bp 400 and 500. B) Bidirectional Sanger sequencing of the same sample, shows no error within the same region. The expected sequence is GTTAACGGTATGCACGCGCCGGGTCTGTGCG.

NGS is more sensitive for detection of subpopulations, but Sanger is still useful

While Sanger sequencing has been the standard for sequence confirmation for decades, these examples illustrate the superior ability of NGS analysis to detect subpopulations within cloned plasmid preparations that would otherwise go undetected by Sanger sequencing.

It is important to note that Sanger sequencing continues to be a useful tool for many labs, and for molecular cloning applications. However, as with any method, it is necessary to understand what the technology limitations are, and to verify important results through replication and/or with other methods.


  1. Kunkel TA. (2004) DNA replication fidelity. J Biol Chem, 279(17):16895–16898.

  2. Schaaper RM, Dunn RL. (1991) Spontaneous mutation in the Escherichia coli lacI gene. Genetics, 129(2):317–326.

Product focus

gBlocks® Gene Fragments are double-stranded, sequence-verified, DNA genomic blocks, 125–2000 bp in length, that can be shipped in 2–5 working days for affordable and easy gene construction or modification. These dsDNA fragments have been used in a wide range of applications including, gene construction, CRISPR-mediated genome editing, antibody research, codon optimization, mutagenesis, and aptamer expression. They can also be used for generating qPCR standards.

Learn more about gBlocks Gene Fragments at

Additional reading

The gene construction revolution—See how use of high-quality, custom dsDNA fragments as a starting material allows you to turn what might otherwise be multi-step cloning assemblies into simpler reactions. You can often just order an entire target sequence that is ready for cloning or other uses.

iGEM students engineer biological tools for a better world—Projects from 2 of the prize-winning 2013 iGEM teams show how non-standard natural and synthetic amino acids can be used in 1) peptide synthesis, and 2) tuberculosis monitoring and treatment. Both projects make use of gBlocks Gene Fragments to speed up construct assembly.

Synthetic biology educational content—View additional articles about synthetic biology applications, protocols, and products.

See all DECODED Online articles.

Authors: Adam Clore, PhD, is the manager of synthetic biology technical support, and Tina Lubke is a process development manager at IDT.

© 2016 Integrated DNA Technologies. All rights reserved. Trademarks contained herein are the property of Integrated DNA Technologies, Inc. or their respective owners. For specific trademark and licensing information, see


gBlocks® Gene Fragments

Double-stranded DNA up to 3,000 kb—great for easy gene construction, CRISPR genome editing, and more.

Learn how ≫

Related Articles

CRISPR and Cas9 for Flexible Genome Editing

Applications of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) for targeted gene editing or removal of sequences.

Read more ≫

Cloning Strategies Part 3: Blunt-End Cloning

Tips and tricks for blunt-end cloning.

Read more ≫

Building Biological Factories for Renewable and Sustainable Products

Amyris uses genetic engineering and screening technologies to design microorganisms that convert plant-sourced sugars into target molecules, including pharmaceuticals, biofuels, polymers, flavors, and fragrances.

Read more ≫

iGEM Students Engineer Biological Tools for a Better World

iGEM teams show how non-standard natural and synthetic amino acids can be used in 1) peptide synthesis, and 2) disease monitoring and treatment. Both projects make use of gBlocks® Gene Fragments to speed construct assembly.

Read more ≫

Creating a Synthetic Immune System for Optimized Immune Profiling

A high-throughput method for sequencing and quantification of rearranged antigen receptors on T- and B-cells using gBlocks Gene Fragments to create a “gold standard synthetic immune system.”

Read more ≫