What is CRISPR screening?
CRISPR screening is a large-scale genetic loss-of-function experimental approach designed to find the equivalent of a few needles in a haystack. CRISPR screening facilitates discovery of key genes or genetic sequences that elicit a specific function or phenotype for a cell type (for a few examples, see Table 1). Like all good scientific experiments, CRISPR screening experiments are designed with a hypothesis in mind, but unlike many experiments, the hypothesis is not a narrow one. All CRISPR screening experiments have the broad hypothesis that there are a few genetic sequences or genes in the genome that have a certain physiological effect, and that these few genetic sequences can be identified. The result of a successful experiment is a short list of candidate genes or genetic sequences that appear to participate in producing the physiological effect under investigation. Therefore, CRISPR screening experiments not only start with a broad hypothesis, they end by generating new, narrow hypotheses. That is, each identified gene or genetic sequence usually needs to be investigated further using other biological methods to determine if it really produces the effect being studied.
CRISPR, as it is used by many researchers, is a method of making double-strand cuts at specifically targeted sites in DNA. When such cuts are produced in genomic DNA in cells, the cells use their DNA repair systems to mend the cuts. Commonly, the repair process is imprecise, and it results in mutations that knock out the targeted gene. This knockout event is the main result that scientists want for most CRISPR screening experiments . CRISPR is described in much more detail on our website.
Table 1. Examples of uses of CRISPR screening
|Identify genes or DNA sequences causing cells to be either resistant or sensitive to a drug
|Identify genes or DNA sequences affecting susceptibility to environmental toxins
|Identify components of a cellular pathway
|Identify genes or DNA sequences leading to a particular disease state
How does CRISPR screening work?
Most CRISPR screening is done in cell culture. A few papers have described CRISPR screening in animals, and this will be described in more detail below. However, the main idea is easier to understand in cells, so we’ll start our description here.
The basic idea of CRISPR screening is to knock out every gene that could be important, but knock out only one gene per cell (Figure 1). The intended result is a population of cells with a different gene knocked out in each cell in the dish. Some cells will die, but others will survive, or even grow better and become the predominant cell type. After the knockout cells are allowed to grow for a few days, next-generation sequencing (NGS) is performed on the entire mixed population of cells to determine which sequences are present and which are depleted or absent. Such an experiment identifies genetic sequences necessary for survival under normal conditions. However, another aim addressed in most CRISPR screening studies is the identification of genes that allow cells to survive under specific conditions, such as drug treatment or other physiological situations of interest.
Therefore, in most CRISPR screening experiments, there is a specific physiological situation that needs to be understood better. For example, a cancer cell line may be resistant to some drug. The drug kills other cells but not this cancer cell line. Scientists usually start with a list of thousands of genes or genetic sequences that might be involved in this drug resistance and aim to narrow this list down by CRISPR screening. Often, screens start with a list of all genes in the genome to make sure nothing is missed.
From the list of genes or genetic sequences, scientists then generate a long list of CRISPR targets. These targets are ~20-base DNA sequences located in the genome adjacent to sites known as protospacer-adjacent motifs (PAMs). For CRISPR screening, it is essential to knock out all the genes being studied. Therefore, to increase the probability of cutting, several target sites must be selected for every gene or genetic sequence being studied. Approximately 6–8 target sites per gene are recommended , although some researchers have had success targeting fewer sites per gene than this .
Figure 1. CRISPR screening using pooled DNA oligos. A pool of oligos is designed to target a massive number of genes. A library of lentiviruses is produced from the oligos and is used to infect cells. CRISPR genome editing knocks out different genes in different cells. Next-generation sequencing is used to determine which genes are present and which are absent. Genes for drug resistance or for drug sensitivity can be identified; negative screens determine genes conferring resistance, and positive screens determine genes conferring sensitivity. See text for details.
Control sequences: A properly designed CRISPR screening experiment should have numerous control sequences. These include the following:
- Negative controls: Negative control sequences should be designed not to have genetically- mediated physiological effects, so they can help identify any indirect confounding effects from the experimental materials and procedures. As few as 100 such control sequences have been used successfully ; however, other authors recommend using around 1000 control sequences . Negative control sequences often include the following:
- Non-targeting DNA sequences . These non-targeting control sequences can either be designed without any homology to the cell’s genome, or they can have homology to a sequence that does not have an identifiable PAM nearby . These controls should not cause cutting of the DNA and therefore should not have physiological effects.
- Controls that target regions of the genome not known to contain any genes . These “safe-targeting” control sequences may or may not truly be “safe,” as they may cause unexpected biological effects.
- Sequences targeting genes already known not to have any effect on the physiological response under study. This is another kind of “safe-targeting” control .
- Positive controls: These are highly recommended when available. For this, it is necessary to know something about the biological effect being studied . That is, if a specific gene is known to be involved in the biological effect, the pool should include several sequences targeting that gene.
Once the target and control sequences are identified, the next step is usually to design a pool of oligos which will be used to make lentiviruses. Some researchers instead use adeno-associated virus [12,13], but this is not common. Another approach, which does not pool the oligos, is described near the end of this article. For the commonly used pool approach, each oligo in the pool must contain DNA to encode at least the targeting region of the CRISPR guide, or frequently the entire single-guide RNA (sgRNA) including the target sequence . Each oligo also must have sites at each end to allow cloning into lentiviral gene-containing plasmids appropriately designed for biosafety [9,14]. From the plasmids, lentiviruses are produced as a pool containing thousands of CRISPR targeting sequences, with one targeting sequence per virus particle (virion).
What is a CRISPR library and how is it used?
The term “CRISPR library” is often used interchangeably with terms such as “CRISPR guide RNA library” or “sgRNA library.” In almost all cases, this “RNA library” is really the batch of lentiviruses produced from the pool of oligos! Therefore, such a library is not actually free RNA, even when people call it an “RNA library.” (There is one exception, described at the end of this article.) It is true that lentiviruses are RNA viruses, so each lentivirus in the library contains RNA, but this is not CRISPR guide RNA. This is viral RNA, which is much too long for a Cas enzyme to use in genome editing. When lentiviral libraries infect cells, CRISPR does not begin immediately, because there is no CRISPR guide RNA yet. Instead, lentiviral RNA is first reverse-transcribed to DNA, which is then integrated into the genome of infected cells . This integration can occur at any of several hundred sites in the cellular genome . These sites are not related to the CRISPR target sites, but this integration but may affect the function of the genes surrounding the insertion site.
In CRISPR lentiviral screening experiments, scientists aim to infect the cells with no more than one virion per cell. To make this practical, a very small ratio of virions to cells, or multiplicity of infection (MOI), is often used, in the range of 5–30% . Since each lentivirus in the library includes one sequence from the oligo pool, one such sequence is integrated as DNA into the genome of each of the infected cells. After integration, the lentiviral sequences, including the cloned-in CRISPR sequences, are transcribed to RNA. Thus, CRISPR guide RNA is eventually produced after the library of lentiviruses is used to infect cells .
The Cas enzyme
It is necessary for the cells to express a Cas (CRISPR-associated) enzyme for CRISPR screening to work. The CRISPR guide RNA guides the Cas protein to the target site. Then, the Cas enzyme cuts the DNA. There are several methods that scientists use to deliver a Cas enzyme into cells for CRISPR screening, including:
- Using a cell line that stably expresses a Cas enzyme. Such a cell line is often produced in advance using a separate lentivirus carrying the gene for the desired Cas enzyme, some time before starting the lentiviral CRISPR library screen .
- Producing a pool of lentiviruses that contain both the gene for the desired Cas enzyme and the DNA that codes for the guide RNA .
Why are lentiviruses used in CRISPR screening?
Lentiviruses are potentially dangerous, and appropriate safety measures must be taken , so why are they so popular for CRISPR screening? As mentioned above, lentiviruses stably integrate their DNA into mammalian genomes [7,15]. This feature is used in CRISPR screening as follows. When cells that are infected with lentivirus survive, they undergo integration of the lentiviral genome, including the ~20 CRISPR targeting bases that have been cloned into the lentiviral genome. The cells continue to carry these guide sequences and express the corresponding CRISPR RNA (crRNA) even after the genes of interest are knocked out. When the cells proliferate, successive generations of cells carry the lentiviral DNA coding for guide RNA. This is useful, because DNA or RNA from the cells can be sequenced, so the scientists can find out which DNA/RNA sequence each cell contains. The continued expression of this DNA or RNA—after some cells have had a chance to proliferate, while other cells have been killed—is what is measured in the CRISPR screen.
How many cells are needed?
As mentioned above, an MOI of under 30% is usually used to try to ensure that no cell is infected with more than one virion (and thus more than one CRISPR targeting sequence). This means that many more cells are needed than just the ones that are infected. In addition, it is essential to infect at least 500 cells per targeting sequence to be sure of obtaining results within the sensitivity of the assay . Therefore, for a CRISPR library that contains 100,000 targeting sequences, at least 1.67 x 108 cells are recommended . Because the library is pooled, the cells are screened in large dishes, not in multiwell plates .
After infection, what happens?
After treatment of the cells with the viral library of targeting sequences and the desired Cas enzyme, the cells must be incubated for a few days. This allows the cells time to develop phenotypic CRISPR-mediated changes—in many experiments, this means the cells either grow or die off (Figure 1). Then, drug or other treatment is performed if desired for the specific experiment. After this, both of the total genomic DNA samples or total RNA samples from the two mixed populations (i.e., the control cells and the drug-treated cells) are collected for sequencing. The DNA (or RNA) is subjected to sequencing by NGS. The two resulting lists of sequences (control vs. drug-treated) can be compared.
Negative vs. positive screens
Drug resistance and drug sensitivity are two of the major physiological responses that are frequently studied by CRISPR screening (Figure 1). Negative screens are used to find genes that cause drug resistance, and positive screens are used to find genes that cause drug sensitivity. These approaches are very similar, but the outcomes are different, as described below.
Negative Screens: Returning to the example of a cancer cell line that is resistant to a specific drug, how does screening work? If the resistant cancer cell line is first CRISPR-screened and then treated with the drug (which usually does not kill the cells, because they are resistant), some of those cancer cells will actually die in response to the drug. This suggests that resistance genes were knocked out. The control for this experiment would be cells targeted with the same lentiviral pool, but not treated with the drug. This is necessary because if essential genes are knocked out in some cells, those cells would die even without the drug. There will be many sequences detected from no-drug control cells, because most of these cells survive. However, for the experimental sample (drug-treated cells), the target sequences that knocked out the resistance genes will not be detected by sequencing. The missing sequences have a high probability of having targeted the drug resistance genes. This is called a negative screen (Figure 1). To confirm the findings of a negative CRISPR screen, other kinds of biological studies usually are done.
Positive Screens: Cell types that are sensitive to a particular drug are naturally killed by the drug. To identify the genes that confer sensitivity, scientists can use the same method described for resistant cancer cell lines. In this case, some of the guide RNAs in the screen may knock out genes that cause sensitivity to the drug. The unedited cells (which are the majority of the cells after the CRISPR screen but before the drug treatment, because the MOI is less than 30%) will be killed by the drug. Furthermore, most of the CRISPR-edited cells will also be killed, because most genes in the genome are not drug-sensitivity genes. Only a few cells, those with the drug-sensitivity genes knocked out, will grow. These cells proliferate far more than the other cells in the presence of the drug and may take over the population. Sequencing by NGS will show the presence of the DNA encoding the guide RNA that was used to delete the sensitivity gene. This DNA will be present in large amounts, because the cells carrying this DNA have proliferated since their sensitivity gene has been knocked out. This is called a positive screen (Figure 1).
Why is it important to target multiple sites per gene?
Targeting multiple sites per gene is important for at least two reasons:
- To increase the probability of generating a cut within the targeted gene (as described earlier), as not all CRISPR targets are cut with equally high efficiency.
- To provide confidence of a real result. If only one sequence that targets a gene is missing (in a negative screen) or overrepresented (in a positive screen) in the final NGS analysis, this might be due to:
- Random chance or imperfect experimental setup
- Lentiviral integration into an important gene, thus knocking out that gene even before CRISPR takes place
- Off-target effects
However, if the results of multiple targets in the same gene are consistent, it is likely that these results are real. This is strong evidence that a gene responsible for the physiological effect has been identified .
More applications of CRISPR screening
In addition to looking for genes that cause drug resistance or drug sensitivity in cells, CRISPR screening has been used in many other contexts . For example, it has been used to identify genes important in mitochondrial metabolism  and lysosome function . With a catalytically dead Cas9 enzyme in conjunction with transcriptional activators, CRISPR screening has been used to turn on expression of thousands of long non-coding RNAs (lncRNAs) to identify those involved in regulating drug resistance in melanoma cells . Other screening projects have used catalytically dead Cas9 both in combination with transcriptional activators to turn on expression of a large number of genes, as well as without transcriptional activators thus knocking down expression of many genes . Another approach to CRISPR screening includes using two different targeting sequences per lentivirus within a library, an approach that has a variety of applications ranging from use of Cas9 nickases in screening to deletion of numerous large segments of genomic DNA in large screening projects .
CRISPR screening in animals
It is impractical for most labs to implement large CRISPR screens in animals, as this could require large numbers of animals each carrying different genetic mutations for a single experiment. However, some CRISPR screening methods have been used in animals. For example, in one project, a mouse cancer cell line which normally does not metastasize was treated with a CRISPR library of over 67,000 lentiviruses. The cancer cells were then transplanted into mice. Tumors grew and metastasized. By sequencing the DNA in the metastases, the researchers identified several CRISPR-targeted genes. This showed that loss of function of these genes may cause tumor growth and metastasis . In another study, researchers used catalytically dead Cas9 fused to a cytidine deaminase base-editing protein system, which enzymatically alters the genomic DNA sequence by introducing G>A and C>T base substitutions without making double-strand breaks. They treated mouse embryonic stem cells with an sgRNA library, using their base-editing system to insert specific mutations at 77 targeted sites. The mouse embryonic stem cells were injected into the cytoplasm of mouse oocytes, which were then implanted into female mice. The mice gave birth to offspring with the targeted mutations. This screen identified four amino acid positions in one protein essential for production of primordial germ cells. This novel screening technology provides a method for researchers to investigate protein sequence and function relationships in vivo .
CRISPR screening without a pool, using arrayed crRNAs
Almost all CRISPR screening is done with a pool of lentiviruses carrying a large number of sequences to be targeted. In some instances, however, scientists may already have a short list of genes of interest (for example, all the genes in one signaling pathway) that they want to narrow down further. When such a short list is already available, a different approach can be taken (Figure 2). A small CRISPR library can consist of RNA, with each of the sequences to be investigated individually produced and kept separate in an array format in different wells of a multiwell plate. In this experimental design, each individual crRNA can be complexed with an appropriate Cas enzyme in each well. These ribonucleoprotein complexes can be transfected (often by electroporation) individually into cells in different wells of a multiwell plate. Thus, each well of cells is transfected with one predetermined CRISPR targeting sequence. Then, depending on the experimental setup, the wells can be monitored for a physiological effect (live/dead cells, expression of fluorescent protein, or some other readout). This approach eliminates the need for NGS but tends to be more labor-intensive and can be much more expensive if a large library is used. However, by removing the need for lentivirus, it is likely that fewer than 6–8 target sites per gene need to be selected, as there is no chance that lentiviral integration will disrupt an important genomic sequence. In such experiments, a smaller number, such as 4 target sites per gene, would probably be sufficient for many genes. However, even just one target site per gene can be sufficient if the target site is already well characterized and known to cause knockout . IDT offers predesigned CRISPR crRNA libraries and options for creating custom libraries. For more information, please contact CRISPR@idtdna.com.
Figure 2. CRISPR screening using arrayed crRNAs. Individual crRNA sequences targeting different genes are kept separate in different wells of a multiwell plate. These RNAs are combined with a Cas enzyme to form ribonucleoprotein (RNP) complexes. RNPs are then transfected by electroporation into cells in individual wells of other multiwell plates. CRISPR genome editing knocks out different genes in different wells of cells. Drug or other treatment can be performed as needed. Observation of changes in a known phenotype or response in different wells of cells is used to determine which genes are involved in the physiological effect being studied. See text for details.
CRISPR screens hold great promise for identifying genes and genetic sequences involved in many physiological pathways and pathological conditions. IDT offers oPools oligo pools (pooled DNA oligonucleotides) to facilitate your production of libraries of lentiviruses carrying sequences encoding CRISPR guide RNAs.