How do CRISPR-Cas systems target specific DNA sequences?

CRISPR-Cas systems are RNA-guided molecular machines that recognize and cut specific DNA sequences through a combination of sequence complementarity and short DNA motifs. The system evolved in bacteria and archaea as an adaptive immune response to viruses and plasmids, a discovery attributed to Rodolphe Barrangou at North Carolina State University who demonstrated spacer acquisition and immunity in 2007. The basic targeting mechanism depends on two interacting components: a guide RNA that specifies the sequence to be targeted and a Cas nuclease that binds the guide and executes cleavage.

How sequence recognition works

A short fragment of foreign DNA called a spacer is incorporated into the host CRISPR array during the adaptation phase. That spacer is transcribed and processed into a CRISPR RNA or crRNA, which base-pairs with a complementary sequence in invading DNA. In many widely used systems the crRNA is combined with a trans-activating RNA to form a single-guide RNA, an innovation leveraged in laboratory tools described by Jennifer Doudna at the University of California, Berkeley and Emmanuelle Charpentier at the Max Planck Unit for the Science of Pathogens. The guide RNA forms an R-loop by hybridizing to the target strand, exposing the non-complementary strand for nuclease attack.

Target binding also requires recognition of a short flanking motif called the protospacer adjacent motif or PAM. The PAM is not present in the organism’s own CRISPR array, so it serves as a discrimination signal that prevents self-cleavage. Different Cas proteins recognize different PAM sequences; for example, Streptococcus pyogenes Cas9 recognizes an NGG PAM. The presence of a correct PAM and extensive complementarity within a critical region of the guide called the seed sequence together determine binding affinity and specificity.

Cutting and consequences

Cas9 contains two nuclease domains that cleave each DNA strand, producing a double-strand break. Other nucleases such as Cas12 create staggered cuts and can exhibit collateral single-stranded DNA cleavage after target recognition. In genome editing applications pioneered by Feng Zhang at the Broad Institute of MIT and Harvard these breaks are repaired by the cell’s own machinery, notably non-homologous end joining which can introduce insertions or deletions, or homology-directed repair which can incorporate supplied templates. The outcome depends on cell type and repair pathway activity, and repair variability underlies both therapeutic promise and risks of unintended edits.

The biological relevance extends beyond molecular mechanics. CRISPR-based tools have transformed research and raised ethical and regulatory debates about clinical use, ecological releases such as gene drives, and equitable access. Indigenous communities and conservationists have voiced concerns about altering wild populations, and jurisdictions differ sharply in governance approaches. Understanding the precise targeting rules — PAM dependence, seed-region sensitivity, and nuclease behavior — is essential for minimizing off-target effects and for responsible deployment across medical, agricultural, and environmental contexts.