Customizing CRISPR-Cas PAM specificity with protein language models.

Nayfach, S., Bhatnagar, A., Novichkov, A., Kim, N., Hoffnagle, A. M., Hussain, R., Estevam, G. O., Hill, E., Ruffolo, J. A., Silverstein, R. A., Gallagher, J., Kleinstiver, B. P., Meeske, A. J., Cameron, P., & Madani, A. (2026). Customizing CRISPR-Cas PAM specificity with protein language models.. Nature Biotechnology.

Abstract

CRISPR-Cas enzymes must recognize a protospacer-adjacent motif (PAM) to edit a genomic site, greatly limiting the range of targetable sequences in a genome. Although engineering strategies to alter PAM specificity exist, they typically require labor-intensive, iterative experimentation. We introduce an evolution-informed deep learning model, Protein2PAM, to efficiently guide the design of Cas protein variants tailored to recognize specific PAMs. Trained on a dataset of over 45,000 CRISPR-Cas PAMs, Protein2PAM rapidly and accurately predicts PAM specificity directly from Cas proteins across type I, II and V CRISPR-Cas systems. Using in silico mutagenesis, the model identifies residues critical for PAM recognition in Cas9 without using structural information. We use Protein2PAM to computationally evolve Nme1Cas9, generating variants with broadened PAM recognition and up to a 50-fold increase in PAM cleavage rates compared to the wild type in vitro. Our machine learning approach allows Cas enzymes to target sequences that were previously inaccessible because of PAM constraints, potentially increasing target flexibility in personalized genome editing.

Last updated on 04/01/2026
PubMed