What this is
- The system is effective for gene editing but faces challenges with off-target effects, particularly with mismatches in the crRNA-DNA pairing.
- To enhance specificity, a hyper fidelity variant of AsCas12a, named HyperFi-As, was engineered by altering key amino acids.
- HyperFi-As shows comparable on-target activity to wild-type AsCas12a while significantly reducing off-target cleavage events.
Essence
- HyperFi-As, a modified AsCas12a variant, retains on-target editing efficiency while dramatically lowering off-target effects, particularly in regions distal to the .
Key takeaways
- HyperFi-As exhibits reduced off-target effects compared to wild-type AsCas12a, particularly in the -distal region, enhancing its utility for precise genome editing.
- Single-molecule assays reveal that HyperFi-As has lower stability in R-loop complex states with off-target DNA, explaining its reduced off-target activity.
- The engineering approach of modifying amino acid residues near the target DNA backbone effectively increases the fidelity of systems.
Caveats
- The study primarily focuses on engineered variants in vitro, and further validation in clinical settings is necessary to confirm safety and efficacy.
- While HyperFi-As shows improved specificity, the potential for off-target effects remains, particularly in complex genomic environments.
Definitions
- CRISPR-Cas12a: A gene-editing tool that uses RNA to direct a nuclease to specific DNA sequences for modification.
- PAM (protospacer-adjacent motif): A short DNA sequence adjacent to the target DNA sequence that is essential for the binding and cleavage activity of CRISPR nucleases.
AI simplified
Introduction
The clustered regularly interspaced short palindromic repeat (CRISPR)-associated protein (Cas) system allows for a wide range of applications for gene modification when guided by RNA and in the presence of protospacer-adjacent motif (PAM) sequence [1–8]. However, crRNA-DNA mismatches may also induce cleavage activity of Cas nucleases, resulting in undesired cleavage at off-target sites [9–16]. The off-target effects not only confound the reliability of research experiments in the lab but also have serious implications for clinical utility. So far, several strategies have been developed to improve the specificity of the Cas nucleases, including the end-point selection after the generation of random mutagenesis by the error-prone PCR [17], direct evolution [18,19], engineering of the sgRNAs (ggX20 sgRNAs and tru-sgRNAs) [20,21], delivery of Cas9-sgRNA ribonucleoprotein (RNP) complexes [22,23], usage of dimeric RNA-guided FokI Nucleases (RFNs) [24], and directional screening in human cells [25]. In particular, structure-guided protein engineering is a very useful method to design high-fidelity Cas-nuclease variants, which has been demonstrated in SpCas9, such as SpCas9-HF1 [15], HypaCas9 [26], eSpCas9 [27], HeFSpCas9 [28], SaCas9-HF [29], etc.
In addition to the commonly used type II-A Cas9, the type V-A system Cas12a can also cleave genome DNA efficiently in vivo. Unlike Cas9, Cas12a orthologs (As- (Acidaminococcus sp.) and Lb- (Lachnospiraceae bacterium) Cas12a nucleases) have several distinct features [5–7,9]. First, Cas12a recognizes the thymine-rich PAM sequence upstream of the target region and triggers the cleavage of target DNA in the PAM distal position, generating staggered ends [5]. Second, Cas12a can process its own CRISPR crRNA array into mature crRNAs to mediate gene editing at different genome sites simultaneously and does not require a trans-activating crRNA [6,7,30]. Moreover, Cas12a has low off-targeting in human cells due to low crRNA-DNA mismatch tolerance [9,31]. Therefore, Cas12a is considered to be suitable for multiplex gene editing and accurate genome modification.
Although Cas12a nuclease provides a powerful potential for genome engineering, specificity still needs to be improved for clinical application. In the previous studies of genome-wide specificities of CRISPR-AsCas12a nucleases in human cells, several cleavage events happen in the spacer region with a single mismatch base pair [9]. Several AsCas12a variants have been developed to decrease the indels event at the unwanted loci [32–34]. However, mismatches in the PAM-distal region were frequently observed in off-target sites and few strategies were developed to specifically solve the problem. To address the issue, we engineered a hyper fidelity AsCas12a variant with the mutations S186A/R301A/T315A/Q1014A/K414A (termed HyperFi-As) by modifying amino acid residues interacting with the backbone of the target DNA strand and crRNA strand in both the proximal and distal regions to the PAM through structure-guided protein engineering. Using the genome-wide unbiased identification of double-stranded breaks enabled by sequencing (GUIDE-seq) method, we demonstrated that HyperFi-As dramatically reduced the off-targeting in both the proximal and distal regions to the PAM and displayed better fidelity than AsCas12aWT, the recent versions of Cas12a (As ultra, As plus), and the very low off-target Cas9 variant (SuperFi-SpCas9) reported by Taylor etc. in 2022 which attracts great attentions [35]. Furthermore, the results from the new modified methods of single-molecule magnetic tweezers showed that HyperFi-As reduced the binding capacity and decreased the state of R-loop complex at mismatch sites without compromising on-target binding capacity. All these suggested that HyperFi-As could be a very promising accurate CRISPR-Cas genome-editing tool. The strategy that modifying residues in contact with the backbone of the distal end of the crRNA strand combined with mutating nonspecific DNA contacts is very useful for increasing Cas12a specificity.
Results
Structure-guided protein engineering for high-fidelity AsCas12a
Cas12a has been noted as a very prominent tool for gene editing with low off-targeting. Our previous work identified CeCas12a and other variants with stringent PAM recognition to further lower the off-targeting [36,37]. However, the relatively high off-targeting still existed in the distal end of the PAM. To further investigate the off-targeting in the proximal and distal end of the PAM, AsCas12a was selected since its high-resolution structure has been solved [38]. Firstly, 4 amino acid residues (S186, R301, T315, and Q1014) with direct hydrogen bonds to the phosphate backbone of the target DNA strand within a 3.0-Å distance at the proximal end of PAM were identified (S1 Fig). Among these 4 residues, both R301A and T315A exhibited a tendency for reduced cleavage activity at several mismatched sites compared to AsCas12a WT [39]. To study the effects of these mutations systematically, we constructed 4 single amino acid mutants by alanine substitution (S186A/R301A/T315A/Q1014A) and generated all possible double, triple, and quadruple versions by combining these mutations. Then, we employed a previously reported human cell-based enhanced GFP (EGFP) disruption assay to test their off-targeting [36]. The result showed that all 14 variants retained comparable on-target activities to AsCas12aWT when using the fully matched EGFP crRNA (Fig 1A). Next, we sought to evaluate the performance of these variants toward the imperfect crRNA-DNA matching. To do this, we repeated the EGFP disruption assay by combining all variants with EGFP-crRNA expression plasmids that contain 2 substituted bases at positions ranging from 15 to 23 (Fig 1A). We found that the nuclease activity of all triply substituted variants reduced dramatically when 2 adjacent crRNA-DNA base pair mismatches were located at the position from 15 to 19 (reduction ranges from 14.1% to 82.5%) (Fig 1A). Furthermore, the quadruple substitution variant (S186A/R301A/T315A/Q1014A, termed AsCas12a4m) presented the reduced cleavage efficiency ranging from 20.9% to 92.6% compared to AsCas12aWT (Fig 1A).

Identification and characterization of AsCas12a variants bearing substitutions in residues forming nonspecific contacts with target DNA. () Characterization of AsCas12a variants containing alanine substitutions in residues that form hydrogen bonds with the target DNA. Assessment of wild-type AsCas12a and variants by using EGFP disruption assay when paired with fully matched crRNA or partially mismatched crRNAs (mean ± SD;= 3). () Average activity of wild-type AsCas12a and mutant variants on targets with optimal PAM TTTV (V = A; G; C) sequence. () Activities of wild-type AsCas12a and AsCas12a4m on targets with TTTV PAM sequence across 15 endogenous targets measured by T7 endonuclease I assay (mean ± SD;= 3). () Summary of on-target modifications from (), with means and 95% confidence intervals shown. The data underlying this figure can be found in. EGFP, enhanced GFP; PAM, protospacer-adjacent motif. A B C D C n n S1 Data
AsCas12a4m retains high on-target activities in human cells
Modifying amino acid residues in close contact with the target DNA strand or non-target DNA strand might have various impacts on the cleavage activity at the target site [28]. To systematically determine the activity of AsCas12a4m at endogenous chromosomal sites, we compared the activity of AsCas12aWT and all substitution variants (including AsCas12a4m) at 16 endogenous sites using T7E1 (T7 Endonuclease I) assay in 293T cells (Figs 1B, 1C, S1 and S2). We found all variants retained highly comparable activities (90% to 115%) to AsCas12aWT at TTTV (V = A/G/C) PAM sites (Figs 1B and S1), and AsCas12a4m showed at least 90% of the on-target cleavage efficiencies observed with AsCas12aWT at the same sites (Figs 1C, 1D and S2). Based on the results, we selected the AsCas12a4m for the subsequent experiment.
crRNA-DNA mismatch tolerance of AsCas12a4m
For further investigating crRNA-DNA mismatch tolerance of AsCas12a4m in human cells, we arbitrarily selected 2 endogenous target sites (CFTR and B2M) and constructed a series of plasmids encoding crRNAs with 1 or 2 mismatches (Fig 2A and 2B). Firstly, we tested the activity of AsCas12aWT and AsCas12a4m guided by crRNAs with single mismatches along the protospacer complementarity region. For the CFTR site, AsCas12aWT showed 5.7% to 68.5% cleavage efficiency at mismatch positions 2, 3, 4, 6, 7, 9, 10, 11, and 13–23 (Figs 2A and S3). For the B2M site, AsCas12aWT presented 6.9% to 38.1% cleavage efficiency at mismatch positions 1, 8, 9, 10, 11, 13–16, and 18–23 (Figs 2B and S4), and did not tolerate single mismatches at positions 2–7 (Figs 2B and S4). Not surprisingly, the results demonstrated AsCas12aWT could mediate DNA cleavage with imperfect crRNA-DNA matching at positions 1 through 23, which were consistent with the previous studies [31]. However, for AsCas12a4m, no tolerance was detected for any single mismatches at positions 1–19 (Fig 2A and 2B). We also tested the editing of AsCas12aWT and AsCas12a4m at CFTR and B2M sites by using crRNAs with 2 adjacent mismatches (Fig 2A and 2B). The analysis showed that AsCas12aWT and AsCas12a4m processed nearly undetectable cleavage activities with 2 adjacent mismatches through positions 1–18 for both CFTR and B2M sites (except for AsCas12aWT at B2M 9 and 10 mismatch) (Figs 2A, S3 and S4). By contrast, mismatches (single mismatch or 2 adjacent mismatches) at positions 20–23 did not substantially reduce the cleavage activities for both AsCas12aWT and AsCas12a4m (Fig 2A and 2B). Overall, AsCas12a4m exhibited high sensitivity to crRNA-DNA mismatches in both CFTR and B2M sites especially for the close end region of PAM. The results suggested that AsCas12a4m may enhance the fidelity of gene editing.

Tolerance of AsCas12aWT and AsCas12a4m variant to mismatched crRNAs and genome-wide specificities of AsCas12aWT and AsCas12a4m variant with matched crRNAs targeting endogenous sites. (,) Indels induced by AsCas12aWT and AsCas12a4m using crRNAs that contain singly mismatched bases or pairs of mismatched bases toward() and(). Activity determined by T7 endonuclease I assay. Error bars represent SEM for= 3. () Summary of the total number of off-target sites identified by GUIDE-seq for AsCas12aWT and AsCas12a4m with crRNAs targeting 7 endogenous sites. () Percentage of reads detected by GUIDE-seq at the on-target site and off-target sites (ordered by number of mismatches, 0 represented on-target) among total detected reads by AsCas12aWT, AsCas12a4m (top), and numbers of genome-wide off-target sites (bottom). The data underlying this figure can be found in. GUIDE-seq, genome-wide unbiased identification of double-stranded breaks enabled by sequencing. A B A B C D CFTR B2M n S1 Data
Genome-wide targeting specificity of AsCas12a4m
To evaluate the fidelity of AsCas12a4m at endogenous sites in human cells, we used the GUIDE-seq method to assess the genome-wide specificity of AsCas12aWT and AsCas12a4m with 6 different crRNAs targeted to PD1, B2M, HPRT, DNMT1, CLIC4, and NLRC4. To test whether the double-stranded oligodeoxynucleotide (dsODN) tag was integrated into the on-target break site, we performed PCR reactions initiated by 1 primer that specifically annealed to the dsODN and another primer that annealed to the upstream or downstream of on-target break sites (S5 Fig). The dsODN tag was successfully integrated into double-strand breaks (DSBs) induced by AsCas12aWT and AsCas12a4m (S5 Fig). The GUIDE-seq analysis revealed that crRNAs targeting PD1 and B2M generated no unwanted mutations for AsCas12aWT and AsCas12a4m variant. crRNAs targeting HPRT, DNMT1, CLIC4, and NLRC4 generated 1, 10, 4, and 4 off-target sites for AsCas12aWT, respectively, while 0, 1, 1, and 2 off-target sites for AsCas12a4m, respectively (Figs 2C, 2D and S6). GUIDE-seq analysis of AsCas12a4m showed that the number of off-target sites decreased by lower than 3, the on-to off-target read ratio (range 2% to 25%) was improved, and the number of on-target reads still retained, compared to AsCas12aWT (Figs 2D and S6). To validate the off-target sites identified by GUIDE-seq, we confirmed the occurrence of indels using deep sequencing (S7 Fig). All results demonstrated that AsCas12a4m possesses higher fidelity than AsCas12aWT.
Improved specificity of AsRVR and enAsHF1
AsRVR and enAsHF1 have been reported to have a broader targeting range than AsCas12aWT [32,39], while altered PAM preferences might induce extra off-targets at the noncanonical PAM sites [36,40]. We sought to graft AsCas12a4m mutations onto AsRVR and enAsHF1 (AsRVR-4m, enAsHF1-4m) and tested whether these alterations would improve targeting specificity. First, we evaluated the editing activities of AsRVR-4m and enAsHF1-4m in endogenous human genes, and 15 endogenous genomic sites harboring TTTV, TTCV, TATG, TATC, GTTA, CTTA, and GTCA PAMs were selected for T7E1 assay (Figs 3A, 3B and S8). As a result, these mutations did not affect the cleavage activity at all of the tested sites (Fig 3A). Next, we performed crRNA-DNA tolerance assay, according to the previous method (Fig 3C and 3D). Indeed, AsRVR and enAsHF1 exhibited nuclease activity toward crRNA-DNA single mismatch (Figs 3C, 3D and S9), and AsRVR showed more sensitivity to crRNA-DNA mismatch at these sites, compared to enAsHF1 (Figs 3C, 3D and S9). By contrast, AsRVR-4m and enAsHF1-4m caused a significant decrease in the single mismatch cleavage, which was consistent with the above results (Figs 2A, 2B, 3C, 3D and S9).
To globally assess the editing specificity of AsRVR-4m and enAsHF1-4m, we performed GUIDE-seq analysis on 3 endogenous sites that were well studied previously [36,40]. As a result, AsRVR-4m has a dramatically lower number of off-target sites at HEK293 site 1, POLQ1, and POLQ2 (9, 0, and 18, respectively), compared with AsRVR (47, 12, and 77, respectively). Similarly, the number of off-target sites of enAsHF1-4m at the 3 sites (8, 1, and 22) were lower than those of enAsHF1 (36, 7, and 55, respectively) (Figs 3E and S10). On- to off-target read ratios of AsRVR-4m and enAsHF1-4m at HEK293 site 1, POLQ1, and POLQ2 significantly increased (57.3%, 100%, and 19.7% for AsRVR-4m and 29.8%, 90.5%, and 19.1% for enAsHF1), compared with AsRVR and enAsHF1(9.6%, 45.6%, 5.8% and 7.5%, 43.6%, 13.3%, respectively) (Fig 3E–3I). We performed deep sequencing to validate the off-targets sites identified by GUIDE-seq (S11 Fig). As expected, the 4m variant has significantly improved the specificity and dramatically reduced the off-target index (0.67 and 0.65 for AsRVR and enAsHF1, for at canonical TTTV PAM, 0.83, and 0.72 for AsRVR and enAsHF1 at noncanonical PAM) (Fig 3G and 3I).

Refining the specificity of AsRVR and enAsHF1. (,) Mean on-target percent modification for wild-type AsRVR and AsRVR-4m, enAsHF1 and enAsHF1-4m with crRNAs across sites encoding TTTV PAMs (), TTCV PAMs, TATV PAMs, GTTA PAM, CTTA PAM, GTCA PAM (). (,) Indels induced by AsRVR and AsRVR-4m (), enAsHF1 and enAsHF1-4m () using crRNAs that contain singly mismatched bases toward. Activity determined by T7 endonuclease I assay. Error bars represent SEM for= 3. () Percentage of reads detected by GUIDE-seq at the on-target site and off-target sites (ordered by number of mismatches, 0 represented on-target) among total detected reads by AsRVR and AsRVR-4m, enAsHF1 and enAsHF1-4m (top) and numbers of genome-wide off-target sites (bottom). () Summary of the total number of off-target sites with TTTN PAMs identified by GUIDE-seq for AsRVR and AsRVR-4m, enAsHF1 and enAsHF1-4m () and normalization of off-target effect at TTTN PAMs (value was calculated by the ratio of total off-target sites for AsRVR-4m, enAsHF1-4m to the total off-target sites for AsRVR, enAsHF1 within the detected sites) (). () Summary of the total number of off-target sites with base substitutions to the WT-related TTTN PAMs identified by GUIDE-seq for AsRVR and AsRVR-4m, enAsHF1 and enAsHF1-4m () and normalization of off-target effect at these sites (). The data underlying this figure can be found in. GUIDE-seq, genome-wide unbiased identification of double-stranded breaks enabled by sequencing; PAM, protospacer-adjacent motif. A B A B C D C D E F, G F G H, I H I CFTR n S1 Data
Effects of residues contacting with the 3′ end of crRNA
The cleavage activity of AsCas12aWT nuclease was reported to be nearly unaffected by the mismatches in the PAM distal bases between 18 and 23 [9,31]. We also confirmed it by the T7E1 assay (Fig 2A and 2B). To systematically evaluate the effect of the PAM distal bases, we constructed crRNA expression plasmids containing all possible pairs of the mismatched bases between 14 and 23 within the complementarity regions of crRNAs targeted EGFP and repeated EGFP disruption assay. In line with our former experiments (Fig 2A and 2B), the AsCas12a4m variant showed the high sensitivity to mismatched crRNA nucleotides in most places (Fig 4A). However, both AsCas12aWT and AsCas12a4m could mediate enough EGFP disruption when mismatched at the PAM distal positions (Fig 4A). GUIDE-seq analysis for AsCas12aWT and AsCas12a4m were further performed at another well-studied endogenous sites RPL32P3 in 293T cells (Figs 4B and S12). To confirm these GUIDE-seq findings, we used T7E1 assay to assess the frequencies of indel mutations induced by AsCas12aWT and AsCas12a4m at 4 off-target sites (Figs 4C and S12). In detail, at the off-target sites 3 and 4, the dsDNA cleavage events were 15% and 17% for AsCas12aWT, respectively, and nearly not detected for AsCas12a4m (Fig 4C). However, both AsCas12aWT and AsCas12a4m showed enough cleavage events at the off-target sites 1 and 2 (23%, 31%, and 20%, 16% cleavage efficiency for AsCas12aWT and AsCas12a4m at off-target sites 1 and 2, respectively) (Fig 4C). The results indicated that: (1) AsCas12a4m-induced dsDNA breaks with high fidelity; and (2) AsCas12a4m was not sensitive to mismatches positions in the PAM distal regions.
To further improve the fidelity in the PAM distal regions, it was initially hypothesized that the unwanted off-targeting with mismatch positions in the PAM distal regions might be minimized by decreasing nonspecific interactions with the 3′ ends of the guide RNA and the 5′ ends of the target DNA. Based on the crystal structure of AsCas12a in complex with crRNA and target DNA [38], we selected 3 residues (K414, Q286, and W382) and 1 residue (S376) that made direct contacts to 3′ ends of the guide RNA and the 5′ end of the target DNA for alanine substitution (Figs 5A and S13A–S13C). Then, the on-target activity of 4 mutants over 6 target sites with TTTV PAM sequence were evaluated. As shown in Fig 5A, single residue substitutions, except for the W382A hardly perturbed the editing efficiencies at target sites. To evaluate the performance of all possible mutant versions, we performed the T7E1 assay. On-target activity and 4 off-targets activities of all mutant versions were evaluated at the RPL32P3 target site in 293T cells. For the on-target site, we observed that the indels caused by AsCas12a4m-K414A, AsCas12a4m-S376A, and AsCas12a4m-Q286A mutant were about 32%, 35%, and 31%, respectively, compared to AsCas12aWT, 35% and AsCas12a4m, 33% (Figs 5B and S13D). By contrast, other combinations dramatically decreased the activity (Figs 5B and S13E). In addition, deep sequencing analysis showed that AsCas12a4m-K414A keep highly comparable activities (80% to 100%) to AsCas12aWT (S14 Fig). Indels events did not happen for AsCas12aWT and all As mutants at the off-targeting sites, except AsCas12aWT at the off-target 3 and 4. Furthermore, AsCas12a4m-K414A (termed HyperFi-As) and AsCas12a4m-Q286A were revealed to have the lower off-target rates at the off target 1 and 2 (Figs 5B and S13D). Although further proof of GUIDE-seq analysis indicated that HyperFi-As increased the off- to on-target read ratio at off-target 1, the number of total off-target sites decreased by 13 compared to AsCas12a4m-Q286A (Figs 5C, 5D, S12C, and S12D). We further selected SIPRa, a well-studied target site to perform GUIDE-seq analysis for specificity assessment [41]. The result demonstrated that targeting SIPRa showed off-target activity for HyperFi-As at 3 loci, which was less in number than 49, 9, and 7 for AsCas12aWT, AsCas12a4m, and AsCas12a4m-Q286A, respectively (Fig 5E and 5F). In addition, statistics of the GUIDE-seq reads revealed that HyperFi-As possessed the lowest percentage of off-targets with mismatches in the PAM distal region among As, AsCas12a4m, HyperFi-As, and AsCas12a4m-Q286A (Fig 5F).

Performance of AsCas12aWT and AsCas12a4m at PAM-distal mismatches. () Cleavage profiles, assessed by EGFP disruption assay for AsCas12aWT and AsCas12a4m at all possible mismatch pairs of bases within the complementarity regions of crRNAs targetedfrom 14 to 23. The EGFP disruption (%) are the mean of 3 replicates. () The number of potential off-target loci identified by GUIDE-seq analysis for AsCas12aWT and AsCas12a4m atsite. () Validation of AsCas12aWT and AsCas12a4m performance at 4 off-targets that identified by GUIDE-seq in panel (), percent modification of on-target and 4 off-targets of AsCas12aWT and AsCas12a4m assessed by T7E1 assay (from). Error bars represent SEM for= 3. The data underlying this figure can be found in. EGFP, enhanced GFP; GUIDE-seq, genome-wide unbiased identification of double-stranded breaks enabled by sequencing; PAM, protospacer-adjacent motif. A B C B EGFP RPL32P3 n S12 Fig S1 Data

Generation of high-fidelity derivatives of AsCas12a4m by weakening nonspecific crRNA contacts. () Characterization of high-fidelity derivatives of AsCas12a4m containing alanine substitutions in residues that form hydrogen bonds with the PAM-distal region (3′ end of the guide RNA and 5′ end of the target DNA). Assessment of all variants by using T7E1 assay when paired with 6 crRNAs that targetedandloci. () Cleavage profiles, assessed by T7E1 assay for As variants at on-target and 4 off-targets. The percent modification (%) are the mean of 3 replicates. () The number of potential off-target loci identified by GUIDE-seq analysis for As variants atsite. The statistical results for AsCas12aWT and AsCas12a4m are derived from(see). () Specificity ratios of AsCas12aWT, AsCas12a4m, HyperFi-As, and AsCas12a4m+Q, plotted as the ratio of 4 off-target sites GUIDE-seq read counts to on-target. () Comparative analysis of AsCas12aWT, AsCas12a4m, HyperFi-As, and AsCas12a4m+Q with crRNA targetingloci using GUIDE-seq. () Percentage of edited reads detected by GUIDE-seq at on-target site and mismatched sites among total edited reads and the numbers of off-target sites for each AsCas12a. () Comparative analysis of AsCas12aWT, HyperFi-As, As ultra, and As plus with crRNA targetingloci using GUIDE-seq, percentage of reads detected by GUIDE-seq at the on-target site and off-target sites (ordered by number of mismatches, 0 represented on-target) among total detected reads by As variants (top) and numbers of genome-wide off-target sites (bottom). () Genome-wide specificities of AsCas12a and SpCas9 variants. Summary of the total number of genome-wide off-target cleavage sites identified by GUIDE-seq for AsCas12a and SpCas9 variants with gRNAs targeted to nonstandard, repetitive sites. The data underlying this figure can be found in. GUIDE-seq, genome-wide unbiased identification of double-stranded breaks enabled by sequencing; PAM, protospacer-adjacent motif. A B C D E F G H DNMT1 VEGFA RPL32P3 SIPRa PPP1R13L Fig 4B S12 Fig S1 Data
Systematical comparison of AsCas12a variants and SpCas9 variants
To systematically compare the specificity of HyperFi-As with other recent high-fidelity AsCas12a variants and SpCas9 variants, we comprehensively assessed these variants’ capability to reduce genome-wide off-target effects of gRNAs designed against target sites with a large number of homopolymeric sequences in human cells. For the PPP1R13L site, HyperFi-As exhibited a reduced number of off-target sites compared to As variants, and the percentage of on-target was also the highest among variants (Figs 5G and S15). To further confirm it and compare the genome-wide specificities of CRISPR-Cas nucleases including AsCas12aWT, As ultra, As plus, SpCas9, and SuperFi-SpCas9, we repeated GUIDE-seq with 5 gRNAs targeting sites containing overlapping sequences (HEK293 site 2/3/4/5/6) (Figs 5H and S16–S20). For 5 sites, the high-fidelity variants such as As plus and SuperFi-SpCas9 exhibited improved specificity (Fig 5H), which was consistent with previous reports [33,35,42,43]. However, no on-target gene editing events were detected at the HEK293 site 2/4/6 for SuperFi-SpCas9 (S16, S18 and S20 Figs). Notably, HyperFi-As showed the best specificity among the 6 tested variants of AsCas12a and SpCas9 (Figs 5H and S15–S20).
Binding stability and multiple state characteristics of HyperFi-As
To explore whether HyperFi-As has better specificity to distinguish the mutant DNA sequence and thus possibly decreases the off-target efficiency, we compared the stability of DNA-Cas12a-crRNA complexes that containing the HyperFi-As and AsCas12aWT on various mutant DNA sequences using single-molecule DNA unzipping experiments, according to the previous study for determining the stability of DNA-Cas12a-crRNA [44]. However, a proper constant force was used instead for sensitive detection. In the end, the life-time of the intermediate states during the disassembly sensitively detects multiple states by this modified new way.
We constructed a series of 34-bp DNA hairpins containing the PAM and various target/off-target sequences (Figs 6A and S21A) as the substrates. We stretched the hairpin between a paramagnetic magnetic bead and a glass slide through 2 dsDNA handles. In the force-increasing scan at the loading rate of 1 pN/s, the naked DNA hairpin unzipped at approximately 18 pN, while a high force of approximately 30 pN was required to completely unzip the DNA-Cas12a-crRNA complex (S21B Fig). Note that we used the dead mutants of AsCas12a without cleavage activity and the DNA hairpins were protected.
To determine the stability and possible multiple states of each kind of DNA-Cas12a-crRNA complex, we elevated the force from 5 pN to 26 pN and held the force at 26 pN for 120 s (Fig 6B). The measurements were repeated at least 20 times for each molecule, and the results from multiple molecules were yielded. The transient R-loop complex state as the first of 3 states was sensitively detected and 2 steps was illustrated to completely unzip the hairpin that bound crRNA-Cas12a (Fig 6B). Based on the increases in DNA extension, the first step represents the dissociation of the non-target sequence from the R-loop complex, and the second step represents the unzipping of PAM (Fig 6C). The difference between states 1, 2, and 3 is the length of the non-target sequence bound to Cas12a. In state 1, the Cas12a-crRNA-DNA complex forms a tight binding, encompassing both the target sequence and the non-target sequence. State 2 represents a partial dissociation of the Cas12a-crRNA-DNA complex, with the non-target strand upstream of the PAM being released from the R-loop complex. In state 3, the hairpin downstream of the PAM, including the PAM sequence, unfolds under force and is released from the Cas12a-crRNA complex. The transition from state 1 to state 2 indicates the non-target strand upstream of the PAM moving away from the R-loop complex while the transition from state 2 to state 3 indicates the unfolding of the hairpin strand downstream of the PAM including the PAM sequence. The stabilities of the complexes that had the same crRNA but with 3 different DNA sequences (the WT target and 2 real off-target DNA subtracts) were compared. When bound to the on-target, the instability of DNA-Cas12a-crRNA were similar for both HyperFi-As and AsCas12aWT. However, when bound to the off-target, the instability was more affected and increased a lot for HyperFi-As (Fig 6D), which was consistent with the result that HyperFi-As had the similar gene editing efficiency at on-target sites but a lower tolerance for off-target sites than AsCas12aWT. Detailed analysis of the unzipping assay revealed that HyperFi-As was largely less distributed at the first state when bound to the off-target sites (Fig 6E), which might suggest more dissociations of the non-target sequence from the R-loop complex for HyperFi-As. All these shed important insights on how HyperFi-As could behave in the single-molecule level and lead to low off-targeting.

Single-molecule assay for detection of cas12a complex stability. () Sketch of the representation of the experimental setup. A DNA hairpin containing target sequence was attached between the functionalized coverslips and a magnetic bead. As the disassociation of crRNA-Cas12a complex from the target DNA, the extension of DNA molecular stretches with the complete unzip of the hairpin. (Representative extension trace from the DNA hairpin in the presence of the dCas12a–crRNA complex. States 1, 2, 3 mean the different extension state of the DNA hairpin. () Cartoon of Cas12a complex in different states. () Histogram of normalized unbinding events probability on truly mismatch DNA target. () Histogram of times in different states on truly mismatch DNA target. The data underlying this figure can be found in. A B) C D E S1 Data
Discussion
Off-target effects of CRISPR-Cas gene-editing tools pose a challenge for therapeutic applications. Although AsCas12a has been shown to be an innately highly specific enzyme, it still causes off-target effects in mammalian cells [31,33,36]. Off-target effects can occur at levels 10 or even 100 times lower than the target site, but these small DNA breaks can cause chromosome breaks and thus chromosomal translocations [37]. Thus, the less off-target we can get, the more safety we can have in terms of therapies. Genome-wide unbiased identification of DSBs by sequencing (GUIDE-seq) is one of the most popular methods to detect real cleavage sites in living cells after genome editing [13]. It is straightforward, has good sensitivity, and reflects the real situations in living cells across the entire genome unbiased. It is therefore well accepted and widely used in many off-targeting studies for Cas9 and Cas12a systems [45–47]. Using GUIDE-seq method, we and other researchers have shown that gene editing in mammalian cells using the Cas12a system causes unwanted mutations, although less so compared to the Cas9 system [9,11,12,31,36,48,49]. Using an in vivo EGFP disruption assay, we further clarified the off-target effect of AsCas12aWT in the presence of double base pair mismatches. As mentioned above, we observed significant fluorescence reduction when AsCas12aWT was guided by certain mismatched crRNA, particularly when mismatches were located at the distal end of the PAM sequence. To minimize these off-target effects of AsCas12aWT, we engineered a novel AsCas12a variant termed HyperFi-As that shows high gene editing accuracy without compromising on-target efficiency.
A strategy of decreasing nonspecific interactions with its target DNA strand has been successful for high-fidelity Cas9 [15,29]. Since Cas12a is relatively high in fidelity, it was unclear if the strategy could work well for Cas12a in further improving the fidelity. Thus 4 key residues (S186, R301, T315, and Q1014) was identified by analyzing the crystal structure of the AsCas12a-crRNA-target DNA complex. All of them made direct hydrogen bonds to the phosphate backbone of the target DNA strand within a 3.0-Å distance to provide nonspecific contacts. We constructed single amino acid mutant by alanine substitution (S186A/R301A/T315A/Q1014A) and generated a range of possible combinations and demonstrated that the quadruple substitution variant (termed AsCas12a4m) has the lowest mismatch tolerance by both EGFP disruption assay as well as single mismatch and double mismatch cleavage experiments on endogenous genes. As expected, AsCas12a4m showed a decrease in cleavage activity compared to AsCas12aWT at off-target sites, both in terms of the number of off-target sites and in terms of read counts at the sites, as validated by GUIDE-seq analysis across several endogenous loci.
Recent studies revealed that the alteration of contacts between PAM proximal DNA and amino acid residues in the PI domain of AsCas12a can expand the range of PAM recognition [32,39]. For instance, AsCas12a variants AsCas12a-RVR and enAsCas12a-HF1 modify DNA efficiently at TATV, and TTYN (TTTN/TTCN), VTTV (ATTV/CTTV/GTTV), TRTV (TATV/TGTV) noncanonical PAMs, respectively. However, our previous report indicated that the noncanonical PAM recognition by Cas12a might induce extra off-target edits [36,37]. And the side effect was also demonstrated by other studies [36,40]. In this work, we successfully grafted mutations from AsCas12a4m onto AsRVR and enAsCas12a-HF1 and engineered alternative high-fidelity AsRVR-4m and enAsHF1-4m, which preserved the recognition of nonclassical PAMs.
EGFP disruption assay and cleavage experiments on endogenous sites suggested that one-base mismatches or two-base mismatches at the position of the PAM-distal region were highly tolerant. And truncation of the 3′ ends of the guide RNA could not decrease the undesired mutagenesis at some mismatch sites or at least, so the approach limited, as most off-target sites harbor mismatches in the PAM-distal 3′ end [9,50]. Although AsCas12a4m exhibits low tolerance for mismatches between crRNA and target DNA in other regions, it remains insensitive to mismatches located at the termed promiscuous region (positions 19–23). This observation can be explained by the fact that S186 and Q1014/T315/ R301 form hydrogen bonds with the DNA phosphate backbone of the seed region (positions 1–6) and trunk region (positions 7–18), respectively. We speculated that disruption of nonspecific interactions in the promiscuous regions may also contribute to fidelity. Based on the hypothesis, we finally identified that K414A mutation led to an enhancement of specificity in the PAM-distal end and conferred a higher fidelity to AsCas12a4m by altering the contact with the distal 3′ end of crRNA (termed HyperFi-As). In addition, through the novel single-molecule DNA unzipping experiments demonstrating the binding of DNA-Cas12a-crRNA complex, HyperFi-As mutant significantly reduced the stability of the complexes and decreased the R-loop state when containing the off-target DNA substrate. The new modified single-molecule unzipping assay capable of detecting different state distributions on the disassembly of DNA-Cas12a-crRNA complexes could be widely used in future various studies and could be very helpful for deep insights of how CRISPR/Cas stepwisely heaved and guiding various engineering.
In general, the longer the Cas enzyme and crRNA complex are present in the cell, the greater the likelihood of off-targeting. With virus or plasmid-based delivery, the cellular genome is exposed to the Cas enzyme and crRNA complex for an extended period, which results in an increased risk of off-targeting [51]. Genome editing using Cas12a with RNP or mRNA delivery has been successfully applied to limit the duration of exposure of CRISPR/Cas systems in cells [52,53]. Therefore, introducing the HyperFi-AsCas12a system into cells via RNP or mRNA may further reduce off-target cleavage events.
Materials and methods
Plasmids construction
All plasmids and guide RNAs used in this study can be found in S1 Table. AsCas12a human expression plasmid pY010 was purchased from the nonprofit plasmid repository Addgene (Addgene plasmids #69982). All AsCas12a variants were generated by standard site-directed mutagenesis. In brief, using AsCas12a plasmid pY010 as the template and a pair of primers that one primer carrying site mutation nucleotide to amplify 500 bp fragments by PCR (Phanta MAX Super-Fidelity DNA Polymerase P505, Vazyme). After confirming that the bands were correct by agarose gel electrophoresis, the PCR products were purified. Next, taking 1,000 ng PCR products as the circling mutation primers and 100 ng AsCas12a plasmid pY010 as template to carry out PCR again. PCR products were purified and use DpnI to digest the original template at 37°C for 1 h, inactivated at 80°C for 20 min, take 10 μl digested products for transformation. The next day, single colonies were sent for sequencing. Oligonucleotide duplexes corresponding to spacer sequences were PCR amplified and cloned into pU6-As-crRNA plasmids (Addgene plasmids #78956) for U6 promoter-driven U6 promoter-driven transcription of As crRNAs (ClonExpress II One Step Cloning Kit C112, Vazyme).
Cell culture and transfection
The HEK293T cells maintained in DMEM medium supplemented with 10% fetal bovine serum and 100 units mL−1 penicillin, 100 μg mL−1 streptomycin sulfate (all cell culture products were obtained from Gibco) at 37°C in 5% CO2. For plasmid-based genome editing, approximately 1.2 × 105 cells were seeded into per well of 24-well plate a day before transfection, 600 ng Cas12a and 300 ng crRNA expression plasmids were transfected into cells using Hieff Trans Liposomal Transfection Reagent (CAT:40802ES03, Yeasen, Shanghai).
EGFP disruption
For EGFP disruption analysis in HEK293T cells, approximately 2.4 × 104 cells were seeded into per well of a 96-well plate a day before transfection, a total of 100 ng of Cas12a plasmid, 30 ng crRNA expression plasmids, and 30 ng EGFP expression plasmids were transfected into HEK293T cells. A U6 promoter-driven empty plasmid for the substitution of crRNA expression plasmid as a negative control, and 48 h post-transfection, cells were analyzed on the CytoFLEX (Beckman Coulter).
Assessment of gene editing by T7E1
Approximately 48 h post-transfection, cells were collected by centrifugation and the supernatants were removed. The 50 μl Lysis buffer and 0.5 μl Proteases were mixed with cells and incubated at 55°C for 30 min, 95°C for 30 min (Animal Tissue Lysis Component, CAT: 19698ES70, Yeasen, Shanghai). The genomic region flanking the CRISPR target site for each gene was amplified by PCR with Phanta MAX Super-Fidelity DNA Polymerase P505 (Vazyme) using 1 μl cell lysis as template and the primers listed in S1 Table. PCR products were purified and the concentrations were determined; 250 ng of purified PCR products were mixed with 1 μl 10×T7E1 buffer (Vazyme) and ultrapure water to a final volume of 10 μl, and subjected to a re-annealing process to enable heteroduplex formation: 95°C for 3 min, 95°C for 30 s, 90°C for 30 s, 85°C for 30 s, 80°C for 30 s, 75°C for 30 s, 70°C for 30 s, 65°C for 30 s, 60°C for 30 s, 55°C for 30 s, 50°C for 30 s, 45°C for 30 s, 40°C for 30 s, 35°C for 30 s, 30°C for 30 s, and 25°C for 1 min. After re-annealing, products were treated with T7 Endonuclease I (EN303-01/02, Vazyme) for 15 min at 37°C. The reaction mixtures were run on 2% agarose gels and imaged with ChemiDoc XRS+ and analyzed according to strip intensities. Indel percentage was determined by the formula: 100 × (1 –sqrt (b + c)/(a + b + c)), where a is the integrated intensity of the undigested PCR product and b and c are the integrated intensities of the cleavage product.
GUIDE-seq
Briefly, 600 ng Cas12a, 300 ng crRNA expression plasmids, and 10 pmol of the dsODN GUIDE-seq tag were transfected into a 24-well 293T cells (1.2 × 105 cells per well); 48 h after transfection, the genomic DNA was harvested and purified using FastPure Cell/Tissue DNA Isolation Mini Kit DC102 (Vazyme). GUIDE-seq tag integration percentages and on-target modification were assessed by restriction-fragment length polymorphisms (RFLPs) assays and T7E1 assays (as described above), respectively. A total of 1,000 ng genomic DNA was fragmented, end repaired, A-tailing by using Hieff NGS Fast-Pace DNA Fragmentation Reagent (12609ES96, Yeasen, Shanghai). The sequencing library was prepared and sequenced on an Illumina Instrument and data was analyzed using guideseq v1.1 as described previously.
Western blotting
To detect the expression of As variants, cells were collected and lysed after 48 h transfection. Lysates were resolved through SDS-PAGE electrophoresis and transferred onto a polyvinylidene fluoride membrane (Millipore, United States of America). Membranes were blocked by blocking buffer (5% non-fat milk in Tris buffered saline with Tween20 (TBST)) for 2 h. Blots were incubated with HA Mouse primary antibody (CAT#901515, Biolegend, USA) at 1:20,000 dilution and β-actin Rabbit primary antibody (CAT#AC026, ABclonal, China) at 1:20,000 dilution for 3 h at room temperature, respectively. After washing steps in TBST, membranes were incubated for 1 h with HRP Goat anti-Mouse IgG (H+L) for HA (CAT#AS003, ABclonal, China) and HRP Goat anti-Rabbit for β-actin (CAT#AS014, ABclonal, China) at 1:50,000 dilution, respectively.
Preparation of DNA samples for MT experiments
The hairpin region contains the PAM and protospacer in the center, schematic of the single-molecule substrate is available in S21C Fig.
All oligonucleotides were purchased from Sangon Biotech. The tethered DNA substrate was a molecule comprised of a 34-bp hairpin region, a 5-nucleotide loop, and 2 dsDNA handles, 630 bp and 653 bp, allowing attachment to the glass coverslip and the magnetic bead, respectively. The main step of hairpin construction is showed as follow:
We synthesized all the oligos with the following sequences (Sangon Biotech).
hairpin-biotin: bio-ATTTACGCCGGGATATGTCAAGC
hairpin_F1: ATTTACGCCGGGATATGTCAAGCCGAAGCATGAAGTG
hairpin_upR1: TCTGATGGTCCATACCTGTTACACTGCCTGAATGCAGCCATAGGTGC
hairpin_upR2: TGAGGTCTCAAGAAAAACTCACGACGCTTTCTGATGGTCCATACCTGTT
hairpin_upR3: CTGAATGCAGCCATAGGTGC
hairpin-sh: SH-AGTCAGTTGCATCAGTCACAAGGG
hairpin_R1: AGTCAGTTGCATCAGTCACAAGGG
hairpin_downF3: CAGGTATACAGATTAATCCGGC
hairpin_downF1: CATACCTGTTACACTGCCTGAATGCCAGGTATACAGATTAATCCGGC
hairpin_downF2: TGAGGTCTCATTCTCACGACGCTTTCTGATGGTCCATACCTGTTACACTGCCTGA
MT experiments
We used a homemade magnetic tweezers setup to stretch individual DNA molecule, which was described previously [54]. We functionalized the coverslips with (3-aminopropyl) triethoxy silane (APTES, Sigma-Aldrich), which allowed the 5′-thiol end of each DNA molecule to attach to the amine group of APTES via Sulfo-SMCC crosslinker (Hunan Huteng). We attached streptavidin-coated paramagnetic beads (Dynal M270, Thermo Fisher Scientific) to the 5′-biotin end of the DNA molecules. We mixed crRNA and Cas12a at 37°C for 20 min in advance and added 10 nM Cas12a-crRNA complexes into the flow cell to form DNA-Cas12a-crRNA complexes. Magnetic tweezers measurements were collected at room temperature (21 to 23°C) in 10 mM Tris-HCl (pH 7.5), 150 mM KCl, and 0.1 mg/ml BSA buffer.
Statistics analysis and reproducibility
All statistical analyses were performed using GraphPad Prism (v.8.2.1). The exact replication numbers are indicated in the figure legends. The reproducibility was shown by performing at least 2 independent biological replicate experiments.