Location:
ABE/CBE Transcriptome Off-Target Editing Detection

ABE/CBE Transcriptome Off-Target Editing Detection

ABE/CBE Transcriptome Off-Target Editing Detection

1. Background

Single-base editors (Base Editors) fuse deaminases with Cas9 nickases or catalytically inactive Cas proteins to achieve precise base conversion at targeted sites without inducing DNA double-strand breaks. The two most widely used classes of base editors are: Adenine Base Editors (ABE), which employ TadA deaminase to convert A to I (read as G), enabling precise A-to-G substitution; and Cytosine Base Editors (CBE), which utilize deaminases from the APOBEC/AID family to convert C to U (read as T), enabling precise C-to-T substitution. This technology has demonstrated significant value in gene therapy, disease model construction, and functional genomics research. However, the intrinsic sequence tolerance of deaminases or incomplete specificity of gRNAs may lead to "off-target editing"—unintended base modifications at non-target genomic or transcriptomic sites—posing potential safety risks and experimental interference. The transcriptomic off-target mechanism of base editors is specifically manifested as follows: the deaminases fused to CBE and ABE (e.g., APOBEC family or TadA) are biochemically capable of acting on single-stranded RNA. Their free diffusion within cells allows them to act independently of gRNA guidance, directly contacting mRNA and other single-stranded RNA substrates in the transcriptome, catalyzing unintended C→U or A→I base modifications, which can lead to amino acid substitutions, premature termination, and other functional abnormalities at the translational level. A schematic diagram is shown below:

Figure 1. Schematic diagram of the deaminase substrate action mechanism in CBE base editors

In recent years, regulatory guidelines related to gene therapy (such as China NMPA's 2022 "Technical Guidelines for Pharmaceutical Research and Evaluation of In Vivo Gene Therapy Products" and FDA's 2024 updated guidance) have emphasized the need for systematic off-target assessment strategies, encompassing bioinformatics prediction, DNA-level off-target detection, and RNA-level off-target validation, to comprehensively evaluate the specificity and risks of editing systems.

Transcriptome mRNA off-target editing detection, as an indispensable component of this framework, primarily targets the non-directed editing activities of ABE and CBE deaminases at the RNA level (A-to-I/G and C-to-U/T conversions, respectively). Through high-depth RNA sequencing and bioinformatics analysis of edited samples, one can directly assess whether predicted off-target sites exhibit aberrant editing in transcripts. The technical advantages include:

(1) Functional Directness: Detects off-target events in transcriptionally active regions, better reflecting the potential impact on gene expression and function;

(2) High Sensitivity: Capable of identifying low-frequency off-target signals (sensitivity typically reaching 0.1%), suitable for early safety screening;

(3) Mechanistic Insight: Can distinguish between sgRNA-dependent and sgRNA-independent off-target events, and differentiate the RNA off-target profiles of ABE (A-to-I/G) and CBE (C-to-U/T), providing a basis for understanding deaminase activity;

(4) Optimization Guidance: Validates predicted off-target sites and provides data support for optimizing ABE/CBE deaminase variants (e.g., high-fidelity ABEmax, evoAPOBEC), gRNA design, and delivery systems.

GeneRulor, leveraging its proprietary high-depth transcriptome sequencing and targeted analysis pipeline, has developed a systematic mRNA single-target off-target editing detection platform, designed to provide customers with regulatory-compliant, scientifically rigorous off-target risk assessment solutions that support the research, development, optimization, and safety translation of gene editing tools.

2. Detection Principle

Transcriptome mRNA single-target off-target editing detection technology is based on high-depth RNA sequencing. It targets bioinformatically predicted high-risk off-target sites to detect whether editing (e.g., base substitutions, insertions/deletions) occurs at the transcript level, thereby validating the authenticity and functional impact of predicted off-target sites.

Detection Workflow:

(1) Off-target Site Prediction: The Cas-OFFinder tool is used to predict potential off-target sites with sequences similar to the target site, prioritizing sites with ≤6 mismatches located in transcriptionally active regions.

(2) RNA Extraction and Library Construction: High-quality total RNA is extracted from edited samples, and target transcripts are enriched using oligo(dT) magnetic beads for mRNA enrichment or rRNA depletion, followed by high-depth sequencing library construction (recommended sequencing depth: ≥50M reads).

(3) High-Depth Sequencing: DNBSEQ-T7 is used for PE150 paired-end sequencing to ensure sufficient sequencing depth at predicted off-target sites (per-site coverage depth ≥1,000×).

(4) Off-target Editing Detection: Sequencing data are aligned to the reference genome; variant calling (SNV/Indel calling) is performed at predicted off-target sites; off-target editing frequencies are calculated and compared against control groups to validate the authenticity of off-target events.

Data Analysis Strategy:

(1) On-target Editing Efficiency Verification: The editing efficiency at the intended target site is first assessed to confirm successful editing. The editing frequency at the target site is calculated (e.g., C-to-T conversion rate for CBE, A-to-G conversion rate for ABE).

(2) Predicted Off-target Site Editing Detection: For each bioinformatically predicted off-target site, RNA-level editing is assessed. Off-target editing frequencies are calculated, and sites with frequencies significantly above background mutation rates are identified as true off-target sites. Based on prediction results, off-target sites are further classified as sgRNA-dependent or sgRNA-independent.

(3) Functional Impact Assessment of Off-target Editing: Genomic annotations of off-target editing sites (exonic, intronic, UTR, promoter, etc.) are analyzed; the impact of off-target editing on protein-coding sequences (synonymous, missense, and nonsense mutations) is evaluated to predict functional consequences.

(4) Tumor-related Gene Risk Assessment: Based on COSMIC, ONCOGENE, and other databases, the genes harboring off-target sites are evaluated for oncogene or tumor suppressor gene status to assess potential carcinogenic risks.

Compared to conventional Sanger sequencing or targeted amplicon sequencing, high-depth transcriptome sequencing enables simultaneous detection of dozens to hundreds of predicted off-target sites, significantly increasing detection throughput. The high sequencing depth (≥1,000× per site) enables detection of low-frequency off-target events (sensitivity as low as 0.1%), far exceeding traditional methods.

Figure 2. Workflow for base editor off-target detection based on high-depth mRNA sequencing

3. Technical Advantages

3.1 High-Throughput Coverage of Predicted Off-target Sites

(1)A single experiment can simultaneously detect dozens to hundreds of bioinformatically predicted off-target sites

(2)Covers all predicted off-target sites in transcriptionally active regions, with priority identification of functional off-target events

(3)Significantly improves detection efficiency compared to per-site Sanger sequencing or qPCR validation

3.2 High-Sensitivity Low-Frequency Off-target Detection

(1)Recommended sequencing depth ≥50M clean reads to ensure sufficient coverage at predicted off-target sites (≥1,000× per site)

(2)High-depth sequencing enables detection of low-frequency off-target editing events with sensitivity as low as 0.1%

(3)Professional variant calling algorithms (e.g., GATK, VarScan2) are employed to eliminate sequencing errors and background mutation noise

3.3 Functional Off-target Assessment

(1)Detects off-target editing in transcriptionally active regions, reflecting the actual impact of off-target effects on cellular function

(2)Assesses the impact of off-target editing on protein-coding sequences (synonymous/missense/nonsense mutations) to predict functional consequences

(3)Integrates tumor-related gene databases to evaluate the carcinogenic risk of off-target sites

4. Application Scenarios and Service Advantages

4.1 Application Scenarios

(1) IND Filing Support for Gene Editing Products: Provides transcriptome off-target editing detection reports compliant with regulatory requirements, verifying the authenticity of bioinformatically predicted off-target sites and meeting the data requirements of NMPA/FDA for single-target editing off-target safety evaluation.

(2) gRNA Design Optimization and Screening: Compares off-target editing profiles across different gRNA designs to select gRNA sequences with minimal off-target effects and optimize editing strategies.

(3) Safety Evaluation of Editing Tools: Assesses the RNA-level off-target editing profiles of different editing tools to verify their safety advantages.

(4) Preclinical Safety Research: Evaluates single-target gene editing off-target editing in cell lines, organoids, or animal models, providing safety evidence for clinical trial design.

(5) Cell Therapy Product Quality Control: Performs transcriptome off-target editing detection on engineered cell products such as CAR-T and TCR-T to ensure editing specificity and product safety.

4.2 Service Advantages

(1) Experienced Technology Platform: Having completed thousands of gene editing-related transcriptome projects, we have accumulated extensive experience in off-target editing detection, covering multiple editing tools including CRISPR/Cas9, Adenine Base Editors (ABE), Cytosine Base Editors (CBE), and Prime Editors, with particularly deep technical expertise in ABE/CBE RNA off-target detection.

(2) Dual-System Quality Certification: Simultaneously operating under ISO9001 quality management system and CNAS laboratory accreditation standards, ensuring traceable data quality.

(3) Professional Bioinformatics Team: Staffed with experienced bioinformatics analysts providing customized off-target site prediction and analysis solutions, including editing type-specific off-target detection strategies.

(4) Regulatory Submission Support: Provides standardized reports compliant with ICH, NMPA, and FDA requirements, directly supporting IND/BLA submissions.

(5) Integrated One-stop Service: Can integrate off-target site prediction, DNA-level off-target detection, and RNA-level off-target editing detection, providing multi-dimensional off-target assessment.

5. Example Report of Transcriptome Single-Target Off-Target Editing Detection

GeneRulor provides comprehensive transcriptome off-target editing detection reports that meet regulatory requirements. The following example, based on a single-target base editor (CBE) editing project, illustrates the core content of the report (reports for other types of single-target editing projects follow a similar structure):

5.1 Data Quality Assessment

Sequencing data statistics, base quality distribution, GC content analysis, and alignment rate assessment are provided to ensure data quality meets analytical requirements. This includes pre-QC and post-QC data tables. Verification of whether sequencing depth meets the requirements for predicted off-target site detection (≥50M reads; average coverage depth at predicted off-target sites ≥1,000×) is included.

Example raw sequencing data

Sample ID

Total Reads (M)

Total Bases (Gb)

GC Content (%)

Q20 (%)

Q30 (%)

CBE_T1

58.75

8.09

51.33

96.86

93.37

CBE_T2

64.51

9.3

48.85

96.58

94.36

CBE_T3

62.32

8.9

48.73

97.22

92.6

Control_C1

60.99

9.06

48.73

96.28

93.54

Control_C2

56.56

8.03

49.22

96.58

93.78

Control_C3

56.56

9.45

50.1

96.73

92.14

Example QC and alignment data

Sample ID

Clean Reads(M)

Clean Bases(Gb)

GC Content (%)

Q20 (%)

Q30 (%)

Mapping Rate (%)

CBE_T1

56.88

7.76

48.14

98.09

95.79

93.55

CBE_T2

61.61

8.86

51.64

97.37

96.77

93.09

CBE_T3

59.33

8.64

49.04

98.94

94.27

95.31

Control_C1

59.68

8.73

50.65

98.55

94.59

93.43

Control_C2

55.37

7.66

49.25

98.88

94.14

93.12

Control_C3

55.1

9.12

50.08

98.79

94.98

94.17

Example sequencing depth data

Sample ID

Mapped Reads (M)

Uniquely Mapped (M)

Average Coverage Depth (×)

Predicted Off-target Sites Coverage (×)

CBE_T1

53.21

50.55

1242

1002

CBE_T2

57.35

54.49

1441

1326

CBE_T3

56.55

53.72

1222

1283

Control_C1

55.76

52.97

1496

1292

Control_C2

51.56

48.98

1432

1309

Control_C3

51.89

49.29

1260

1030

5.2 On-target Editing Efficiency Verification

The editing efficiency at the intended target site is assessed to confirm successful editing. CBE editing example: The C-to-T conversion frequency at the target site is calculated, and an IGV visualization of target site editing efficiency is provided. The editing profile of the surrounding sequence is shown (e.g., editing window analysis when multiple C bases are present).

Example on-target site detection results

SAMPLE

CHROM

POS

REF

ALT

Read Coverage

Editing Efficiency

CBE_T2

chr5

123319941

C

U

102

6.86%

CBE_T1

chr5

123319942

C

U

76

98.59%

CBE_T2

chr5

123319942

C

U

115

88.24%

CBE_T3

chr5

123319942

C

U

82

90.00%

CBE_T1

chr5

123319943

C

U

76

91.67%

CBE_T2

chr5

123319943

C

U

126

72.55%

CBE_T3

chr5

123319943

C

U

84

81.43%

CBE_T1

chr5

123319953

C

U

76

5.26%

Figure 3. IGV visualization of sequencing coverage and C-to-U editing events at the CBE-edited on-target site

5.3 Off-target Site Editing Detection Results

For each off-target site, detailed editing detection results are provided: sequencing coverage depth, editing frequency (edited reads / total reads), and background mutation rate in the control group. CBE editing example: The C-to-U conversion frequency at each predicted off-target site is assessed, and true off-target sites significantly exceeding background levels are identified. Bar charts of off-target editing frequency and distribution maps of off-target sites are provided.

Example off-target site detection results

SAMPLE

CHROM

POS

REF

ALT

MUTATION

DP

CBE Editing Efficiency

Control Editing Efficiency

PC-rep3

chr1

4596643

U

C

U>C

27

0.923076923

0

PC-rep3

chr1

4596677

C

G

C>G

27

0.923076923

0

PC-rep3

chr1

4596689

A

C

A>C

27

0.923076923

0

PC-rep3

chr1

4596713

A

G

A>G

28

0.923076923

0

PC-rep3

chr1

4596745

G

A

G>A

27

0.916666667

0

PC-rep2

chr1

4758167

U

C

U>C

40

1

0

PC-rep3

chr1

4758167

U

C

U>C

81

1

0

PC-rep4

chr1

4758167

U

C

U>C

43

1

0

PC-rep2

chr1

4758189

A

G

A>G

39

1

0

PC-rep3

chr1

4758189

A

G

A>G

85

1

0

Figure 4. Number of 12 types of base editing events


Figure 5. Manhattan plot of editing efficiency distribution at editing sites

5.4 Functional Impact Analysis of Off-target Sites

The functional impact of detected off-target editing sites is analyzed: genomic annotation (exonic/intronic/UTR/promoter/intergenic regions), impact on protein-coding sequences (synonymous/missense/nonsense/frameshift mutations), and protein functional domain impact analysis. CBE editing example: For C-to-U edits in exonic regions, the predicted functional consequences are assessed.

Example functional impact results of off-target sites

Off-target ID

Chromosome

Position

Gene Symbol

Genomic Annotation

Average Editing Frequency (%)

Mutation Type

Reference Codon

Altered Codon

Reference AA

Altered AA

Protein Domain

Functional Impact

OT-01

chr17

29172140

BRCA1

Exon

4.41

Synonymous

TGC

TGT

C

C

Kinase domain

Low

OT-01

chr17

29172140

TP53

Exon

2.29

Missense

TAC

TAT

Y

D

Zinc finger

Moderate

OT-01

chr17

29172140

EGFR

Exon

11.55

Missense

TAC

TAT

Y

C

Zinc finger

Moderate

OT-02

chr7

187932742

KRAS

Exon

10.43

Missense

GCA

GTA

A

Q

Kinase domain

Moderate

OT-02

chr7

187932742

MYC

Exon

6.46

Missense

GAC

GAT

D

V

None

Moderate

OT-02

chr7

187932742

PTEN

Exon

2.05

Missense

CAA

TAA

Q

A

DNA-binding domain

Moderate

Figure 6. Functional annotation of editing sites

5.5 Predicted Off-target Site List

A complete list of all bioinformatically predicted off-target sites is provided, including: genomic coordinates, mismatch count, mismatch positions, gene information, and genomic annotations (exonic/intronic/UTR/promoter, etc.). CBE editing example: Predicted off-target sites containing C bases are listed with priority.

Example predicted off-target site results

Off-target ID

Chromosome

Position

Strand

Mismatches

Mismatch Positions

PAM Sequence

Contains C

Gene Symbol

Genomic Annotation

OT-03

chr5

138651110

+

1

3

TGG

Yes

RB1

Exon

OT-04

chr2

43942594

+

1

17

TGG

Yes

PIK3CA

5'UTR

OT-05

chr5

175904668

-

1

10

TGG

Yes

BRCA1

Intergenic

OT-07

chr1

129036268

-

1

5

TGG

Yes

KRAS

Promoter

OT-11

chr15

97362498

-

1

4

AGG

Yes

ERBB2

Exon

Based on combined analysis of predicted and detected off-target sites, off-target sites are further classified as sgRNA-dependent or sgRNA-independent.

Sample

off-target type

Num

CBE_T1

sgRNA dependent

2

CBE_T1

sgRNA independent

5

CBE_T2

sgRNA dependent

1

CBE_T2

sgRNA independent

3

CBE_T3

sgRNA dependent

0

CBE_T3

sgRNA independent

1

5.6 Tumor-related Gene Risk Assessment

Based on COSMIC, ONCOGENE, and other databases, the genes harboring off-target sites are evaluated for oncogene or tumor suppressor gene status. A list of tumor-related genes and their off-target editing status is provided, with emphasis on high-risk off-target events (e.g., nonsense mutations in tumor suppressor genes, activating mutations in oncogenes).

Gene

Oncogene/Tumor Suppressor

Cancer Type

Risk Level

CNN3

Oncogene

Breast cancer, Prostate cancer

High

5.7 Off-target Editing Summary and Risk Assessment

A summary table of off-target editing detection is provided: total number of predicted off-target sites evaluated, number of sites with detected off-target editing, number of high-frequency off-target sites (editing frequency ≥1%), number of functional off-target sites (missense/nonsense mutations in exonic regions), and number of high-risk off-target sites (in tumor-related genes). A comprehensive assessment of the off-target risk level (low/medium/high) for single-target editing is provided along with optimization recommendations (e.g., replacing the gRNA, optimizing editing conditions, etc.).

6. Transcriptome Off-target Editing Detection Service Content

Service Item

Description

Off-target Site Prediction

Use tools such as Cas-OFFinder to predict potential off-target sites (mismatches ≤6) and provide a list of predicted off-target sites

RNA Extraction and QC

Extract total RNA from client samples; quality assessment by Agilent 2100 (RIN ≥7.0)

Library Construction

mRNA enrichment or rRNA depletion, fragmentation, reverse transcription, end repair, adapter ligation

High-Depth Sequencing

PE150 mode, ≥50M clean reads/sample, ensuring coverage depth ≥1,000× at predicted off-target sites

Bioinformatics Analysis

QC, alignment, on-target editing efficiency verification, predicted off-target site variant calling, editing frequency statistics, functional impact analysis, tumor gene risk assessment

Deliverables

Raw sequencing data, QC report, on-target editing efficiency report, predicted off-target site list, off-target editing detection results, functional impact analysis, visualization charts, comprehensive analysis report

Service Timeline: Standard workflow: 30–40 business days (from receipt of qualified samples to report delivery, including off-target site prediction)

7. Sample Requirements

Sample Type

Requirements

Total RNA

Concentration ≥100 ng/μL, total amount ≥3 μg, RIN ≥7.0, OD260/280 = 1.8–2.0

Cell Samples

≥2×10⁶ cells/sample, cell viability ≥80% (higher starting material required for high-depth sequencing)

Tissue Samples

≥150 mg/sample, fresh tissue or stored at −80°C

Biological Replicates

Recommended ≥3 biological replicates per group (edited group + control group) to enhance statistical power for off-target detection

Editing Information

Provide target sequence, PAM sequence, gRNA sequence, editing tool type (CRISPR/Cas9, CBE, ABE, PE, etc.), and reference genome version

Notes:

(1) All RNA samples must meet the quality standards above. RNA stabilization reagents or −80°C storage with dry ice shipping is recommended.

(2) Clients may also ship cell or tissue samples for RNA extraction.

(3) Complete editing information (target sequence, gRNA, editing tool type, etc.) must be provided to enable accurate off-target site prediction and editing detection.

(4) For special editing types or specific analytical requirements, please communicate with GeneRulor's technical team in advance.

Contact: Phone: 400-6309596 | Technical Support: service@generulor.com

8. References

[1] National Medical Products Administration, Center for Drug Evaluation. (2022). Technical Guidelines for Pharmaceutical Research and Evaluation of In Vivo Gene Therapy Products (Trial).

[2] FDA. (2024). Human Gene Therapy Products Incorporating Human Genome Editing - Guidance for Industry.

[3] Grunewald, J., et al. (2019). Transcriptome-wide off-target RNA editing induced by CRISPR-guided DNA base editors. Nature, 569(7756), 433-437.

[4] Kim, D., et al. (2019). Genome-wide target specificities of CRISPR RNA-guided programmable deaminases. Nature Biotechnology, 37(4), 430-435.

[5] Zuo, E., et al. (2019). Cytosine base editor generates substantial off-target single-nucleotide variants in mouse embryos. Science, 364(6437), 289-292.