sgRNA Sequence Accuracy Identification
1. Service Overview
Zhuhai GeneRulor specializes in full-process analytical testing solutions for gene editing therapeutics. This service is designed for developers of CRISPR/Cas9, Cas12a, Cas13, base editing, and prime editing gene editing modalities, providing nucleotide-level sequence accuracy identification of chemically synthesized sgRNA.
We employ a workflow combining scaffold-specific reverse transcription, UMI-based single-stranded library construction, and high-depth sequencing, enabling single-molecule traceability-based detection of synthetic variants at frequencies as low as 0.1%. This includes latent G→A deamination byproducts induced by 5'-end modifications during the chemical synthesis process, N±x truncation impurities, and assessment of the full-length major-peak sequence purity covering the complete spacer + scaffold sequence (100 nt).
The method has undergone full ICH Q2(R2) eight-item method validation and is directly applicable to CMC documentation for IND-enabling studies, IND release, and BLA/NDA commercial manufacturing stages, as well as comparability studies following supplier changes or process modifications.
2. Regulatory Background and Compliance Value
2.1 Regulatory Policy Timeline
As gene therapy and gene editing products advance toward clinical and commercial phases, major global regulatory authorities have progressively tightened quality control requirements for gene editing components such as sgRNA. Identity verification of sgRNA sequences has been formally elevated from an optional ancillary test to a mandatory Critical Quality Attribute (CQA) under mainstream regulatory frameworks. Since EMA first specified sgRNA sequence quality requirements in its GTMP guidelines in 2018, PMDA established a gene editing tool quality control framework in 2019, the FDA and EMA jointly issued CMC and cell therapy product guidance in 2020, NMPA/CDE published its first dedicated gene editing guidance in 2022, and the FDA issued its most stringent final guidance on gene editing therapies to date in 2024—a consistent trend has emerged: nucleotide-level sequence identity verification of sgRNA sequences is becoming the central focus of regulatory review worldwide. This service covers all detection capabilities required at each of the above regulatory milestones. Full method validation compliant with ICH Q2(R2) has been completed, and data and reports can directly support FDA, EMA, and CDE regulatory submissions.

Figure 1. Timeline of Global Regulatory Agency sgRNA Quality Control Policy Evolution (2018–2024)
2.2 Key Regulatory Document Interpretation
① FDA Guidance: Human Gene Therapy Products Incorporating Human Genome Editing (2024)
This guidance, formally issued in January 2024, is the most rigorous and comprehensive regulatory document to date specifically addressing gene editing therapeutics on a global scale. The guidance explicitly identifies nucleotide-level identity verification data for gRNA as a core CMC requirement, mandates that applicants provide sequence identity evidence capable of resolving single-nucleotide differences, and emphasizes that sgRNA sequence specificity data should be integrated with off-target risk assessment. For companies seeking to submit INDs or supplemental BLA/NDA applications to the FDA, single-nucleotide resolution sequence identification reports based on high-throughput sequencing (NGS) have become indispensable submission support materials. The detection data and report formats provided by this service directly satisfy the applicable requirements of this guidance.

Figure 2. FDA 2024 Gene Editing Therapy Guidance: CMC Requirements for gRNA Components
② NMPA/CDE Technical Guidelines for Pharmacological Research and Evaluation of Ex Vivo Gene Modification Systems (Trial) (2022)
This guideline is China's first specialized regulatory document governing pharmacological research of gene-modified cell therapy products (covering CAR-T, HSC, and other ex vivo gene-edited products). It formally incorporates sgRNA and other gene editing tools into the identity verification and quality control framework for starting materials, explicitly requiring sequence specificity verification data as the basis for batch release decisions. This service has completed full ICH Q2(R2) method validation (including specificity, accuracy, precision, linearity, range, and sensitivity [LOD/LOQ]), and the report format and data structure are directly applicable to CDE submission documentation, significantly reducing the validation costs and review risks associated with in-house method development.
3. Technical Principles
3.1 Limitations of Conventional Methods
Existing mainstream sgRNA quality control methods all harbor fundamental technical bottlenecks that render NGS irreplaceable for sequence accuracy identification:
·HPLC / IP-RP UPLC: Based on chain length separation, these methods can only provide N, N-1, N-2, and N+x impurity ratios. Point mutations of equivalent length (e.g., G→A at position 15) are entirely undetectable—a fundamental technical blind spot that cannot be overcome by method optimization alone.
·High-Resolution LC-MS: G→A isobaric substitutions (mass difference of only 16 Da) on a 100 nt full-length chain (molecular weight approximately 31,000 Da) produce a signal difference of only ~0.05% in the deconvolution spectrum, far below the instrument resolution limit and practically indistinguishable in real-world testing.
·Sanger Sequencing: Cannot directly analyze RNA; requires prior reverse transcription to cDNA. Sensitivity is only approximately 15%, and low-frequency synthetic artifacts (e.g., point variants at 1–5%) are masked by background signal and cannot be quantitatively detected.
·NGS (this service): Currently the only method capable of quantitative detection of composite variants at frequencies ≤ 0.1% at single-nucleotide resolution. By tagging each RNA molecule with a UMI, post-sequencing single-molecule deduplication and traceability effectively remove PCR amplification errors and sequencing system errors, enabling truly quantitative detection.
3.2 Core Workflow
Zhuhai GeneRulor employs the following three-step workflow to achieve single-molecule-level high-sensitivity sgRNA sequence identification:
·Scaffold-Specific Reverse Transcription: Highly specific reverse transcription primers are designed targeting the sgRNA scaffold region, efficiently converting RNA to cDNA while effectively excluding endogenous RNA background interference and ensuring the signal originates exclusively from the target sgRNA.
·UMI Single-Stranded Library Construction: A Unique Molecular Identifier (UMI) is introduced for each RNA molecule during library construction, enabling post-sequencing single-molecule deduplication and traceability. This decouples sequencing errors (approximately 0.1–0.5%) from true biological variants, allowing the true LOD to reach ≤ 0.1%.
·High-Depth Sequencing: A high-depth sequencing strategy with sequencing depth > 50,000× (standard protocol) is applied to comprehensively cover the full-length spacer + scaffold sequence, ensuring sufficient statistical power at every position with Coverage Uniformity > 80% (depth at each position / mean depth).
4. Core CQA Output Indicators
Each test report includes quantitative results for the following four core quality attributes (CQAs), which can be directly applied to CMC batch release decisions:
CQA Indicator | English Abbreviation | Definition and Regulatory Significance |
Full-Length Sequence Purity | FLP% (Full-Length Perfect-Match Sequence Purity) | The proportion of full-length, perfectly matched sequences without any SNV, indel, or truncation; serves as the primary pass/fail criterion for batch release. |
Single-Nucleotide Variant Profile | SNV Profile (VAF Heatmap) | Visualization of variant allele frequency (VAF) at each position, enabling precise identification of synthesis-related artifact hotspots (e.g., G→A deamination at specific sites). |
Truncation & Addition Impurity Profile | N-x / N+x Truncation & Addition Profile | Chain-length distribution of impurities, which can be cross-validated with IP-RP HPLC data to provide a comprehensive size variant characterization. |
Coverage Uniformity | Coverage Uniformity | Depth uniformity across every nucleotide position of the full-length spacer + scaffold, ensuring statistical reliability of results. |
5. Typical Application Scenarios
·IND submission for CRISPR/Cas9, Cas12a, and Cas13 gene editing drug candidates
·sgRNA / pegRNA QC for next-generation editing tools based on base editing / prime editing
·sgRNA raw material batch release for ex vivo cell therapies (CAR-T, TCR-T, HSC, iPSC)
·GMP sgRNA batch release for in vivo LNP-CRISPR therapies
·Comparability studies following sgRNA supplier changes or manufacturing process modifications
·Sequence degradation monitoring during long-term stability studies (ICH Q1A(R2) stability protocol support)
·Root cause analysis of OOS / OOT deviations (tracing synthesis artifact hotspot positions)
6. Sample Submission Specifications
Parameter | Requirement |
Submission Format | Lyophilized powder or aqueous solution; solution should be prepared in RNase-free water or TE buffer (10 mM Tris-HCl, pH 7.5–8.0). |
Minimum Concentration | ≥ 1 μg/μL (solution); for lyophilized powder, QC data (Bioanalyzer or NanoDrop) must be provided. |
Recommended Amount | ≥ 5 μg total (sufficient for three technical replicates of library construction plus one reserve aliquot). |
Packaging & Shipping | Sealed, RNase-free microcentrifuge tubes; dry ice shipping; avoid repeated freeze-thaw cycles (≤ 2 freeze-thaw cycles). |
Sequence Information | Full-length sgRNA sequence (spacer + scaffold) must be provided; a reference FASTA file is recommended. |
Additional Information | Synthesis route (chemical synthesis / IVT); presence of modifications (type and position); batch number; whether a comparator reference standard is included for co-submission. |
Note: For multi-target sgRNA samples, please package each target separately and label with a unique sample ID. When submitting comparability study batches simultaneously, please indicate "Reference Sample" and "Test Sample" clearly to facilitate matched analysis.
7. Service Workflow and Timeline
Standard turnaround time is 30 business days; expedited service is 20 business days (+20% surcharge). For multi-target bulk testing, a customized service plan is available—please contact the Zhuhai GeneRulor team for specific timelines and pricing.
Stage | Timeline (Business Days) | Key Activities |
Pre-analytical Preparation | 5 | Sample receipt and QC (Bioanalyzer / NanoDrop); design and validation of scaffold-specific primers; confirmation of experimental protocol. |
Library Construction & QC | 5 | Scaffold-specific reverse transcription; UMI single-stranded library construction; library QC and normalization. |
High-Depth Sequencing | 5–10 | Sequencing run (effective depth ≥ 50,000×); raw data QC (Q20/Q30 metrics); FASTQ archiving with SHA-256 checksum. |
Data Analysis & Reporting | 5–10 | Sequence alignment (FASTQ → BAM → VCF); calculation of four CQA indicators; report drafting and authorized signature. |
8. Deliverables
The following items are included as standard deliverables for each test. All documents comply with FDA electronic records and data integrity regulations (21 CFR Part 11), with complete audit trails, version-controlled archiving, and electronic signature records, ready for direct use in regulatory submissions and internal quality system archiving.
Core Deliverables (Included with Every Order):
·Bilingual (Chinese/English) Test Report: Contains quantitative results for all four core indicators—FLP%, SNV VAF heatmap, N±x truncation profile, and Coverage Uniformity; includes a definitive pass/fail determination.
·Raw Data Archive Package: FASTQ + BAM + VCF raw data files; version-controlled archiving.
·Analytical Method Summary Document: Summary of key method parameters, instrument configuration, and QC metrics; formatted to comply with CTD Module 3 for direct use in CMC submissions.
·Data Integrity Statement: 21 CFR Part 11 compliant; includes audit trail and electronic signature records.
Supportive Documents (Provided Upon Request; Requires Prior Confirmation):
·Method Validation Report: Complete six-item ICH Q2(R2) validation data, including specificity, accuracy, precision, linearity, range, and sensitivity (LOD/LOQ). Recommended for first-time collaborations or new sgRNA formats; reusable for subsequent tests.
·Regulatory Submission Assistance Documents: CMC chapter reference paragraphs or Pre-IND meeting background materials tailored to the client's specific submission needs; document scope and format must be confirmed in advance.