Exogenous Protein Allergenicity Analysis
1. Background
This protocol provides a scientifically rigorous bioinformatics assessment service for the potential allergenicity of heterologous gene expression products in genetically modified microorganisms and novel foods used in food processing, in full compliance with applicable Chinese national standards and international regulatory frameworks. The protocol strictly adheres to relevant national standards of the People's Republic of China and references the guidance principles of the Codex Alimentarius Commission, providing critical scientific evidence for the food safety assessment of the relevant products.
Food allergy is a global public health concern. Accordingly, any novel protein introduced into the food chain must undergo rigorous allergenicity risk assessment. Bioinformatics analysis efficiently and sensitively identifies structural similarities between a test protein and known allergens through multi-dimensional alignment against authoritative allergen databases, enabling prediction of potential cross-reactivity risk. It constitutes an indispensable first line of defense in allergenicity safety assessment.
2. Technical Principles and Analysis Pipeline
The core principle of bioinformatics allergenicity assessment is as follows: if the amino acid sequence of an exogenous protein exhibits significant similarity to a known allergen protein sequence, it may share similar structural and immunological features (such as IgE-binding epitopes), and could therefore trigger allergic reactions through cross-reactivity. The detailed principle is illustrated in Figure 1.

Figure 1. Schematic Diagram of the Allergenicity Assessment Principle
This figure illustrates the assessment logic, from sequence alignment between an unknown exogenous protein (left) and known allergens (right), to identification of similar epitopes (bottom left) and prediction of potential immune cross-reactivity.
This protocol employs a multi-criteria, multi-tiered analytical strategy that is internationally recognized and adopted by national standards, ensuring the comprehensiveness, scientific rigor, and reliability of assessment results. The overall analysis workflow is shown in Figure 2.

Figure 2. Allergenicity Analysis Workflow Diagram
This workflow illustrates the complete steps from sequence input to final risk assessment, encompassing three parallel sequence alignment methods with their respective decision criteria, and the integrated assessment logic based on the "positive-if-any" principle.
3. Result Interpretation and Decision Criteria
Result determination is strictly implemented in accordance with national standards, using a "three-criteria" comprehensive assessment system. If any single criterion is met, the test protein is classified as having potential allergenicity risk. The result interpretation approach is illustrated in Figure 3.

Figure 3. Schematic Diagram of Result Determination
This figure illustrates the visualization process of the three alignment methods and provides an example of result determination. In this example, although Methods 1 and 3 are negative, Method 2 detects significant identity to a peanut allergen; therefore, the final assessment conclusion is high risk.
4. Technical Advantages
(1) Regulatory Compliance:Strictly follows the national standard "Bioinformatics Methods for Allergenicity Assessment of Exogenous Proteins in the Food Safety Testing of Transgenic Organisms and Their Products" (Ministry of Agriculture Announcement No. 1485-18-2010)..
(2) Authoritative Databases with Continuous Updates:Utilizes internationally authoritative allergen databases including AllergenOnline, WHO/IUIS, and COMPARE, covering 2,000+ verified allergens across food, inhalant, and contact categories, with quarterly updates to ensure analyses are based on the most current and comprehensive scientific knowledge.
(3) Multi-criteria Comprehensive Assessment Strategy:Innovatively integrates three methods - full-length sequence alignment, exact epitope matching, and sliding-window alignment - evaluating allergenicity risk from global to local and from structure to epitope in multiple dimensions, reducing false-negative rates and providing more comprehensive and reliable safety assurance.
(4) High Sensitivity and Conservative Principle:Employs a conservative "positive-if-any" decision strategy that prioritizes consumer safety, preferring over-warning to missing potential risks, in line with the precautionary principle in food safety.
(5) Traceability and Transparency:Provides complete raw alignment data, analysis parameters, and database version information, ensuring results are reproducible and auditable.
5. Application Scenarios
This allergenicity bioinformatics analysis service is widely applicable across food safety, biotechnology, and regulatory compliance domains, providing critical safety assessment support for diverse product types and R&D stages:
(1) Regulatory Compliance:Strictly follows the national standard "Bioinformatics Methods for Allergenicity Assessment of Exogenous Proteins in the Food Safety Testing of Transgenic Organisms and Their Products" (Ministry of Agriculture Announcement No. 1485-18-2010)..
(2) Authoritative Databases with Continuous Updates:Utilizes internationally authoritative allergen databases including AllergenOnline, WHO/IUIS, and COMPARE, covering 2,000+ verified allergens across food, inhalant, and contact categories, with quarterly updates to ensure analyses are based on the most current and comprehensive scientific knowledge.
(3) Multi-criteria Comprehensive Assessment Strategy:Innovatively integrates three methods - full-length sequence alignment, exact epitope matching, and sliding-window alignment - evaluating allergenicity risk from global to local and from structure to epitope in multiple dimensions, reducing false-negative rates and providing more comprehensive and reliable safety assurance.
(4) High Sensitivity and Conservative Principle:Employs the "“positive-if-any”" conservative decision strategy that prioritizes consumer safety, preferring over-warning to missing potential risks, in line with the precautionary principle in food safety.
(5) Traceability and Transparency:Provides complete raw alignment data, analysis parameters, and database version information, ensuring results are reproducible and auditable.
6. Sample Report
6.1 Example Allergenicity Assessment Result Report
The final delivered report will contain a detailed results summary table clearly showing the alignment results of each test protein under the three criteria, the integrated assessment conclusion, and the risk level, as shown in Table 1. The tablepresentsthe analysis results of 5 proteins to illustrate the format of the final report. Among these, Proteins A, B, and D are classified as high risk because they tested positive in at least one criterion.

7. Service Contents and Sample Requirements
7.1 Service Contents
We provide a one-stop service from project consultation to final report delivery, ensuring a clear, efficient, and compliant project process.
Service Item | Service Content |
Project Consultation | Senior technical experts assist in understanding regulatory requirements and defining the analysis plan and information needs. |
Data Analysis | Execution of the standardized bioinformatics analysis pipeline described above, ensuring accuracy and traceability at every step. |
Report Delivery | Delivery of a comprehensive PDF report and complete analysis result files within the committed turnaround time (typically 10 business days). |
After-sales Support | Professional report interpretation and ongoing technical consultation to assist clients in understanding the significance of the analysis results. |
7.2 Sample and Information Requirements
To ensure the accuracy and reliability of analysis results, clients must provide samples and information strictly according to the following requirements:
Requirement Category | Item | Specific Requirements |
Required Information | Protein Sequence | Provide amino acid sequences in FASTA format. Each sequence must have a unique and identifiable ID. |
Sequence Origin | Clearly state the origin of each protein (e.g., host strain, inserted gene, species, etc.). | |
Supplementary Information | Project Background | Briefly describe the intended application of the test protein (e.g., food processing aid, novel food ingredient, etc.). |
Turnaround | Detection Turnaround Time | 10 business days |
8. References
[1] Ministry of Agriculture of the People's Republic of China. (2010). Ministry of Agriculture Announcement No. 1485-18-2010: Bioinformatics Methods for Allergenicity Assessment of Exogenous Proteins in the Food Safety Testing of Transgenic Organisms and Their Products.
[2] Ladics, G. S., Cressman, R. F., Herouet-Guicheney, C., Herman, R. A., Privalle, L., & Song, P. (2011). Bioinformatics and the allergy assessment of agricultural biotechnology products: industry practices and recommendations. Regulatory Toxicology and Pharmacology, 60(1), 52-61.
[3] Goodman, R. E., Ebisawa, M., Ferreira, F., Sampson, H. A., van Ree, R., Vieths, S., ... & Taylor, S. L. (2016). AllergenOnline: a peer‐reviewed, curated allergen database to assess novel food proteins for potential cross‐reactivity. Molecular Nutrition & Food Research, 60(5), 1183-1198.
[4] Hileman, R. E., Silvanovich, A., Goodman, R. E., Rice, E. A., Holleschak, G., Astwood, J. D., & Hefle, S. L. (2002). Bioinformatic methods for allergenicity assessment using a comprehensive allergen database. International Archives of Allergy and Immunology, 128(4), 280-291.