Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

GPAS: an online AI system for rapid and accurate pathogen identification and LLM-based interpretation

Tingting Li, Hao Hong, Duchangjiang Fan, Jin Li, Ting Li, Jiaqi Wu, Shuai Jiang, Xianxing Xie, Yawei Zhang, ManDong Hu, Xiaoyao Yin, Yizhe Zhang, Heping Ma, Zhehan Liu, Zhihui Su, Xiping Yu, Yu Liu, Hetian Yuan, Weifan Zheng, Haoyuan Liu, Mingyue Ma, Xingyue Li, Yezhuang Shen, Cheng Zhang, Yuyi Wang, Bing Zhao, Liming Sun, Qiuying Han, Jing Chen, Ke Zhang, Liang Chen, Na Wang, Weihua Li, Jianghong Man, Kun He, Fangting Dong, Fei Du, Yan Yi, Ailing Li, Tao Zhou, Xuemin Zhang, Tao Li
doi: https://doi.org/10.64898/2026.02.18.26346517
Tingting Li
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: tli{at}ncba.ac.cn
Hao Hong
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Duchangjiang Fan
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jin Li
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ting Li
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jiaqi Wu
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shuai Jiang
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xianxing Xie
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yawei Zhang
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
ManDong Hu
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiaoyao Yin
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yizhe Zhang
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Heping Ma
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zhehan Liu
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zhihui Su
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiping Yu
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yu Liu
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hetian Yuan
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Weifan Zheng
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Haoyuan Liu
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mingyue Ma
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xingyue Li
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yezhuang Shen
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cheng Zhang
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuyi Wang
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bing Zhao
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Liming Sun
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Qiuying Han
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jing Chen
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ke Zhang
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Liang Chen
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Na Wang
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Weihua Li
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jianghong Man
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kun He
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fangting Dong
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fei Du
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yan Yi
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ailing Li
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tao Zhou
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xuemin Zhang
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tao Li
1Nanhu Laboratory, State Key Laboratory of Biomedical Analysis (SKLBA, also known as National Center of Biomedical Analysis, NCBA), Beijing, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Accurate identification of unknown pathogens is critical for medicine and public health, yet current metagenomic workflows remain heavily dependent on specialized bioinformatics expertise and manual interpretation, creating substantial bottlenecks in time-sensitive diagnostic settings1. The key challenges lie in achieving precise species identification amidst high background noise and translating complex microbial data into clinically actionable insights2,3. Here we present the Global Pathogen Analysis System (GPAS), an integrated computational framework that combines rapid and accurate pathogen identification with large language model (LLM)-based semantic interpretation. Central to GPAS is a dynamic-library alignment mechanism informed by prior probabilities of inter-species misclassification. By integrating a hybrid machine learning model that couples elastic neural networks with Bayesian inference, this approach substantially reduces both false positives and false negatives, achieving species-level accuracy superior to existing state-of-the-art tools. To enable clinical interpretation, we constructed a unified microbial knowledge graph integrating global metagenomic and metaviromic sample repositories, and trained a pathogen-specialized LLM agent. Through end-to-end reinforcement learning, the agent autonomously executes multi-step reasoning workflows extracting pathogen-specific insights from complex data and generating human-readable, evidence-based reports. Application to throat swab samples demonstrates that GPAS not only accurately identifies pathogenic microorganisms but also reveals how SLE-associated immune dysregulation reshapes the respiratory microbiome and promotes pathobiont overgrowth, providing clinically instructive interpretations. By substantially lowering technical barriers to pathogen identification, GPAS offers an accessible yet powerful platform for clinical diagnostics, public health surveillance, and microbiome research. The system is freely available at: https://gpas.nh.ac.cn/.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by State Key Laboratory of Biomedical Analysis and grants from the China National Natural Science Foundation (No. 82550131, No. 81925017 and No. 82130052 to Tao Li, No. 62503496 to Hao Hong).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Samples from participants were collected in accordance with medical ethics requirements, under the approval of the Ethics Committee of National Center of Biomedical Analysis (NCBA, Approval No. AF/SC-08/02.240N and AF/SC-08/02.453N).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • ↵* E-mails: zhangxuemin{at}cashq.ac.cn(X-M.Z.), tzhou{at}ncba.ac.cn (T.Z), alli{at}ncba.ac.cn (A-L.L.) & tingtingli{at}xmail.ncba.ac.cn (T.-T.L.)

Data Availability

All data produced in the present study are available upon reasonable request to the authors

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted February 20, 2026.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
GPAS: an online AI system for rapid and accurate pathogen identification and LLM-based interpretation
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
GPAS: an online AI system for rapid and accurate pathogen identification and LLM-based interpretation
Tingting Li, Hao Hong, Duchangjiang Fan, Jin Li, Ting Li, Jiaqi Wu, Shuai Jiang, Xianxing Xie, Yawei Zhang, ManDong Hu, Xiaoyao Yin, Yizhe Zhang, Heping Ma, Zhehan Liu, Zhihui Su, Xiping Yu, Yu Liu, Hetian Yuan, Weifan Zheng, Haoyuan Liu, Mingyue Ma, Xingyue Li, Yezhuang Shen, Cheng Zhang, Yuyi Wang, Bing Zhao, Liming Sun, Qiuying Han, Jing Chen, Ke Zhang, Liang Chen, Na Wang, Weihua Li, Jianghong Man, Kun He, Fangting Dong, Fei Du, Yan Yi, Ailing Li, Tao Zhou, Xuemin Zhang, Tao Li
medRxiv 2026.02.18.26346517; doi: https://doi.org/10.64898/2026.02.18.26346517
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
GPAS: an online AI system for rapid and accurate pathogen identification and LLM-based interpretation
Tingting Li, Hao Hong, Duchangjiang Fan, Jin Li, Ting Li, Jiaqi Wu, Shuai Jiang, Xianxing Xie, Yawei Zhang, ManDong Hu, Xiaoyao Yin, Yizhe Zhang, Heping Ma, Zhehan Liu, Zhihui Su, Xiping Yu, Yu Liu, Hetian Yuan, Weifan Zheng, Haoyuan Liu, Mingyue Ma, Xingyue Li, Yezhuang Shen, Cheng Zhang, Yuyi Wang, Bing Zhao, Liming Sun, Qiuying Han, Jing Chen, Ke Zhang, Liang Chen, Na Wang, Weihua Li, Jianghong Man, Kun He, Fangting Dong, Fei Du, Yan Yi, Ailing Li, Tao Zhou, Xuemin Zhang, Tao Li
medRxiv 2026.02.18.26346517; doi: https://doi.org/10.64898/2026.02.18.26346517

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Public and Global Health
Subject Areas
All Articles
  • Addiction Medicine (576)
  • Allergy and Immunology (867)
  • Anesthesia (306)
  • Cardiovascular Medicine (4480)
  • Dentistry and Oral Medicine (449)
  • Dermatology (385)
  • Emergency Medicine (614)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1528)
  • Epidemiology (15276)
  • Forensic Medicine (31)
  • Gastroenterology (1133)
  • Genetic and Genomic Medicine (6644)
  • Geriatric Medicine (671)
  • Health Economics (1006)
  • Health Informatics (4603)
  • Health Policy (1378)
  • Health Systems and Quality Improvement (1623)
  • Hematology (544)
  • HIV/AIDS (1275)
  • Infectious Diseases (except HIV/AIDS) (15960)
  • Intensive Care and Critical Care Medicine (1111)
  • Medical Education (626)
  • Medical Ethics (147)
  • Nephrology (674)
  • Neurology (6693)
  • Nursing (346)
  • Nutrition (1006)
  • Obstetrics and Gynecology (1152)
  • Occupational and Environmental Health (961)
  • Oncology (3369)
  • Ophthalmology (988)
  • Orthopedics (370)
  • Otolaryngology (421)
  • Pain Medicine (437)
  • Palliative Medicine (131)
  • Pathology (668)
  • Pediatrics (1703)
  • Pharmacology and Therapeutics (699)
  • Primary Care Research (717)
  • Psychiatry and Clinical Psychology (5494)
  • Public and Global Health (9285)
  • Radiology and Imaging (2223)
  • Rehabilitation Medicine and Physical Therapy (1375)
  • Respiratory Medicine (1201)
  • Rheumatology (598)
  • Sexual and Reproductive Health (720)
  • Sports Medicine (535)
  • Surgery (720)
  • Toxicology (100)
  • Transplantation (290)
  • Urology (267)