Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Model-based reasoning methods for diagnosis in integrative medicine based on electronic medical records and natural language processing

Wenye Geng, Xuanfeng Qin, Zhuo Wang, Qing Kong, Zihui Tang, Lin Jiang
doi: https://doi.org/10.1101/2020.07.12.20151746
Wenye Geng
1Department of integrative medicine, Fudan university Huashan hospital, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xuanfeng Qin
2Department of neurosurgery, Fudan university Huashan hospital, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zhuo Wang
3Shanghai Sunjian Informatics Technology Company Limited
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Qing Kong
1Department of integrative medicine, Fudan university Huashan hospital, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zihui Tang
1Department of integrative medicine, Fudan university Huashan hospital, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: dr_zhtang@yeah.net jianglinhappy@126.com
Lin Jiang
1Department of integrative medicine, Fudan university Huashan hospital, Shanghai, China
4Healthcare center, Fudan university Huashan hospital, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: dr_zhtang@yeah.net jianglinhappy@126.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Background This study aimed to investigate model-based reasoning (MBR) algorithms for the diagnosis of integrative medicine based on electronic medical records (EMRs) and natural language processing.

Methods A total of 14,075 medical records of clinical cases were extracted from the EMRs as the development dataset, and an external test dataset consisting of 1,000 medical records of clinical cases was extracted from independent EMRs. MBR methods based on word embedding, machine learning, and deep learning algorithms were developed for the automatic diagnosis of syndrome pattern in integrative medicine. MBR algorithms combining rule-based reasoning (RBR) were also developed. A standard evaluation metrics consisting of accuracy, precision, recall, and F1 score were used for the performance estimation of the methods. The association analyses were conducted on the sample size, number of syndrome pattern type, and diagnosis of lung diseases with the best algorithms.

Results The Word2Vec CNN MBR algorithms showed high performance (accuracy of 0.9586 in the test dataset) in the syndrome pattern diagnosis. The Word2Vec CNN MBR combined with RBR also showed high performance (accuracy of 0.9229 in the test dataset). The diagnosis of lung diseases could enhance the performance of the Word2Vec CNN MBR algorithms. Each group sample size and syndrome pattern type affected the performance of these algorithms.

Conclusion The MBR methods based on Word2Vec and CNN showed high performance in the syndrome pattern diagnosis in integrative medicine in lung diseases. The parameters of each group sample size, syndrome pattern type, and diagnosis of lung diseases were associated with the performance of the methods.

Strengths and limitations of this study

  1. A novel application of artificial intelligence – natural language processing approaches on diagnosis of integrative medicine

  2. A study of medical artificial intelligence based on real-world data of electronic medical records

  3. Multiple approaches on artificial intelligence to include traditional machine learning algorithms, neural network, and deep learning algorithms

  4. Rule-based combining model-based reasoning to be explored in this dataset

Competing Interest Statement

The authors have declared no competing interest.

Clinical Trial

NCT03274908

Funding Statement

Grants from the Institutes of Integrative Medicine of Fudan University. ClinicalTrials.gov Identifier: NCT03274908; and China Postdoctoral Science Foundation funded project (2017M611461).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The study was approved by Ethics Committee of the Huashan Hospital (approval number: HIRB-2018-166) and performed in accordance with the Declaration of Helsinki.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • Funding sources: grants from the Institutes of Integrative Medicine of Fudan University. ClinicalTrials.gov Identifier: NCT03274908; and China Postdoctoral Science Foundation funded project (2017M611461).

  • Author’s email:

  • W.G: drug{at}fudan.edu.cn

  • X.Q: qinxuanfeng777{at}163.com

  • Z.W: flezze{at}163.com

  • Q.K: kq2016829{at}163.com

  • Z.T: dr_zhtang{at}yeah.net

  • L.J: jianglinhappy{at}126.com

Data Availability

The datasets generated and/or analyzed during the current study are not publicly available due to private information but are available from the corresponding author on reasonable request. Dataset are from the study whose authors may be contacted at Center of Bioinformatics and Biostatistics, Institutes of Integrative Medicine, Fudan University. The data concerning external test dataset and an example of development of dataset were available in https://github.com/zihuitang/clincial_decision_support_system_im .

https://github.com/zihuitang/clincial_decision_support_system_im

  • Abbreviations

    ANN
    Artificial neural network
    CI
    Confidence interval
    CNN
    Convolutional neural network
    EMRs
    Electronic medical records
    XGBoost
    Extreme gradient boosting
    KNN
    K-nearest neighbor
    MBR
    Model-based reasoning
    MLP
    Multilayer perceptron
    NLP
    Natural language processing
    RF
    Random forest
    RBR
    Rule-based reasoning
    SVM
    Support vector machines
    TCM
    Traditional Chinese medicine
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted July 18, 2020.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Model-based reasoning methods for diagnosis in integrative medicine based on electronic medical records and natural language processing
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Model-based reasoning methods for diagnosis in integrative medicine based on electronic medical records and natural language processing
    Wenye Geng, Xuanfeng Qin, Zhuo Wang, Qing Kong, Zihui Tang, Lin Jiang
    medRxiv 2020.07.12.20151746; doi: https://doi.org/10.1101/2020.07.12.20151746
    Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Model-based reasoning methods for diagnosis in integrative medicine based on electronic medical records and natural language processing
    Wenye Geng, Xuanfeng Qin, Zhuo Wang, Qing Kong, Zihui Tang, Lin Jiang
    medRxiv 2020.07.12.20151746; doi: https://doi.org/10.1101/2020.07.12.20151746

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Health Informatics
    Subject Areas
    All Articles
    • Addiction Medicine (230)
    • Allergy and Immunology (507)
    • Anesthesia (111)
    • Cardiovascular Medicine (1258)
    • Dentistry and Oral Medicine (207)
    • Dermatology (148)
    • Emergency Medicine (283)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (538)
    • Epidemiology (10049)
    • Forensic Medicine (5)
    • Gastroenterology (502)
    • Genetic and Genomic Medicine (2482)
    • Geriatric Medicine (239)
    • Health Economics (482)
    • Health Informatics (1653)
    • Health Policy (757)
    • Health Systems and Quality Improvement (638)
    • Hematology (250)
    • HIV/AIDS (536)
    • Infectious Diseases (except HIV/AIDS) (11891)
    • Intensive Care and Critical Care Medicine (626)
    • Medical Education (255)
    • Medical Ethics (75)
    • Nephrology (269)
    • Neurology (2301)
    • Nursing (140)
    • Nutrition (354)
    • Obstetrics and Gynecology (458)
    • Occupational and Environmental Health (537)
    • Oncology (1258)
    • Ophthalmology (377)
    • Orthopedics (134)
    • Otolaryngology (226)
    • Pain Medicine (158)
    • Palliative Medicine (50)
    • Pathology (326)
    • Pediatrics (737)
    • Pharmacology and Therapeutics (315)
    • Primary Care Research (282)
    • Psychiatry and Clinical Psychology (2294)
    • Public and Global Health (4850)
    • Radiology and Imaging (846)
    • Rehabilitation Medicine and Physical Therapy (493)
    • Respiratory Medicine (654)
    • Rheumatology (288)
    • Sexual and Reproductive Health (241)
    • Sports Medicine (228)
    • Surgery (273)
    • Toxicology (44)
    • Transplantation (130)
    • Urology (100)