Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Heterogeneity in COVID-19 severity patterns among age-gender groups: an analysis of 778 692 Mexican patients through a meta-clustering technique

View ORCID ProfileLexin Zhou, Nekane Romero, Juan Martínez-Miranda, View ORCID ProfileJ Alberto Conejero, Juan M García-Gómez, View ORCID ProfileCarlos Sáez
doi: https://doi.org/10.1101/2021.02.21.21252132
Lexin Zhou
aBiomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones (ITACA), Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia 46022, España.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lexin Zhou
Nekane Romero
aBiomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones (ITACA), Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia 46022, España.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Juan Martínez-Miranda
cCONACyT - Centro de Investigación Científica y de Educación Superior de Ensenada - CICESE-UT3, Mexico
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
J Alberto Conejero
bInstituto Universitario de Matemática Pura y Aplicada (IUMPA), Universitat Politècnica de València, Valencia, Spain.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for J Alberto Conejero
Juan M García-Gómez
aBiomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones (ITACA), Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia 46022, España.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carlos Sáez
aBiomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones (ITACA), Universitat Politècnica de València (UPV), Camino de Vera s/n, Valencia 46022, España.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Carlos Sáez
  • For correspondence: carsaesi@upv.es
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

We describe age-gender unbiased COVID-19 subphenotypes regarding severity patterns including prognostic, ICU and morbimortality outcomes, from patterns in clinical phenotypes, habits and demographic features. We used the Mexican Government COVID-19 open data including 778692 SARS-CoV-2 patient-level data as of September 2020. We applied a two-stage clustering approach combining dimensionality reduction and hierarchical clustering: 56 clusters from independent age-gender analyses supported 11 clinically distinguishable meta-clusters (MCs). MCs 1-3 showed high recovery rates (90.27-95.22%), including healthy patients of all ages, children with comorbidities with priority in medical resources, and young obese, smoker patients. MCs 4-5 showed moderate recovery rates (81.3-82.81%): patients with hypertension or diabetes of all ages, and obese patients with pneumonia, hypertension and diabetes. MCs 6-11 showed low recovery rates (53.96-66.94%): immunosuppressed patients with high comorbidity rate, CKD patients with poor survival and recovery, elderly smokers with COPD, severe diabetic elderly with hypertension, and oldest obese smokers with COPD and mild cardiovascular disease. Group outcomes conformed to the recent literature on dedicated age-gender groups. Combination of unhealthy habits and comorbidities were associated with mortality in older patients. Centenarians tended to better outcomes. Immunosuppression was not found as a relevant factor for severity alone but did when present along with CKD. Mexican states and type of clinical institution revealed relevant heterogeneity in severity, relevant for consideration in further studies. The resultant eleven MCs provide bases for a deep understanding of the epidemiological and phenotypical severity presentation of COVID-19 patients based on comorbidities, habits, demographic characteristics, and on patient provenance and type of clinical institutions, as well as revealing the correlations between the above characteristics to anticipate the possible clinical outcomes of each patient with a specific profile. These results can establish groups for automated stratification or triage towards personalized treatment enabling a personalized evaluation of the patient’s expected outcomes.

Code available at https://github.com/bdslab-upv/covid19-metaclustering

Dynamic results visualization at http://covid19sdetool.upv.es/?tab=mexicoGov

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by Universitat Politècnica de València contract no. UPV-SUB.2-1302 and FONDO SUPERA COVID-19 by CRUE-Santander Bank grant: Severity Subgroup Discovery and Classification on COVID-19 Real World Data through Machine Learning and Data Quality assessment (SUBCOVERWD-19).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Using Open Data from the Government of Mexico, terms available at: https://datos.gob.mx/libreusomx

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • ↵† Senior authors

    1. Simplify article.

    2. Figures revised.

    3. Supplemental files updated.

Data Availability

The studied sample is available in our GitHub repository.

https://github.com/bdslab-upv/covid19-metaclustering

  • Abbreviations

    COPD
    Chronic Obstructive Pulmonary Disease
    CKD
    Chronic Kidney Disease
    INMUSUPR
    Immunosuppression
    ICU
    Intensive Care Unit
    EHR
    Electronic Health Record
    ML
    Machine Learning
    DQ
    Data Quality
    RR
    Recovery Rate
    MC
    Meta-Cluster
    DIF
    National System for Integral Family Development
    IMSS
    Mexican Institute of Social Security
    ISSSTE
    Institute for Social Security and Services for State Workers
    PEMEX
    Mexican Petroleum Institution
    SEDENA
    Secretariat of the National Defense
    SEMAR
    Secretariat of the Navy
    SSA
    Secretariat of Health
    TIC
    Type of Clinical Institution
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted March 03, 2021.
    Download PDF

    Supplementary Material

    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Heterogeneity in COVID-19 severity patterns among age-gender groups: an analysis of 778 692 Mexican patients through a meta-clustering technique
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Heterogeneity in COVID-19 severity patterns among age-gender groups: an analysis of 778 692 Mexican patients through a meta-clustering technique
    Lexin Zhou, Nekane Romero, Juan Martínez-Miranda, J Alberto Conejero, Juan M García-Gómez, Carlos Sáez
    medRxiv 2021.02.21.21252132; doi: https://doi.org/10.1101/2021.02.21.21252132
    Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
    Citation Tools
    Heterogeneity in COVID-19 severity patterns among age-gender groups: an analysis of 778 692 Mexican patients through a meta-clustering technique
    Lexin Zhou, Nekane Romero, Juan Martínez-Miranda, J Alberto Conejero, Juan M García-Gómez, Carlos Sáez
    medRxiv 2021.02.21.21252132; doi: https://doi.org/10.1101/2021.02.21.21252132

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Epidemiology
    Subject Areas
    All Articles
    • Addiction Medicine (160)
    • Allergy and Immunology (412)
    • Anesthesia (90)
    • Cardiovascular Medicine (855)
    • Dentistry and Oral Medicine (156)
    • Dermatology (97)
    • Emergency Medicine (247)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (392)
    • Epidemiology (8534)
    • Forensic Medicine (4)
    • Gastroenterology (381)
    • Genetic and Genomic Medicine (1739)
    • Geriatric Medicine (167)
    • Health Economics (370)
    • Health Informatics (1234)
    • Health Policy (618)
    • Health Systems and Quality Improvement (467)
    • Hematology (196)
    • HIV/AIDS (369)
    • Infectious Diseases (except HIV/AIDS) (10271)
    • Intensive Care and Critical Care Medicine (552)
    • Medical Education (192)
    • Medical Ethics (51)
    • Nephrology (210)
    • Neurology (1666)
    • Nursing (97)
    • Nutrition (247)
    • Obstetrics and Gynecology (325)
    • Occupational and Environmental Health (450)
    • Oncology (925)
    • Ophthalmology (262)
    • Orthopedics (100)
    • Otolaryngology (172)
    • Pain Medicine (110)
    • Palliative Medicine (40)
    • Pathology (249)
    • Pediatrics (534)
    • Pharmacology and Therapeutics (246)
    • Primary Care Research (205)
    • Psychiatry and Clinical Psychology (1757)
    • Public and Global Health (3826)
    • Radiology and Imaging (622)
    • Rehabilitation Medicine and Physical Therapy (317)
    • Respiratory Medicine (518)
    • Rheumatology (207)
    • Sexual and Reproductive Health (164)
    • Sports Medicine (156)
    • Surgery (190)
    • Toxicology (36)
    • Transplantation (100)
    • Urology (74)