Natural language processing for scalable feature engineering and ultra-high-dimensional confounding adjustment in healthcare database studies
Richard Wyss, Jie Yang, Sebastian Schneeweiss, Joseph M. Plasek, Li Zhou, Thomas Deramus, Janick G. Weberpals, Kerry Ngan, Theodore N. Tsacogianis, Kueiyu Joshua Lin
doi: https://doi.org/10.1101/2025.01.30.25321403
Richard Wyss
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
PhDJie Yang
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
PhDSebastian Schneeweiss
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
MD, PhDJoseph M. Plasek
2Division of General Internal Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
PhDLi Zhou
2Division of General Internal Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
PhDThomas Deramus
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
PhDJanick G. Weberpals
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
PhDKerry Ngan
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
MSTheodore N. Tsacogianis
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
MSKueiyu Joshua Lin
1Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
3Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
MD, PhD
Article usage
Posted January 31, 2025.
Natural language processing for scalable feature engineering and ultra-high-dimensional confounding adjustment in healthcare database studies
Richard Wyss, Jie Yang, Sebastian Schneeweiss, Joseph M. Plasek, Li Zhou, Thomas Deramus, Janick G. Weberpals, Kerry Ngan, Theodore N. Tsacogianis, Kueiyu Joshua Lin
medRxiv 2025.01.30.25321403; doi: https://doi.org/10.1101/2025.01.30.25321403
Natural language processing for scalable feature engineering and ultra-high-dimensional confounding adjustment in healthcare database studies
Richard Wyss, Jie Yang, Sebastian Schneeweiss, Joseph M. Plasek, Li Zhou, Thomas Deramus, Janick G. Weberpals, Kerry Ngan, Theodore N. Tsacogianis, Kueiyu Joshua Lin
medRxiv 2025.01.30.25321403; doi: https://doi.org/10.1101/2025.01.30.25321403
Subject Area
Subject Areas
- Addiction Medicine (576)
- Allergy and Immunology (868)
- Anesthesia (306)
- Cardiovascular Medicine (4483)
- Dermatology (385)
- Emergency Medicine (615)
- Epidemiology (15281)
- Forensic Medicine (31)
- Gastroenterology (1133)
- Genetic and Genomic Medicine (6649)
- Geriatric Medicine (671)
- Health Economics (1006)
- Health Informatics (4606)
- Health Policy (1378)
- Hematology (544)
- HIV/AIDS (1276)
- Medical Education (626)
- Medical Ethics (147)
- Nephrology (674)
- Neurology (6698)
- Nursing (346)
- Nutrition (1006)
- Obstetrics and Gynecology (1153)
- Oncology (3370)
- Ophthalmology (988)
- Orthopedics (370)
- Otolaryngology (421)
- Pain Medicine (437)
- Palliative Medicine (131)
- Pathology (669)
- Pediatrics (1704)
- Primary Care Research (717)
- Public and Global Health (9287)
- Radiology and Imaging (2225)
- Respiratory Medicine (1201)
- Rheumatology (598)
- Sports Medicine (536)
- Surgery (722)
- Toxicology (100)
- Transplantation (290)
- Urology (267)




