medRxiv

Crowdsourced feature tagging for scalable and privacy-preserved autism diagnosis

Peter Washington, Qandeel Tariq, Emilie Leblanc, Brianna Chrisman, Kaitlyn Dunlap, Aaron Kline, Haik Kalantarian, Yordan Penev, Kelley Paskov, Catalin Voss, Nathaniel Stockham, Maya Varma, Arman Husic, Jack Kent, Nick Haber, Terry Winograd, Dennis P. Wall
doi: https://doi.org/10.1101/2020.12.15.20248283
Peter Washington
1 Department of Bioengineering, Stanford University, Stanford, California, USA
Qandeel Tariq
2 Research Scientist, Amazon, Seattle, Washington, USA
Emilie Leblanc
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
Brianna Chrisman
1 Department of Bioengineering, Stanford University, Stanford, California, USA
Kaitlyn Dunlap
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
Aaron Kline
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
Haik Kalantarian
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
Yordan Penev
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
Kelley Paskov
4 Department of Biomedical Data Science, Stanford University, Stanford, California, USA
Catalin Voss
5 Department of Computer Science, Stanford University, Stanford, California, USA
Nathaniel Stockham
6 Department of Neuroscience, Stanford University, Stanford, California, USA
Maya Varma
5 Department of Computer Science, Stanford University, Stanford, California, USA
Arman Husic
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
Jack Kent
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
Nick Haber
7 Graduate School of Education, Stanford University, Stanford, California, USA
Terry Winograd
5 Department of Computer Science, Stanford University, Stanford, California, USA
Dennis P. Wall
3 Department of Pediatrics (Systems Medicine), Stanford University, Stanford, California, USA
4 Department of Biomedical Data Science, Stanford University, Stanford, California, USA
8 Department of Psychiatry and Behavioral Sciences (by courtesy), Stanford University, Stanford, California, USA
For correspondence: dpwall@stanford.edu

ABSTRACT

Standard medical diagnosis of mental health conditions often requires licensed experts, who are increasingly outnumbered by those at risk, limiting reach. We test the hypothesis that a trustworthy crowd of non-experts can efficiently label the features needed for accurate machine learning detection of autism, a common childhood developmental disorder. We implement a novel process for creating a trustworthy distributed workforce for video feature extraction, selecting 102 workers from a pool of 1,107. Two previously validated binary autism logistic regression classifiers were used to evaluate the quality of the curated crowd’s ratings on unstructured home videos. A clinically representative balanced sample of videos (N=50) was evaluated with and without face box and pitch shift privacy alterations, with AUROC and AUPRC scores >0.98. With both privacy-preserving modifications applied, sensitivity is preserved (96.0%) while specificity (80.0%) and accuracy (88.0%) are maintained at levels that exceed those of classification methods without alterations. We find that machine learning classification from features extracted by a curated non-expert crowd achieves clinical performance for pediatric autism videos and maintains acceptable performance when privacy-preserving mechanisms are applied. These results suggest that privacy-based crowdsourcing of short videos can be leveraged for rapid and mobile assessment of behavioral health.
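
As a worked illustration of how the three reported percentages relate to one another, the sketch below computes sensitivity, specificity, and accuracy from confusion-matrix counts. The counts are an assumption, not figures from the abstract: they suppose that "balanced" means 25 autism and 25 non-autism videos, in which case 24 true positives and 20 true negatives would reproduce 96.0%, 80.0%, and 88.0%. The names `ConfusionCounts` and `summarize` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts for a binary classifier (positive class = autism)."""
    tp: int  # positive videos classified as positive
    fn: int  # positive videos classified as negative
    tn: int  # negative videos classified as negative
    fp: int  # negative videos classified as positive


def summarize(c: ConfusionCounts) -> dict:
    """Return sensitivity, specificity, and accuracy as fractions."""
    total = c.tp + c.fn + c.tn + c.fp
    return {
        "sensitivity": c.tp / (c.tp + c.fn),  # true positive rate
        "specificity": c.tn / (c.tn + c.fp),  # true negative rate
        "accuracy": (c.tp + c.tn) / total,
    }


# ASSUMPTION: a 25/25 class split inferred from "balanced sample (N=50)".
# These counts are illustrative and are not reported in the abstract.
assumed = ConfusionCounts(tp=24, fn=1, tn=20, fp=5)

for name, value in summarize(assumed).items():
    print(f"{name}: {value:.1%}")
```

On a balanced sample, accuracy is the mean of sensitivity and specificity, which is why 96.0% and 80.0% together imply 88.0%.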

Competing Interest Statement

DW is the founder of Cognoa.com. This company is developing digital health solutions for pediatric care. CV, AK, and NH work as part-time consultants to Cognoa.com. All other authors declare no competing interests.

Funding Statement

This work was supported in part by funds to DPW from the National Institutes of Health (1R01EB025025-01, 1R21HD091500-01, 1R01LM013083), the National Science Foundation (Award 2014232), The Hartwell Foundation, Bill and Melinda Gates Foundation, Coulter Foundation, Lucile Packard Foundation, the Weston Havens Foundation, and program grants from Stanford's Human Centered Artificial Intelligence Program, Stanford's Precision Health and Integrated Diagnostics Center (PHIND), Stanford's Beckman Center, Stanford's Bio-X Center, Predictives and Diagnostics Accelerator (SPADA) Spectrum, Stanford's Spark Program in Translational Research, and from Stanford's Wu Tsai Neurosciences Institute's Neuroscience: Translate Program. We also acknowledge generous support from David Orr, Imma Calvo, Bobby Dekesyer and Peter Sullivan. P.W. would like to acknowledge support from Mr. Schroeder and the Stanford Interdisciplinary Graduate Fellowship (SIGF) as the Schroeder Family Goldman Sachs Graduate Fellow.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

All methods described below were carried out in accordance with global, federal, state, and university guidelines and regulations for research and reviewed and approved by the Stanford University Institutional Review Board (IRB) prior to taking place.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All code will be made available upon request.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Posted December 17, 2020.

Subject Area

  • Health Informatics