Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Identification of Key Influencers for Secondary Distribution of HIV Self-Testing among Chinese MSM: A Machine Learning Approach

View ORCID ProfileFengshi Jing, Yang Ye, Yi Zhou, Yuxin Ni, Xumeng Yan, Ying Lu, Jason J Ong, View ORCID ProfileJoseph D Tucker, Dan Wu, Yuan Xiong, Chen Xu, Xi He, Shanzi Huang, Xiaofeng Li, Hongbo Jiang, Cheng Wang, Wencan Dai, Liqun Huang, Wenhua Mei, View ORCID ProfileWeibin Cheng, View ORCID ProfileQingpeng Zhang, View ORCID ProfileWeiming Tang
doi: https://doi.org/10.1101/2021.04.19.21255584
Fengshi Jing
1Institute for Healthcare Artificial Intelligence, Guangdong Second Provincial General Hospital, Guangzhou, China
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
3School of Data Science, City University of Hong Kong, Hong Kong SAR, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Fengshi Jing
Yang Ye
3School of Data Science, City University of Hong Kong, Hong Kong SAR, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yi Zhou
4Zhuhai Center for Diseases Control and Prevention, Zhuhai, China
5Faculty of Medicine, Macau University of Science and Technology, Macau SAR, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuxin Ni
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xumeng Yan
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ying Lu
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jason J Ong
6Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
7Central Clinical School, Faculty of Medicine, Monash University, Melbourne, Australia
8Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joseph D Tucker
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
6Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
9Institute for Global Health and Infectious Diseases, School of Medicine, University of North Carolina at Chapel Hill, North Carolina, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Joseph D Tucker
Dan Wu
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
6Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuan Xiong
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chen Xu
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xi He
10Zhuhai Xutong Voluntary Services Center, Zhuhai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shanzi Huang
4Zhuhai Center for Diseases Control and Prevention, Zhuhai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiaofeng Li
4Zhuhai Center for Diseases Control and Prevention, Zhuhai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hongbo Jiang
11Department of Epidemiology and Biostatistics, School of Public Health, Guangdong Pharmaceutical University, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cheng Wang
12Dermatology Hospital of South Medical University, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wencan Dai
4Zhuhai Center for Diseases Control and Prevention, Zhuhai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Liqun Huang
4Zhuhai Center for Diseases Control and Prevention, Zhuhai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wenhua Mei
4Zhuhai Center for Diseases Control and Prevention, Zhuhai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Weibin Cheng
1Institute for Healthcare Artificial Intelligence, Guangdong Second Provincial General Hospital, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Weibin Cheng
Qingpeng Zhang
3School of Data Science, City University of Hong Kong, Hong Kong SAR, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Qingpeng Zhang
  • For correspondence: qingpeng.zhang@cityu.edu.hk weiming_tang@med.unc.edu
Weiming Tang
1Institute for Healthcare Artificial Intelligence, Guangdong Second Provincial General Hospital, Guangzhou, China
2University of North Carolina at Chapel Hill Project-China, Guangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Weiming Tang
  • For correspondence: qingpeng.zhang@cityu.edu.hk weiming_tang@med.unc.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background HIV self-testing (HIVST) has been rapidly scaled up and additional strategies further expand testing uptake. Secondary distribution has people (indexes) apply for multiple kits and pass these kits to people (alters) in their social networks. However, identifying key influencers is difficult. This study aimed to develop an innovative ensemble machine learning approach to identify key influencers among Chinese men who have sex with men (MSM) for HIVST secondary distribution.

Method We defined three types of key influencers: 1) key distributors who can distribute more kits; 2) key promoters who can contribute to finding first-time testing alters; 3) key detectors who can help to find positive alters. Four machine learning models (logistic regression, support vector machine, decision tree, random forest) were trained to identify key influencers. An ensemble learning algorithm was adopted to combine these four models. Simulation experiments were run to validate our approach.

Results 309 indexes distributed kits to 269 alters. Our approach outperformed human identification (self-reported scales cut-off), exceeding by an average accuracy of 11·0%, could distribute 18·2% (95%CI: 9·9%-26·5%) more kits, find 13·6% (95%CI: 1·9%-25·3%) more first-time testing alters and 12·0% (95%CI: -14·7%-38·7%) more positive-testing alters. Our approach could also increase simulated intervention efficiency by 17·7% (95%CI: -3·5%-38·8%) than human identification.

Conclusion We built machine learning models to identify key influencers among Chinese MSM who were more likely to engage in HIVST secondary distribution.

Key Findings (can also be found in Figure.2-Infographic) Our proposed ensemble machine learning approach outperformed human identification (self-reported scales cut-off) in accuracy & F1 by classification metrics and in intervention efficiency by simulation experiments. Our model could also distribute more kits, find more first-time/positive-testing alters than human identification.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study is funded by National Natural Science Foundation of China [NSFC 81903371], U.S. National Institutes of Health [NIAID K24AI143471], UNC Center for AIDS Research [NIAID 5P30AI050410], and SESH Global projects.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Our dataset is from a survey and our ethical review of biomedical research has been obtained from the Ethics Committee of Zhuhai Center for Disease Control and Prevention prior to study enrollment (Number: ZhuhaiCDC-201901). For the survey data collection, all participants have be provided online consents and sign it electronically prior to taking part in our studies.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Training set data for machine learning modeling will be made available to others after obtaining the relevant data sharing agreement and finishing the future quasi-experiment.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted April 20, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Identification of Key Influencers for Secondary Distribution of HIV Self-Testing among Chinese MSM: A Machine Learning Approach
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Identification of Key Influencers for Secondary Distribution of HIV Self-Testing among Chinese MSM: A Machine Learning Approach
Fengshi Jing, Yang Ye, Yi Zhou, Yuxin Ni, Xumeng Yan, Ying Lu, Jason J Ong, Joseph D Tucker, Dan Wu, Yuan Xiong, Chen Xu, Xi He, Shanzi Huang, Xiaofeng Li, Hongbo Jiang, Cheng Wang, Wencan Dai, Liqun Huang, Wenhua Mei, Weibin Cheng, Qingpeng Zhang, Weiming Tang
medRxiv 2021.04.19.21255584; doi: https://doi.org/10.1101/2021.04.19.21255584
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Identification of Key Influencers for Secondary Distribution of HIV Self-Testing among Chinese MSM: A Machine Learning Approach
Fengshi Jing, Yang Ye, Yi Zhou, Yuxin Ni, Xumeng Yan, Ying Lu, Jason J Ong, Joseph D Tucker, Dan Wu, Yuan Xiong, Chen Xu, Xi He, Shanzi Huang, Xiaofeng Li, Hongbo Jiang, Cheng Wang, Wencan Dai, Liqun Huang, Wenhua Mei, Weibin Cheng, Qingpeng Zhang, Weiming Tang
medRxiv 2021.04.19.21255584; doi: https://doi.org/10.1101/2021.04.19.21255584

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • HIV/AIDS
Subject Areas
All Articles
  • Addiction Medicine (215)
  • Allergy and Immunology (495)
  • Anesthesia (106)
  • Cardiovascular Medicine (1093)
  • Dentistry and Oral Medicine (195)
  • Dermatology (141)
  • Emergency Medicine (274)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (499)
  • Epidemiology (9757)
  • Forensic Medicine (5)
  • Gastroenterology (480)
  • Genetic and Genomic Medicine (2303)
  • Geriatric Medicine (222)
  • Health Economics (462)
  • Health Informatics (1553)
  • Health Policy (732)
  • Health Systems and Quality Improvement (602)
  • Hematology (236)
  • HIV/AIDS (501)
  • Infectious Diseases (except HIV/AIDS) (11631)
  • Intensive Care and Critical Care Medicine (616)
  • Medical Education (236)
  • Medical Ethics (67)
  • Nephrology (256)
  • Neurology (2139)
  • Nursing (134)
  • Nutrition (335)
  • Obstetrics and Gynecology (426)
  • Occupational and Environmental Health (517)
  • Oncology (1172)
  • Ophthalmology (363)
  • Orthopedics (128)
  • Otolaryngology (220)
  • Pain Medicine (145)
  • Palliative Medicine (50)
  • Pathology (309)
  • Pediatrics (694)
  • Pharmacology and Therapeutics (298)
  • Primary Care Research (265)
  • Psychiatry and Clinical Psychology (2172)
  • Public and Global Health (4645)
  • Radiology and Imaging (775)
  • Rehabilitation Medicine and Physical Therapy (455)
  • Respiratory Medicine (623)
  • Rheumatology (274)
  • Sexual and Reproductive Health (225)
  • Sports Medicine (208)
  • Surgery (250)
  • Toxicology (43)
  • Transplantation (120)
  • Urology (94)