Abstract
The GCNMLP is evaluated on three side effect datasets: SIDER, OFFSIDES, and FAERS. Our results show that the GCNMLP outperforms the non-negative matrix factorization (NMF) method and several well-known machine learning methods on all three datasets across various evaluation metrics. Moreover, the GCNMLP can be used to predict new side effects of drugs.
Author summary
The GCNMLP yields better drug side effect prediction, which can improve personalized medicine prescriptions.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Yin-Tzer Shih was supported by the Ministry of Science and Technology of Taiwan through projects MOST 109-2115-M-005-003-MY2 and 110-2634-F-005-006.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
We have added two datasets, OFFSIDES and FDA FAERS, to our experiments. Table 6 shows that the GCNMLP performs better on these two datasets than on the original SIDER dataset used in our previous experiments.
Data Availability
The datasets in this study were downloaded from open resources. The drug information was obtained from DrugBank Online (https://go.drugbank.com/), and the side effect information from three Adverse Drug Event (ADE) databases: the Side Effect Resource (SIDER) (http://sideeffects.embl.de/), OFFSIDES (http://www.pharmgkb.org/downloads.jsp), and the United States Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) (https://open.fda.gov/data/faers/). The side effect datasets (see github.com/yishingene/gcnmlp) contain four columns: 'drugbank_id' is the drug's identifier in DrugBank, the database from the University of Alberta; 'drugbank name' is the drug name; 'umls cui from meddra' is the Unified Medical Language System (UMLS) concept unique identifier (CUI); and 'side_effect_name' is the reported side effect.
https://github.com/timilsinamohan/sideeffects/tree/master/data
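As a rough illustration of how a four-column table like the one described above can be turned into the drug-by-side-effect matrix that factorization methods such as NMF operate on, the following minimal sketch uses only the Python standard library. It is not the authors' preprocessing code. The comma delimiter, the exact header spellings, the helper name build_association_matrix, and the sample rows are all assumptions made for illustration; the rows are placeholders, not real records.

```python
import csv
import io

# Column names as quoted in the text; the exact header spelling in the
# real files is an assumption and may need adjusting.
DRUG_COL = "drugbank_id"
EFFECT_COL = "umls cui from meddra"

# Placeholder rows for illustration only; these are not real records.
SAMPLE = """drugbank_id,drugbank name,umls cui from meddra,side_effect_name
DRUG_A,drug a,CUI_1,effect one
DRUG_A,drug a,CUI_2,effect two
DRUG_B,drug b,CUI_2,effect two
DRUG_B,drug b,CUI_2,effect two
"""


def build_association_matrix(handle):
    """Return (drug ids, side effect ids, binary matrix) from a CSV handle.

    Hypothetical helper: entry [i][j] is 1 if drug i is reported with
    side effect j, and 0 otherwise. Duplicate rows collapse to one pair.
    """
    pairs = {(row[DRUG_COL], row[EFFECT_COL]) for row in csv.DictReader(handle)}
    drugs = sorted({drug for drug, _ in pairs})
    effects = sorted({effect for _, effect in pairs})
    drug_index = {drug: i for i, drug in enumerate(drugs)}
    effect_index = {effect: j for j, effect in enumerate(effects)}

    matrix = [[0] * len(effects) for _ in drugs]
    for drug, effect in pairs:
        matrix[drug_index[drug]][effect_index[effect]] = 1
    return drugs, effects, matrix


drugs, effects, matrix = build_association_matrix(io.StringIO(SAMPLE))
print(drugs)
print(effects)
print(matrix)
```

To read an actual file instead of the inline sample, the same function can be given an open file handle, for example `open(path, newline="")`, where `path` points to a locally downloaded copy of one of the datasets.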