PT - JOURNAL ARTICLE AU - Bozcuk, Hakan Şat TI - Machine Learning Assisted Web Application for Identifying Beneficial Drug Candidates for Genetic Alterations in Cancer Patients AID - 10.1101/2024.06.03.24308392 DP - 2024 Jan 01 TA - medRxiv PG - 2024.06.03.24308392 4099 - http://medrxiv.org/content/early/2024/06/04/2024.06.03.24308392.short 4100 - http://medrxiv.org/content/early/2024/06/04/2024.06.03.24308392.full AB - Background Precision medicine in oncology relies heavily on molecular genetic data, primarily obtained from Next-generation sequencing (NGS) tests. However, the complexity of these data and the need to match genetic alterations with specific drug candidates pose significant challenges for clinicians. To simplify this process, a user-friendly web application has been developed. This app facilitates the matching, graphical presentation, and clustering of treatment options for specific genetic alterations, making it easier for clinicians to interpret and apply the results in patient care.Materials and Methods Utilizing the application programming interface (API) of the Drug-Gene Interaction Database (DGIdb 4.0), a web application was developed in Python to list drugs that interact with specific genetic changes. The application features a user-friendly display achieved through graphical representation and web scraping for gene-related information. Additionally, unsupervised machine learning, specifically K-means cluster analysis, was employed to categorize drug candidates based on their interaction scores with the genetic alteration in question. To enhance the interpretability of the results, the web app also provides key references and web links to the relevant drug interactions.Results The developed web application successfully filtered, listed, and displayed the gene interaction results. Utilizing an unsupervised machine learning algorithm, the app identified three optimal clusters of drug candidates based on their efficacy potentials using the Elbow Method. The cluster analysis demonstrated strong performance, evidenced by the following metrics for BRAF mutations: a Silhouette score of 0.74, a Davies-Bouldin index of 0.44, and a Calinski-Harabasz index of 475.96. Additionally, the web app effectively extracted and defined relevant gene information and identified key references for each genetic alteration within the cloud database.Conclusion The web application developed in this study provides a user-friendly platform for classifying and interpreting drug candidates based on the presence of specific genetic alterations in cancer patients. This tool is expected to enhance the accessibility and usability of genetic data, aiding clinicians in making informed treatment decisions.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study did not receive any fundingAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Source data was openly available to the public at the website address of “https://www.dgidb.org/”.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAll data produced are available online at “https://www.dgidb.org/”. https://www.dgidb.org/