Abstract
In pharmaceutical research, assessing drug candidates’ odds of success as they move through clinical research often relies on crude methods based on historical data. However, the rapid progress of machine learning offers a new tool to identify the more promising projects. To evaluate its usefulness, we trained and validated several machine learning algorithms on a large database of projects. Using various project descriptors as input data we were able to predict the clinical success and failure rates of projects with an average balanced accuracy of 83% to 89%, which compares favorably with the 56% to 70% balanced accuracy of the method based on historical data. We also identified the variables that contributed most to trial success and used the algorithm to predict the success (or failure) of assets currently in the industry pipeline. We conclude by discussing how pharmaceutical companies can use such model to improve the quantity and quality of their new drugs, and how the broad adoption of this technology could reduce the industry’s risk profile with important consequences for industry structure, R&D investment, and the cost of innovation
Competing Interest Statement
We thank Evaluate Ltd. for giving us access to the EvaluatePharma data used in this study and for their valuable feedback. The views expressed in this work are those of the authors. Prof. Riccaboni and Mr. Munos have been members of the Evaluate Ltd. "Forecasting Advisory Board" 2018-2019.
Funding Statement
No external funding has been received
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
No human specific study
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
bernard.munos{at}gmail.com, janniederreiter{at}yahoo.de, massimo.riccaboni{at}imtlucca.it
We thank Evaluate Ltd. for giving us access to the EvaluatePharma® data used in this study and for their valuable feedback. The views expressed in this work are those of the authors. Prof. Riccaboni and Mr. Munos have been members of the Evaluate Ltd. “Forecasting Advisory Board” 2018-2019.
Data Availability
Data is proprietary and cannot be shared publicly