Abstract
Background HPV-associated oropharyngeal cancer (HPV+OPSCC) is the most common HPV-associated cancer in the United States yet unlike cervical cancer lacks a screening test. HPV+OPSCCs are presumed to start developing 10-15 years prior to clinical diagnosis. Circulating tumor HPV DNA (ctHPVDNA) is a sensitive and specific biomarker for HPV+OPSCC. Taken together, blood-based screening for HPV+OPSCC may be feasible years prior to diagnosis.
Methods We developed an HPV whole genome sequencing assay, HPV-DeepSeek, with 99% sensitivity and specificity at clinical diagnosis. 28 plasma samples from HPV+OPSCC patients collected 1.3-10.8 years prior to diagnosis along with 1:1 age and gender-matched controls were run on HPV-DeepSeek and an HPV serology assay.
Results 22/28 (79%) of cases and 0/28 controls screened positive for HPV+OPSCC with 100% detection within four years of diagnosis and a maximum lead time of 7.8 years. We next applied a machine learning model classifying 27/28 cases (96%) with 100% detection within 10 years. Plasma-based PIK3CA gene mutations, viral genome integration events and HPV serology were used to orthogonally validate cancer detection with 68% (19/28) of the cohort having multiple cancer signals detected. Molecular fingerprinting of HPV genomes was performed across patients demonstrating that each viral genome was unique, ruling out contamination. In patients with tumor blocks from diagnosis (15/28), molecular fingerprinting was performed within patients confirming the same viral genome across time.
Conclusions We demonstrate accurate blood-based detection of HPV-associated cancers with lead times up to 10 years before clinical cancer diagnosis and in doing so, highlight the enormous potential of ctDNA-based cancer screening.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
Daniel Faden receives salary support from NIH K23, R03 and R21 grants
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics committee/IRB of Dana Farber/ Harvard Cancer Center gave ethical approval for this work
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
The following sections were added to the manuscript. Acknowledgments Funding Source Funding for this work came from NIH/NIDCR R03DE030550 (Daniel L. Faden). Daniel L. Faden receives salary support from NIH/NIDCR K23 DE029811, NIH/NIDCR R03 DE030550 and NIH/NCI R21 CA267152. Declaration of interests Daniel L. Faden has received research funding or in-kind funding from Bristol-Myers Squibb, Calico, Predicine, BostonGene and Neogenomics. He has received consulting fees from Merck, Noetic, Chrysalis Biomedical Advisors, Arcadia and Focus. None of these sources relate to the work in this manuscritpt.
Data Availability
All data produced in the present work are contained in the manuscript