%0 Journal Article %A Havell Markus %A Dineika Chandrananda %A Elizabeth Moore %A Florent Mouliere %A James Morris %A James Brenton %A Christopher G. Smith %A Nitzan Rosenfeld %T Refined characterization of circulating tumor DNA through biological feature integration %D 2021 %R 10.1101/2021.08.11.21261907 %J medRxiv %P 2021.08.11.21261907 %X Circulating tumor DNA (ctDNA) in blood plasma is present at very low concentrations compared to cell-free DNA (cfDNA) of non-tumor origin. To enhance ctDNA detection, recent studies have been focused on understanding the non-random fragmentation pattern of cfDNA. These studies have investigated fragment sizes, genomic position of fragment end points, and fragment end motifs. Although these features have been described and shown to be aberrant in cancer patients, there is a lack of understanding of how the individual and integrated analysis of these features enrich ctDNA fraction and enhance ctDNA detection.Using whole genome sequencing and copy number analysis of plasma samples from 5 high grade serious ovarian cancer patients, we observed that 1) ctDNA is enriched not only in fragments shorter than mono-nucleosomes (∼167bp), but also in those shorter than di-nucleosomes (∼240-330bp) (28-159% enrichment). 2) fragments that start and end at the border or within the nucleosome core are enriched in ctDNA (5-46% enrichment). 3) certain DNA motifs conserved in regions 10bp up- and down-stream of fragment ends (i.e. cleavage sites) could be used to detect tumor-derived fragments (10-44% enrichment). We further show that the integrated analysis of these three features resulted in a higher enrichment of ctDNA when compared to using fragment size alone (additional 7-25% enrichment after fragment size selection).We believe these genome wide features, which are independent of genetic mutational changes, could allow new ways to analyze and interpret cfDNA data, as significant aberrations of these features from a healthy state could improve its utility as a diagnostic biomarker.Significance In recent years circulating tumor DNA (ctDNA) has received much attention, and investment, as a biomarker that could transform the clinical care of cancer patients. Despite this, there is much that is not known about this biomarker. Recently, it has been demonstrated that the biological properties of ctDNA can be leveraged to improve ctDNA based assays. Here we build on this by carrying out an in-depth analysis of three genome wide fragmentation patterns of cell-free DNA; specifically fragment size, positioning of fragment end points with respect to nucleosome occupancy, and fragment end motifs. Whilst previous studies have described these features in an individual manner and used them as point statistics for comparison between healthy individuals and cancer patients, our study is the first to show that the individual and integrative analysis of these features can be used to enrich for ctDNA and enable enhanced ctDNA detection. The features described are independent of specific genomic alterations, with signal integrated across the breadth of the genome. This allows informative analysis that leverages a larger proportion of sequencing reads, further building the case for the use of cost-effective approaches like shallow whole genome sequencing for cancer diagnostics.Competing Interest StatementN.R. and J.B. are co-founders, present/former officers or consultants and/or shareholders of Inivata Ltd, a cancer genomics company that commercialises circulating DNA analysis. C.G.S. has consulted for Inivata Ltd. Inivata had no role in the conception, design, data collection and analysis of the study. Other co-authors have no conflict of interests.Funding StatementThis study was supported by grants from CRUK Cambridge Institute (Core Grant, A29580) and by funding from the ERC under the European Union's Seventh Framework Programme (FP/2007-2013)/ERC Grant Agreement n. 337905.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The plasma samples from cancer patients were collected at Addenbrooke Hospital, Cambridge, UK, approved by the local research ethics committee (REC reference number: 07/Q0106/63; and National Research Ethics Service Committee East of England Cambridge Central 03/018).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAnonymized data created for the study are or will be available in a persistent repository upon publication. %U https://www.medrxiv.org/content/medrxiv/early/2021/08/12/2021.08.11.21261907.1.full.pdf