PT - JOURNAL ARTICLE AU - Joana M. Xavier AU - Ramiro Magno AU - Roslin Russell AU - Bernardo P. de Almeida AU - Ana Jacinta-Fernandes AU - André Duarte AU - Mark Dunning AU - Shamith Samarajiwa AU - Martin O’Reilly AU - Cátia L. Rocha AU - Nordiana Rosli AU - Bruce A. J. Ponder AU - Ana Teresa Maia TI - Mapping of <em>cis</em>-regulatory variants by differential allelic expression analysis identifies candidate risk variants and target genes of 27 breast cancer risk loci AID - 10.1101/2022.03.08.22271889 DP - 2022 Jan 01 TA - medRxiv PG - 2022.03.08.22271889 4099 - http://medrxiv.org/content/early/2022/03/10/2022.03.08.22271889.short 4100 - http://medrxiv.org/content/early/2022/03/10/2022.03.08.22271889.full AB - Background Breast cancer (BC) genome-wide association studies (GWAS) have identified hundreds of risk-loci that require novel approaches to reveal the causal variants and target genes within them. As causal variants are most likely regulators of gene expression, we hypothesize that their identification is facilitated by pinpointing the variants with greater regulatory potential within risk-loci.Methods We performed genome-wide differential allelic expression (DAE) analysis using microarrays data from 64 normal breast tissue samples. Then, we mapped the variants associated with DAE (daeQTLs) and intersected these with GWAS data to reveal candidate risk regulatory variants. Finally, we validated our approach by functionally analysing the 5q14.1 breast cancer risk-locus.Results We found widespread gene expression regulation by cis-acting variants in breast tissue, with 80% of coding and non-coding expressed genes displaying DAE (daeGenes). We identified over 23K daeQTLs for 2753 (16%) daeGenes, including at 154 known BC risk- loci. And in 31 of these risk-loci, we found risk-associated variant(s) and daeQTLs in strong linkage disequilibrium suggesting that the risk-causing variants are cis-regulatory, and in 27 risk-loci we propose 37 candidate target genes. As validation, we identified five candidate causal variants at the 5q14.1 risk-locus targeting the ATG10, RPS23, and ATP6AP1L genes, likely via modulation of miRNA binding, alternative transcription, and transcription factor binding.Conclusion Our study shows the power of DAE analysis and daeQTL mapping to identify causal regulatory variants and target genes at BC risk loci, including those with complex regulatory landscapes, and provides a genome-wide resource of variants associated with DAE for future functional studies.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was supported by national Portuguese funding through FCT - Fundação para a Ciência e a Tecnologia and CRESC ALGARVE 2020, institutional support CBMR-UID/BIM/04773/2013, POCI-01-0145-FEDER-022184 "GenomePT", the contract DL 57/2016/CP1361/CT0042 (J.M.X.) and individual postdoctoral fellowship SFRH/BPD/99502/2014 (J.M.X.). The research leading to these results has received funding from the People Programme (Marie Curie Actions) of the European Union's Seventh Framework Programme FP7/2007-2013/under REA grant agreement no. 303745 (A.T.M.), a Maratona da Saúde Award (A.T.M.) and a BCRF project Grant.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Seventy-six samples of normal breast tissue were collected from women submitted to a reduction mastectomy, for reasons unrelated to cancer, at Addenbrooke's Hospital, Cambridge, United Kingdom. Samples were collected with approval from Addenbrooke's Hospital Local Research Ethics Committee (REC reference 06/Q0108/221).I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll data produced are available online at https://github.com/maialab/DaeBreastMicroarrays. https://github.com/maialab/DaeBreastMicroarrays