AnFiSA: An open-source computational platform for the analysis of sequencing data for rare genetic disease

J Biomed Inform. 2022 Sep:133:104174. doi: 10.1016/j.jbi.2022.104174. Epub 2022 Aug 23.

Abstract

Although genomic sequencing is rapidly moving from a bench-side tool to a routine hospital procedure, there is a noticeable lack of genomic analysis software that supports clinical workflows, research workflows, and crowdsourcing together. Furthermore, most existing software packages are not forward-compatible with regard to the ever-changing diagnostic rules adopted by the genetics community. Regular updates of genomics databases pose challenges for reproducible and traceable automated genetic diagnostic tools. Lastly, most software tools score low on explainability among clinicians. We have created a fully open-source variant curation tool, AnFiSA, with the intention of inviting and accepting contributions from clinicians, researchers, and professional software developers. The design of AnFiSA addresses these issues through three architectural principles: a multidimensional database management system (DBMS) for genomic data to address reproducibility, curated decision trees that can be adapted to changing clinical rules, and a crowdsourcing-friendly interface to address difficult-to-diagnose cases. We discuss how we chose our technology stack and describe the design and implementation of the software. Finally, we show in detail how a medical geneticist can implement selected workflows using the current version of AnFiSA.
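
To illustrate the idea of a curated decision tree that stays explainable and can be edited as clinical rules change, the following is a minimal Python sketch. It is not AnFiSA's implementation or API: the names Rule, evaluate, and RULES, the variant fields (clinvar, gnomad_af, impact), and the 0.01 frequency threshold are all hypothetical, chosen only for illustration. Rules are kept as ordered data, so a curator could add or reorder them without touching the evaluation logic, and every decision comes with a trace of the rules that were checked.

```python
from dataclasses import dataclass
from typing import Callable

# A variant is modelled as a plain dict of annotations (hypothetical field names).
Variant = dict


@dataclass(frozen=True)
class Rule:
    """One node of a linear decision tree: a named test and the decision if it matches."""
    name: str
    test: Callable[[Variant], bool]
    decision: bool  # True keeps the variant, False rejects it


def evaluate(variant: Variant, rules: list, default: bool = False):
    """Apply rules in order; return (decision, trace) so the outcome can be explained."""
    trace = []
    for rule in rules:
        matched = rule.test(variant)
        trace.append(f"{rule.name}: {'matched' if matched else 'not matched'}")
        if matched:
            return rule.decision, trace
    trace.append("no rule matched: default decision applied")
    return default, trace


# Illustrative rules only; the threshold and categories are assumptions, not clinical guidance.
RULES = [
    Rule("known pathogenic", lambda v: v.get("clinvar") == "pathogenic", True),
    Rule("common in population", lambda v: v.get("gnomad_af", 0.0) > 0.01, False),
    Rule("high-impact consequence", lambda v: v.get("impact") == "HIGH", True),
]

if __name__ == "__main__":
    decision, trace = evaluate({"gnomad_af": 0.0001, "impact": "HIGH"}, RULES)
    print("keep" if decision else "reject")
    for step in trace:
        print("  " + step)
```

Because the rule list is data rather than code, a change in diagnostic guidelines would amount to editing RULES, and the trace records which rule produced each decision, which is one simple way to approach the traceability and explainability concerns raised above.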

Keywords: Clinical genomics; Diagnostic clinical genomics; Explainable models; Genome annotation; Genome filtering; Genome sequencing; OLAP.

MeSH terms

  • Computational Biology / methods
  • Database Management Systems
  • Databases, Genetic
  • Genomics* / methods
  • Reproducibility of Results
  • Software*
  • Workflow