PT - JOURNAL ARTICLE AU - Aksenen, Cleber Furtado AU - Ferreira, Débora Maria Almeida AU - Jeronimo, Pedro Miguel Carneiro AU - Costa, Thais de Oliveira AU - de Souza, Ticiane Cavalcante AU - Lino, Bruna Maria Nepomuceno Sousa AU - de Farias, Allysson Allan AU - Miyajima, Fabio TI - Enhancing SARS-CoV-2 Lineage surveillance through the integration of a simple and direct qPCR-based protocol adaptation with established machine learning algorithms AID - 10.1101/2024.08.09.24310239 DP - 2024 Jan 01 TA - medRxiv PG - 2024.08.09.24310239 4099 - http://medrxiv.org/content/early/2024/08/10/2024.08.09.24310239.short 4100 - http://medrxiv.org/content/early/2024/08/10/2024.08.09.24310239.full AB - The emergence of the SARS-CoV-2 and continuous spread of its descendent lineages have posed unprecedented challenges to the global public healthcare system. Here we present an inclusive approach integrating genomic sequencing and qPCR-based protocols to increment monitoring of variant Omicron sublineages. Viral RNA samples were fast tracked for genomic surveillance following the detection of SARS-CoV-2 by diagnostic laboratories or public health network units in Ceara (Brazil) and analyzed using paired-end sequencing and integrative genomic analysis. Validation of a key structural variation was conducted with gel electrophoresis for the presence of a specific ORF7a deletion within the “BE.9” lineages. A simple intercalating dye-based qPCR assay protocol was tested and optimized through the repositioning primers from the ARTIC v.4.1 amplicon panel, which was able to distinguish between “BE.9” and “non-BE.9” lineages, particularly BQ.1. Three ML models were trained with the melting curve of the intercalating dye-based qPCR that enabled lineage assignment with elevated accuracy. Amongst them, the Support Vector Machine (SVM) model had the best performance and after fine-tuning showed ∼96.52% (333/345) accuracy in comparison to the test dataset. The integration of these methods may allow rapid assessment of emerging variants and increment molecular surveillance strategies, especially in resource-limited settings. Our approach not only provides a cost-effective alternative to complement traditional sequencing methods but also offers a scalable analytical solution for enhanced monitoring of SARS-CoV-2 variants for other laboratories through easy-to-train ML algorithms, thus contributing to global efforts in pandemic control.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study did not receive any fundingAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Ethical approval for the study was issued by the research ethics committee from "Centro de Hematologia e Hemoterapia do Ceara" (HEMOCE) part of the "Secretaria da Saude do Estado do Ceara" (SESA-CE), Brazil, under the reference number 5.290.730. As to SARS-CoV-2 molecular screening and pathogen sequencing, this was conducted as part of public health diagnostic assistance and laboratory surveillance of COVID-19 and related notifiable disease monitoring network, which our group is part of.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAll data produced in the present work are contained in the manuscript