medRxiv

Performance Analysis of Speech Recognition Models in Automated Scoring of the QuickSIN Test

Arman Hassanpour, Yan Jiang, Paula Folkeard, Ewan Macpherson, Susan D. Scollie, Vijay Parsa
doi: https://doi.org/10.1101/2025.07.25.25332211
Author affiliations

  • Arman Hassanpour (1, 2); for correspondence: ahassanp{at}uwo.ca
  • Yan Jiang (1, 2)
  • Paula Folkeard (1)
  • Ewan Macpherson (1, 2, 3)
  • Susan D. Scollie (1, 2, 3)
  • Vijay Parsa (1, 2, 3, 4)

1 National Centre for Audiology, Western University, London, Canada
2 Health and Rehabilitation Program, Faculty of Health Sciences, Western University, London, Canada
3 School of Communication Sciences and Disorders, Western University, London, Canada
4 Department of Electrical and Computer Engineering, Western University, London, Canada

Abstract

Purpose: Best practices in audiology recommend assessing speech understanding in noisy environments, especially for people with communication difficulties. Speech-in-noise (SiN) assessments such as the QuickSIN are used to validate signal processing in hearing aids (HAs) and are linked to HA satisfaction. This project seeks to improve the efficiency of QuickSIN testing by applying recent advances in automatic speech recognition (ASR).

Method: Twenty-three adults with sensorineural hearing loss were fitted bilaterally with Unitron Moxi HAs and completed the QuickSIN test in low- and high-reverberation environments. Testing was performed with two HA programs: an omnidirectional program and a fixed directional microphone program. QuickSIN sentences were presented from 0° azimuth, with competing babble presented from 0°, from one side (90° or 270°), or simultaneously from 90°, 180°, and 270°. Participants' verbal responses to the QuickSIN stimuli were scored by an audiologist and recorded in parallel for offline transcription and scoring by ASR models from Amazon, Microsoft, NVIDIA, and Picovoice. The ASR-derived QuickSIN scores were then compared with the corresponding audiologist-derived scores.
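As a rough illustration of what scoring a transcript involves, the sketch below counts key words in an ASR transcript and converts a list total to SNR loss using the standard QuickSIN rule for a six-sentence list with five key words per sentence (SNR loss = 25.5 minus total key words correct). The function names, the exact-token matching rule, and the example sentence and key words are assumptions made for illustration; they are not taken from the QuickSIN lists or from the authors' pipeline, whose matching rules are not described in the abstract.

```python
import re


def count_key_words(transcript, key_words):
    """Count key words found in a transcript (case-insensitive, whole tokens).

    Assumption: exact token matching. Homophones, plurals, and other
    near-misses that a human scorer might accept are NOT handled here.
    """
    remaining = re.findall(r"[a-z']+", transcript.lower())
    hits = 0
    for word in key_words:
        token = word.lower()
        if token in remaining:
            remaining.remove(token)  # each spoken token can be credited once
            hits += 1
    return hits


def snr_loss(correct_per_sentence):
    """SNR loss in dB for one standard six-sentence QuickSIN list."""
    if len(correct_per_sentence) != 6:
        raise ValueError("a standard list has six sentences")
    if any(not 0 <= c <= 5 for c in correct_per_sentence):
        raise ValueError("each sentence has between 0 and 5 key words correct")
    return 25.5 - sum(correct_per_sentence)


# Hypothetical placeholder key words, not an actual QuickSIN item.
keys = ["small", "dog", "ran", "green", "hill"]
full = count_key_words("The small dog ran over the green hill.", keys)
partial = count_key_words("the small frog ran over the hill", keys)
list_score = snr_loss([5, 5, 4, 3, 1, 0])
```

Here `full` is 5, `partial` is 3, and `list_score` is 7.5 dB. Crediting each transcript token at most once keeps a repeated word in the response from matching two key words.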

Results: Repeated-measures ANOVA showed that all ASR models overestimated the QuickSIN scores across most test conditions. Bland-Altman analyses, which compared each model against manual scoring by an experienced audiologist, showed that the Amazon ASR model had the least bias and the narrowest limits of agreement.
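For readers unfamiliar with Bland-Altman analysis, the following is a minimal sketch of the two quantities named above: the bias (mean of the paired differences) and the limits of agreement (bias plus or minus 1.96 standard deviations of the differences). It assumes the conventional 1.96 SD limits and simple paired scores; the abstract does not say whether the authors adjusted for repeated measures, and the function name and sample values are hypothetical, not study data.

```python
from statistics import mean, stdev


def bland_altman(asr_scores, audiologist_scores):
    """Return (bias, lower_limit, upper_limit) for paired scores.

    Differences are computed as ASR minus audiologist, so a positive
    bias means the ASR-derived scores are higher on average.
    """
    if len(asr_scores) != len(audiologist_scores) or len(asr_scores) < 2:
        raise ValueError("need at least two equal-length paired scores")
    diffs = [a - r for a, r in zip(asr_scores, audiologist_scores)]
    bias = mean(diffs)
    half_width = 1.96 * stdev(diffs)  # sample SD (n - 1 denominator)
    return bias, bias - half_width, bias + half_width


# Hypothetical SNR-loss scores in dB, for illustration only.
asr = [4.5, 6.5, 2.5, 8.5]
audiologist = [3.5, 6.5, 1.5, 7.5]
bias, lower, upper = bland_altman(asr, audiologist)
```

With these made-up values the bias is 0.75 dB and the limits of agreement are roughly -0.23 dB and 1.73 dB. A model with "the least bias and the narrowest limits of agreement" is one whose bias is closest to zero and whose interval between the lower and upper limits is smallest.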

Conclusions: Some ASR models, such as Amazon's, performed comparably to an audiologist in automatically scoring QuickSIN tests. However, further refinement is needed to make the ASR models more robust when scoring test conditions with low SNR loss.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This research was supported by the Ontario Research Fund Grant RE08-072 (PI: Dr. Susan Scollie) and the NSERC Discovery Grant to Dr. Vijay Parsa.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The Health Sciences Research Ethics Board (HSREB) of Western University gave ethical approval for this work. Approval was issued on April 23, 2024, for the study titled 'Speech in Noise Test Scoring Using Automatic Speech Recognition', Project ID 124196, Review Reference 2024-124196-91975.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data produced and analyzed in the present study, including QuickSIN scores, ASR transcriptions, and related statistical analyses, are available upon reasonable request to the corresponding author. Because of ethical constraints associated with participant privacy and institutional research ethics board (REB) approval, raw audio recordings are not publicly shared but may be made available under appropriate data sharing agreements. Supplementary performance metrics and tables are included in the manuscript and supplementary material.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Posted July 25, 2025.

Subject Area

  • Otolaryngology