ASR Systems Under Acoustic Challenges: A Multilingual Study

Sergei Katkov; Antonio Liotta; Alessandro Vietti

doi:10.1007/978-3-031-80607-0_16

Back

ASR Systems Under Acoustic Challenges: A Multilingual Study

Conference proceeding

Peer reviewed

ASR Systems Under Acoustic Challenges: A Multilingual Study

Sergei Katkov, Antonio Liotta and Alessandro Vietti

AIxIA 2024 – Advances in Artificial Intelligence: XXIIIrd International Conference of the Italian Association for Artificial Intelligence, AIxIA 2024, Bolzano, Italy, November 25–28, 2024 Proceedings, Vol.15450, pp.200-213

Lecture Notes in Computer Science, 15450

23rd International Conference of the Italian Association for Artificial Intelligence (AIxIA) (Bolzano, 25/11/2024–28/11/2024)

2025

DOI: https://doi.org/10.1007/978-3-031-80607-0_16

Handle:

https://hdl.handle.net/10863/50284

Abstract

Data-driven Artificial Intelligence (D2AI)

Automatic speech recognition

The performance of automatic speech recognition (ASR) systems in acoustically challenging environments is crucial for the effectiveness of various voice-controlled applications. This study presents an extensive experimental evaluation of the robustness of different ASR models against a range of acoustic disturbances, including white noise, reverberation, time stretch, and pitch shift. By comparing the performance of these models in English, Italian, and German, this research provides a cross-linguistic perspective. The findings reveal a significant decline in performance across all models when subjected to these audio distortions, highlighting the varying degrees of resilience across different languages. By incorporating multiple languages, this study offers valuable insights into the unique challenges and potential opportunities for enhancing ASR technologies, addressing both well-researched and less-explored linguistic domains. Our comparative study highlights that although ASRs are reaching near-human accuracy in ideal acoustic conditions, ASR performance under the whole range of distortions is still well below human performance

Files and links (1)

url

https://doi.org/10.1007/978-3-031-80607-0_16View

Details

Title: ASR Systems Under Acoustic Challenges: A Multilingual Study
Creators: Sergei Katkov - Free University of Bozen-Bolzano
Antonio Liotta - Free University of Bozen-Bolzano
Alessandro Vietti - Free University of Bozen-Bolzano
Publication Details: AIxIA 2024 – Advances in Artificial Intelligence: XXIIIrd International Conference of the Italian Association for Artificial Intelligence, AIxIA 2024, Bolzano, Italy, November 25–28, 2024 Proceedings, Vol.15450, pp.200-213
Editor(s): Artale A, Cortellessa G, Montali M
ISBN: 9783031806063
EISBN: 9783031806070
ISSN: 0302-9743
EISSN: 1611-3349
Conference: 23rd International Conference of the Italian Association for Artificial Intelligence (AIxIA) (Bolzano, 25/11/2024–28/11/2024)
Series / Volume: Lecture Notes in Computer Science
15450
Publisher: Springer Science and Business Media Deutschland GmbH
Format: Print
Number of pages: 14
Identifiers: 978-3-031-80606-3
(UNIBZ)88830235
991007196542501241
Scopus ID: 2-s2.0-85215672598
Academic Unit: Faculty of Education
Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Katkov S, Liotta A, Vietti A
Additional Description: Editors/Supervisors: Artale A, Cortellessa G, Montali M
unibz-area: Data-driven Artificial Intelligence (D2AI)
MIURSSD: Sistemi di elaborazione delle informazioni
MIURSSDCODE: ING-INF/05

Metrics

1 Record Views