Features in Pathological Voice Distortions

Sergei Katkov; Mohammad Shirdel; Antonio Liotta; Alessandro Vietti; E Sciuto; M D'amico; M Raciti; V Saita

doi:10.1109/ICHMS65439.2025.11154163

Back

Features in Pathological Voice Distortions

Conference proceeding

Peer reviewed

Features in Pathological Voice Distortions

Sergei Katkov, Mohammad Shirdel, Antonio Liotta, Alessandro Vietti, E Sciuto, M D'amico, M Raciti and V Saita

2025 IEEE 5th International Conference on Human-Machine Systems (ICHMS), pp.319-324

IEEE International Conference on Human-Machine Systems (Abu Dhabi, 26/05/2025–28/05/2025)

2025

DOI: https://doi.org/10.1109/ICHMS65439.2025.11154163

Handle:

https://hdl.handle.net/10863/50286

Abstract

Data-driven Artificial Intelligence (D2AI)

Feature importance

Pathological voice classification

environmental noise

A pathological voice is one that exhibits abnormal quality due to a dysfunction or disease of the vocal mechanism. Pathological voice classification systems enable automated detection of voice disorders and play an increasingly important role in Human-Machine Interaction (HMI) technologies. By classifying speaker attributes, including voice pathologies, these systems enhance the accessibility and adaptability of assistive devices, telemedicine platforms, and voice-controlled interfaces. However, environmental noise can significantly impact the performance of such systems, particularly in real-world settings where recording conditions are less controlled. This study investigates how varying levels of white noise affect pathological voice classification, using a dataset of real clinical recordings collected in collaboration with a hospital, as the benchmark. We analyze changes in classification accuracy, feature importance rankings, and inter-feature correlations as noise levels increase. Our findings reveal that although noise influences feature rankings, especially at noise levels above 50 dB, most of the top 10 features remain stable across different noise levels, underscoring their robustness. We also observe that training on clean data and testing on noisy data up to moderate noise levels yields similar performance to training and testing on data with the same noise levels, a finding that is crucial for real-world applicability. Furthermore, we find that adding noise increases the correlation among features, which may contribute to decreased classification performance by potentially confounding the model. These insights, derived from a dataset created with the support of a hospital and authentic pathological voice recordings, highlight the importance of considering environmental noise in developing robust HMI systems and offer guidance for feature selection and system optimization in noisy conditions.

Files and links (1)

url

https://doi.org/10.1109/ICHMS65439.2025.11154163View

Details

Title: Features in Pathological Voice Distortions
Creators: Sergei Katkov - Free University of Bozen-Bolzano
Mohammad Shirdel - Free University of Bozen-Bolzano
Antonio Liotta - Free University of Bozen-Bolzano
Alessandro Vietti - Free University of Bozen-Bolzano
E Sciuto - Ospedale Cannizzaro
M D'amico
M Raciti - Ospedale Cannizzaro
V Saita - Ospedale Cannizzaro
Publication Details: 2025 IEEE 5th International Conference on Human-Machine Systems (ICHMS), pp.319-324
ISBN: 9798331521653
EISBN: 9798331521646
Conference: IEEE International Conference on Human-Machine Systems (Abu Dhabi, 26/05/2025–28/05/2025)
Publisher: Institute of Electrical and Electronics Engineers Inc.
Format: Online
Number of pages: 6
Identifiers: 979-8-3315-2165-3
(UNIBZ)92034431
991007196440701241
Scopus ID: 2-s2.0-105017661351
Academic Unit: Faculty of Education
Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: Katkov S, Shirdel M, Liotta A, Vietti A, Sciuto E, D'amico M, Raciti M, Saita V
Additional Description: unibz-area: Data-driven Artificial Intelligence (D2AI)
MIURSSD: Sistemi di elaborazione delle informazioni
MIURSSDCODE: ING-INF/05

Metrics

1 Record Views