On incrementing interpretability of machine learning models from the foundations: a study on syllabic speech units

Vincenzo Norman Vitale; Loredana Schettino; Francesco Cutugno

Back

On incrementing interpretability of machine learning models from the foundations: a study on syllabic speech units

Conference proceeding

Open access

Peer reviewed

On incrementing interpretability of machine learning models from the foundations: a study on syllabic speech units

Vincenzo Norman Vitale, Loredana Schettino and Francesco Cutugno

Proceedings of the 9th Italian Conference on Computational Linguistics , pp.1-7

CEUR Workshop Proceedings

Ninth Italian Conference on Computational Linguistics - CLiC-it 2023 (Venezia, 30/11/2023–02/12/2023)

2023

Handle:

https://hdl.handle.net/10863/44838

Abstract

English. Modern ASR systems generally encode information by employing representations that favour performance indicators such as Word Error Rate (WER), making the interpretation of results and the diagnosis of any error extremely difficult if not impossible. In particular, within the context of end-to-end ASR systems, studies have been devoted to investigating the degrees of explainability of such systems by considering the use of different sets of linguistic features. This work explores the potential of different machine learning algorithms by considering features extracted from syllabic units of analysis and highlights that relying on syllabic Mel-Frequency Cepstral Coefficients increases the interpretability of complex techniques. In fact, the latter currently extract basic units in ways that are highly skewed toward operational convenience. The proposed method would reduce the need for computational resources both in training and in the inference phases, which results in economical and less time-consuming processes.

Files and links (3)

pdf

21_Vitale_et_al_CLIC2023Download View

Open Access

url

https://ceur-ws.org/Vol-3596/View

url

https://ceur-ws.org/Vol-3596/paper51.pdfView

Details

Title: On incrementing interpretability of machine learning models from the foundations: a study on syllabic speech units
Creators: Vincenzo Norman Vitale
Loredana Schettino
Francesco Cutugno
Publication Details: Proceedings of the 9th Italian Conference on Computational Linguistics , pp.1-7
Editor(s): Boschetti F, Lebani GE, Magnini B, Novielli N
ISBN: 9791255000846
ISSN: 1613-0073
Conference: Ninth Italian Conference on Computational Linguistics - CLiC-it 2023 (Venezia, 30/11/2023–02/12/2023)
Series / Volume: CEUR Workshop Proceedings
Publisher: Accademia University Press
Torino
Format: Online
Number of pages: 7
Identifiers: 9791255000846
(UNIBZ)71788078
991006939775401241
Scopus ID: 2-s2.0-85181166326
Copyright: Open Access
Academic Unit: Faculty of Education
Language: English
Resource Type: Conference proceeding
Author Names String: Vitale VN, Schettino L, Cutugno F
Additional Description: Editors/Supervisors: Boschetti F, Lebani GE, Magnini B, Novielli N

Metrics

4 File views/ downloads

1 Record Views