The voice pathology identification task has recently gained great attention. However, several research questions remain open. This study proposes an explainable AI framework to address the implicit role of age in voice pathology recognition and to investigate vocal quality improvement after surgical treatment in organic voice disorders. The aim is also to define an optimal features subset through predictor importance analysis. A set of 287 patients diagnosed with benign lesions of vocal folds (BLVF) and unilateral vocal fold paralysis (UVFP) was enrolled. Classification experiments were performed for female (F) and male (M) groups: they aimed at distinguishing BLVF from UVFP in age-unbalanced (E1) and age-balanced (E2) datasets, differentiating BLVF subclasses (E3), and detecting pre- and post-treatment conditions (E4). The comparison between E1 and E2 suggests that age does not influence the classification performance. In E1, 76% (F) and 81% (M) accuracies were obtained. The best features concerned vocal fold dynamics and articulator positioning for F and M datasets. In E3, an accuracy of 60% was achieved, suggesting that larger datasets are required. In E4, the best models showed 76% (F) and 72% (M) accuracy, with a good sensitivity in detecting pre-treatment patients. The error rate analysis proved that UVFP was the most misclassified group. Moreover, an agreement between the AI outcome and perceptual evaluations was detected for misclassified recordings. These results suggest their clinical relevance to highlight key aspects of voice quality recovery and to define acoustic parameters that otolaryngologists could employ to monitor the patient's follow-up.

Towards an explainable Artificial intelligence system for voice pathology identification and post-treatment characterisation / Calà, Federico; Frassineti, Lorenzo; Cantarella, Giovanna; Buccichini, Giulia; Battilocchi, Ludovica; Manfredi, Claudia; Lanata', Antonio. - In: BIOMEDICAL SIGNAL PROCESSING AND CONTROL. - ISSN 1746-8094. - ELETTRONICO. - 104:(2025), pp. 107530.0-107530.0. [10.1016/j.bspc.2025.107530]

Towards an explainable Artificial intelligence system for voice pathology identification and post-treatment characterisation

Calà, Federico
;
Frassineti, Lorenzo;Buccichini, Giulia;Manfredi, Claudia;Lanata', Antonio
2025

Abstract

The voice pathology identification task has recently gained great attention. However, several research questions remain open. This study proposes an explainable AI framework to address the implicit role of age in voice pathology recognition and to investigate vocal quality improvement after surgical treatment in organic voice disorders. The aim is also to define an optimal features subset through predictor importance analysis. A set of 287 patients diagnosed with benign lesions of vocal folds (BLVF) and unilateral vocal fold paralysis (UVFP) was enrolled. Classification experiments were performed for female (F) and male (M) groups: they aimed at distinguishing BLVF from UVFP in age-unbalanced (E1) and age-balanced (E2) datasets, differentiating BLVF subclasses (E3), and detecting pre- and post-treatment conditions (E4). The comparison between E1 and E2 suggests that age does not influence the classification performance. In E1, 76% (F) and 81% (M) accuracies were obtained. The best features concerned vocal fold dynamics and articulator positioning for F and M datasets. In E3, an accuracy of 60% was achieved, suggesting that larger datasets are required. In E4, the best models showed 76% (F) and 72% (M) accuracy, with a good sensitivity in detecting pre-treatment patients. The error rate analysis proved that UVFP was the most misclassified group. Moreover, an agreement between the AI outcome and perceptual evaluations was detected for misclassified recordings. These results suggest their clinical relevance to highlight key aspects of voice quality recovery and to define acoustic parameters that otolaryngologists could employ to monitor the patient's follow-up.
2025
104
0
0
Goal 3: Good health and well-being
Calà, Federico; Frassineti, Lorenzo; Cantarella, Giovanna; Buccichini, Giulia; Battilocchi, Ludovica; Manfredi, Claudia; Lanata', Antonio...espandi
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S1746809425000412-main.pdf

accesso aperto

Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 1.03 MB
Formato Adobe PDF
1.03 MB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1434094
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 4
social impact