Health data analysis faces the challenge of missing data and multivariate outliers, making Multiple Imputation (MI) techniques and multivariate outlier detection and correction (MODC) essential to avoid discarding patients and to ensure data quality. This research proposes an integrated pipeline to predict functional outcomes from post-stroke rehabilitation, combining MI and MODC with cross-validation ofmachine learning (ML) algorithm, using predictors collected at admission. The analysis involved 220 patients undergoing intensive rehabilitation. With random forest, a median absolute error of 9.72 points on the estimated modified Barthel Index was achieved. Compared to single imputation and MI methods integrated to ML, MI + MODC improved prognostic performances by14.82% and 8.65%, respectively.

Multiple Imputation and Multivariate Outliers Detection and Correction Integrated to Machine Learning: A Prediction of Functional Outcome in Post-Stroke Survivors / Marra, E.; Finocchi, A.; Hakiki, B.; Macchi, C.; Grippo, A.; Cecchi, F.; Grilli, L.; Mannini, A.. - STAMPA. - 32:(2024), pp. 557-561. (Intervento presentato al convegno 6th International Conference on Neurorehabilitation (ICNR 2024) tenutosi a La Granja, Spain nel November 5–8, 2024) [10.1007/978-3-031-77584-0_109].

Multiple Imputation and Multivariate Outliers Detection and Correction Integrated to Machine Learning: A Prediction of Functional Outcome in Post-Stroke Survivors

Hakiki, B.;Cecchi, F.;Grilli, L.;
2024

Abstract

Health data analysis faces the challenge of missing data and multivariate outliers, making Multiple Imputation (MI) techniques and multivariate outlier detection and correction (MODC) essential to avoid discarding patients and to ensure data quality. This research proposes an integrated pipeline to predict functional outcomes from post-stroke rehabilitation, combining MI and MODC with cross-validation ofmachine learning (ML) algorithm, using predictors collected at admission. The analysis involved 220 patients undergoing intensive rehabilitation. With random forest, a median absolute error of 9.72 points on the estimated modified Barthel Index was achieved. Compared to single imputation and MI methods integrated to ML, MI + MODC improved prognostic performances by14.82% and 8.65%, respectively.
2024
Converging Clinical and Engineering Research on Neurorehabilitation V - Proceedings of the 6th International Conference on Neurorehabilitation (ICNR 2024), November 5–8, 2024, La Granja, Spain - Volume 2
6th International Conference on Neurorehabilitation (ICNR 2024)
La Granja, Spain
November 5–8, 2024
Goal 3: Good health and well-being
Marra, E.; Finocchi, A.; Hakiki, B.; Macchi, C.; Grippo, A.; Cecchi, F.; Grilli, L.; Mannini, A.
File in questo prodotto:
File Dimensione Formato  
978-3-031-77584-0_109.pdf

Accesso chiuso

Descrizione: contributo
Tipologia: Pdf editoriale (Version of record)
Licenza: Tutti i diritti riservati
Dimensione 214.24 kB
Formato Adobe PDF
214.24 kB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1406252
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact