Health data analysis faces the challenge of missing data and multivariate outliers, making Multiple Imputation (MI) techniques and multivariate outlier detection and correction (MODC) essential to avoid discarding patients and to ensure data quality. This research proposes an integrated pipeline to predict functional outcomes from post-stroke rehabilitation, combining MI and MODC with cross-validation ofmachine learning (ML) algorithm, using predictors collected at admission. The analysis involved 220 patients undergoing intensive rehabilitation. With random forest, a median absolute error of 9.72 points on the estimated modified Barthel Index was achieved. Compared to single imputation and MI methods integrated to ML, MI + MODC improved prognostic performances by14.82% and 8.65%, respectively.
Multiple Imputation and Multivariate Outliers Detection and Correction Integrated to Machine Learning: A Prediction of Functional Outcome in Post-Stroke Survivors / Marra, E.; Finocchi, A.; Hakiki, B.; Macchi, C.; Grippo, A.; Cecchi, F.; Grilli, L.; Mannini, A.. - STAMPA. - 32:(2024), pp. 557-561. (Intervento presentato al convegno 6th International Conference on Neurorehabilitation (ICNR 2024) tenutosi a La Granja, Spain nel November 5–8, 2024) [10.1007/978-3-031-77584-0_109].
Multiple Imputation and Multivariate Outliers Detection and Correction Integrated to Machine Learning: A Prediction of Functional Outcome in Post-Stroke Survivors
Hakiki, B.;Cecchi, F.;Grilli, L.;
2024
Abstract
Health data analysis faces the challenge of missing data and multivariate outliers, making Multiple Imputation (MI) techniques and multivariate outlier detection and correction (MODC) essential to avoid discarding patients and to ensure data quality. This research proposes an integrated pipeline to predict functional outcomes from post-stroke rehabilitation, combining MI and MODC with cross-validation ofmachine learning (ML) algorithm, using predictors collected at admission. The analysis involved 220 patients undergoing intensive rehabilitation. With random forest, a median absolute error of 9.72 points on the estimated modified Barthel Index was achieved. Compared to single imputation and MI methods integrated to ML, MI + MODC improved prognostic performances by14.82% and 8.65%, respectively.File | Dimensione | Formato | |
---|---|---|---|
978-3-031-77584-0_109.pdf
Accesso chiuso
Descrizione: contributo
Tipologia:
Pdf editoriale (Version of record)
Licenza:
Tutti i diritti riservati
Dimensione
214.24 kB
Formato
Adobe PDF
|
214.24 kB | Adobe PDF | Richiedi una copia |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.