This paper describes an experiment that compares the performance of a Conditional Random Fields model on identification of Multiword expressions in corpora of spoken and written Italian. The model is trained on a corpus of spoken language and a corpus of written language annotated with Multiword expressions, then tested on two other corpora (one written and one spoken). This methodology provides very good results regarding Precision.
Identification of Multiword Expressions: comparing the performance of a Conditional Random Fields model on corpora of written and spoken Italian / Manfredi, I., Gregori, L.. - ELETTRONICO. - (2023), pp. 0-0.
Identification of Multiword Expressions: comparing the performance of a Conditional Random Fields model on corpora of written and spoken Italian
Manfredi I.;Gregori L.
2023
Abstract
This paper describes an experiment that compares the performance of a Conditional Random Fields model on identification of Multiword expressions in corpora of spoken and written Italian. The model is trained on a corpus of spoken language and a corpus of written language annotated with Multiword expressions, then tested on two other corpora (one written and one spoken). This methodology provides very good results regarding Precision.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.