This work focuses on the automatic identification of action concepts, performed through machine learning algorithms and applied to a linguistic dataset derived from the IMAGACT ontology of actions. IMAGACT contains a set of 1,010 actions, represented by video scenes and enriched with linguistic data in several languages. In particular, each scene is linked to the complete set of verbs that can refer to the depicted action in every considered language. Starting from these data, automatic clustering of scenes has been performed using the linked lexical items as a feature set, following the idea that a similar group of verbs can refer to similar actions. Hierarchical agglomerative clustering has been performed to set up an evaluation campaign and create a gold standard of validated clusters. Then, a semi-supervised method based on Affinity Propagation was trained on these data. An evaluation of cluster coherence has been performed, reporting promising results. An interactive web version of the action map has also been created, to allow users to browse the clusters of videos.

Towards a Crosslinguistic Identification of Action Concepts. Automatic Clustering of Video Scenes Based on the IMAGACT Multilingual Ontology / Lorenzo Gregori; Massimo Moneglia; Alessandro Panunzi. - ELETTRONICO. - (2022), pp. 1-9. (Intervento presentato al convegno AREA II workshop. Annotation, Recognition and Evaluation of Action).

Towards a Crosslinguistic Identification of Action Concepts. Automatic Clustering of Video Scenes Based on the IMAGACT Multilingual Ontology

Lorenzo Gregori;Massimo Moneglia;Alessandro Panunzi
2022

Abstract

This work focuses on the automatic identification of action concepts, performed through machine learning algorithms and applied to a linguistic dataset derived from the IMAGACT ontology of actions. IMAGACT contains a set of 1,010 actions, represented by video scenes and enriched with linguistic data in several languages. In particular, each scene is linked to the complete set of verbs that can refer to the depicted action in every considered language. Starting from these data, automatic clustering of scenes has been performed using the linked lexical items as a feature set, following the idea that a similar group of verbs can refer to similar actions. Hierarchical agglomerative clustering has been performed to set up an evaluation campaign and create a gold standard of validated clusters. Then, a semi-supervised method based on Affinity Propagation was trained on these data. An evaluation of cluster coherence has been performed, reporting promising results. An interactive web version of the action map has also been created, to allow users to browse the clusters of videos.
2022
AREA II workshop. Annotation, Recognition and Evaluation of Action
AREA II workshop. Annotation, Recognition and Evaluation of Action
Lorenzo Gregori; Massimo Moneglia; Alessandro Panunzi
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1294321
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact