IMAGACT is a cross-linguistic ontology of action, in which action concepts are represented trough Prototypes (3D animations or brief films). The interface IMAGACT4ALL allows mother tongue informants to assign verbs of their language to each prototype and has been used to implement languages belonging to different families. The Ontology specifies the range of different actions which may fall in the extension of each action verb and the set of verbs which can identify each entry, ensuring an adequate translation to action verbs, which show high ambiguity and cross-linguistic semantic variability. A large initiative for the implementation of various Indian languages (Hindi, Urdu, Sanskrit, Bengali, Odia, Assamese, Magahi, Manipuri, Tamil) was undertaken. The paper sketches the status of the work, whose main achievement is the full implementation of Hindi/Urdu and focus on “taking events”, that are very relevant in ordinary communication, but feature strong differences in lexical encoding cross-languages. Hindi requires 7 different verbs to cover the actions extended by the general verb take. The main translator लेना(lenA) is also a general verb, but its application has specific semantic boundaries. The paper specifies how features are induced from prototypes, exploiting IMAGACT for the semantic interpretation of Hindi verbs.

“Taking Events” in Hindi. A Case Study from the Annotation of Indian Languages in IMAGACT / Massimo Moneglia, Alessandro Panunzi, Lorenzo Gregori. - ELETTRONICO. - (2018), pp. 46-51.

“Taking Events” in Hindi. A Case Study from the Annotation of Indian Languages in IMAGACT

Massimo Moneglia;Alessandro Panunzi;Lorenzo Gregori
2018

Abstract

IMAGACT is a cross-linguistic ontology of action, in which action concepts are represented trough Prototypes (3D animations or brief films). The interface IMAGACT4ALL allows mother tongue informants to assign verbs of their language to each prototype and has been used to implement languages belonging to different families. The Ontology specifies the range of different actions which may fall in the extension of each action verb and the set of verbs which can identify each entry, ensuring an adequate translation to action verbs, which show high ambiguity and cross-linguistic semantic variability. A large initiative for the implementation of various Indian languages (Hindi, Urdu, Sanskrit, Bengali, Odia, Assamese, Magahi, Manipuri, Tamil) was undertaken. The paper sketches the status of the work, whose main achievement is the full implementation of Hindi/Urdu and focus on “taking events”, that are very relevant in ordinary communication, but feature strong differences in lexical encoding cross-languages. Hindi requires 7 different verbs to cover the actions extended by the general verb take. The main translator लेना(lenA) is also a general verb, but its application has specific semantic boundaries. The paper specifies how features are induced from prototypes, exploiting IMAGACT for the semantic interpretation of Hindi verbs.
2018
979-10-95546-09-2
Proceedings of the LREC 2018 Workshop “WILDRE4 – 4th Workshop on Indian Language Data: Resources and Evaluation”
46
51
Massimo Moneglia, Alessandro Panunzi, Lorenzo Gregori
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1146564
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact