This paper briefly introduces the Language into Act Theory (L-AcT), that proposes a pragmatic framework for the corpus-based collection and analysis of spontaneous speech. The L-AcT methodology takes the utterance (i.e. the counterpart of a speech act) as the reference unit for analysis. A set of large-scale Romance corpora has been collected in accordance with the L-AcT methodology (LABLITA Corpus, C-ORAL-ROM, C-ORAL-BRASIL, Cor-DiAL). Data for each corpus can be compared across languages, since they are built using the same corpus design, which entails a set of variation parameters relevant for representing spontaneous speech and, specifically, its pragmatic variation. LABLITA-C-ORAL corpora are text/sound aligned at the utterance level. Empirical research carried out by LABLITA has verified a systematic correspondence between stretches of speech ending with a terminal prosodic break and the accomplishment of an illocutionary force, thus identifying utterances. Within the latter, a correspondence between chunks separated by non-terminal breaks and information functions has been identified. The IPIC database was created for the cross-linguistic comparison of information structure in Romance languages. With regard to the pragmatic classification of utterances, a working repertory of illocutionary types has been established, induced empirically from pragmatic and prosodic features shared in Romance corpora.

The Language into Act Theory: A Pragmatic Approach to Speech in Real-Life / Emanuela Cresti, Lorenzo Gregori, Massimo Moneglia, Alessandro Panunzi. - ELETTRONICO. - (2018), pp. 20-25.

The Language into Act Theory: A Pragmatic Approach to Speech in Real-Life

Emanuela Cresti;Lorenzo Gregori;Massimo Moneglia;Alessandro Panunzi
2018

Abstract

This paper briefly introduces the Language into Act Theory (L-AcT), that proposes a pragmatic framework for the corpus-based collection and analysis of spontaneous speech. The L-AcT methodology takes the utterance (i.e. the counterpart of a speech act) as the reference unit for analysis. A set of large-scale Romance corpora has been collected in accordance with the L-AcT methodology (LABLITA Corpus, C-ORAL-ROM, C-ORAL-BRASIL, Cor-DiAL). Data for each corpus can be compared across languages, since they are built using the same corpus design, which entails a set of variation parameters relevant for representing spontaneous speech and, specifically, its pragmatic variation. LABLITA-C-ORAL corpora are text/sound aligned at the utterance level. Empirical research carried out by LABLITA has verified a systematic correspondence between stretches of speech ending with a terminal prosodic break and the accomplishment of an illocutionary force, thus identifying utterances. Within the latter, a correspondence between chunks separated by non-terminal breaks and information functions has been identified. The IPIC database was created for the cross-linguistic comparison of information structure in Romance languages. With regard to the pragmatic classification of utterances, a working repertory of illocutionary types has been established, induced empirically from pragmatic and prosodic features shared in Romance corpora.
2018
979-10-95546-16-0
Proceedings of the LREC 2018 Workshop “LB-ILR2018 and MMC2018 Joint Workshop”
20
25
Emanuela Cresti, Lorenzo Gregori, Massimo Moneglia, Alessandro Panunzi
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1146554
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact