C-ORAL-ROM Integrated Reference Corpora For Spoken Romance Languages. Encrypted and Compressed edition / M. MONEGLIA. - Electronic. - (2005).

C-ORAL-ROM Integrated Reference Corpora For Spoken Romance Languages. Encrypted and Compressed edition

MONEGLIA, MASSIMO
2005

Abstract

C-ORAL-ROM (Integrated Reference Corpora for Spoken Romance Languages) is a multilingual language resource produced by an EU project coordinated by Massimo Moneglia and Emanuela Cresti of the University of Florence, within the IST program of the Fifth Framework Program (IST2000-26228, EUR 1.2 million in funding). The project results have been published by John Benjamins for wide academic distribution and by ELDA for use in Human Language Technology applications. The C-ORAL-ROM DVD, produced under the scientific direction of Massimo Moneglia, provides a unique set of comparable corpora of spontaneous speech for the main Romance languages: French, Italian, Portuguese and Spanish. Each corpus is built to the same design with identical sampling techniques and is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Each corpus totals 300,000 words and presents formal and informal speech across a variety of contexts of use, dialogue structures, text genres, semantic domains and speech act typologies. This multimedia resource is distributed together with a book (E. Cresti & M. Moneglia, eds., C-ORAL-ROM - Integrated Reference Corpora for Spoken Romance Languages) and is presented in the chapter attached to this record.
M. MONEGLIA
Files in this item:
File: moneglia-coral-rom-1.pdf
Access: Closed access
Type: Final refereed version (Postprint, Accepted manuscript)
License: DRM not defined
Size: 2.08 MB
Format: Adobe PDF

Documents in FLORE are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: http://hdl.handle.net/2158/243708