Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction

Saturnino Luz, Matt Mouley Bouamrane, Masood Masoodian

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference contributionProfessional

3 Sitaatiot (Scopus)

Abstrakti

In this paper we describe the gathering of a corpus of synchronised speech and text interaction over the network. The data collection scenarios characterise audio meetings with a significant textual component. Unlike existing meeting corpora, the corpus described in this paper emphasises temporal relationships between speech and text media streams. This is achieved through detailed logging and time stamping of text editing operations, actions on shared user interface widgets and gesturing, as well as generation of speech activity profiles. A set of tools has been developed specifically for these purposes which can be used as a data collection platform for the development of meeting browsers. The data gathered to date consists of nearly 30 hours of recorded audio and time stamped editing operations and gestures.

AlkuperäiskieliEnglanti
OtsikkoProceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006
Sivut407-412
Sivumäärä6
TilaJulkaistu - 2006
OKM-julkaisutyyppiD3 Ammatillisen konferenssin julkaisusarja
TapahtumaInternational Conference on Language Resources and Evaluation - Genoa, Italia
Kesto: 22 toukokuuta 200628 toukokuuta 2006
Konferenssinumero: 5

Conference

ConferenceInternational Conference on Language Resources and Evaluation
LyhennettäLREC
Maa/AlueItalia
KaupunkiGenoa
Ajanjakso22/05/200628/05/2006

Sormenjälki

Sukella tutkimusaiheisiin 'Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä