A multi-software integration platform and support for multimedia transcripts of language

Abstract: Using and sharing multimedia corpora is vital for research on language, but the number of available tools, which are often not easily compatible with one another, makes this difficult. Because the COLAJE project aims to use multimodal linguistic data about language development in oral and sign languages, it was necessary to create a system (VICLO) that allows sharing and using data from at least three different sources: CLAN (CHILDES), ELAN (MPI) and Praat (U. of Amsterdam). For this reason, a multi-purpose storage format based on the TEI was created. It allows us to store information coming from all of these sources, including every type of tool-specific information. When part of the information is processed by a specific piece of software, the changes are later integrated into the system without losing information specific to the other software. It is thus possible to store both the information that is shared between the different corpus editing tools and the information that is not. This common base allowed us to implement complementary features such as fine-grained participant and metadata information, and common visualisation and data-retrieval tools. VICLO is based on XML technology, and all data can be displayed in any general-purpose web browser.
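The round trip described in the abstract (export to one tool, edit, reintegrate without losing what the other tools stored) can be illustrated with a minimal sketch. This is not VICLO's implementation or its TEI schema: the `Utterance` record, the `export_for` and `merge_back` helpers, and the sample tier names are all hypothetical, chosen only to show the idea of keeping shared fields and tool-specific fields side by side.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Utterance:
    """One transcript unit in a hypothetical common store (not the VICLO format)."""
    speaker: str
    text: str
    start: float
    end: float
    # Tool-specific data, keyed by tool name (e.g. "clan", "elan", "praat").
    extras: Dict[str, Dict[str, str]] = field(default_factory=dict)


def export_for(utt: Utterance, tool: str) -> dict:
    """Return what one tool can handle: the shared fields plus its own extras."""
    return {
        "speaker": utt.speaker,
        "text": utt.text,
        "start": utt.start,
        "end": utt.end,
        "extras": dict(utt.extras.get(tool, {})),
    }


def merge_back(utt: Utterance, tool: str, edited: dict) -> Utterance:
    """Integrate one tool's edits while leaving the other tools' extras untouched."""
    merged_extras = {name: dict(data) for name, data in utt.extras.items()}
    merged_extras[tool] = dict(edited.get("extras", {}))
    return Utterance(
        speaker=edited["speaker"],
        text=edited["text"],
        start=edited["start"],
        end=edited["end"],
        extras=merged_extras,
    )


# Illustrative data only: one utterance carrying CLAN- and ELAN-specific fields.
stored = Utterance(
    "CHI", "want cookie", 1.0, 2.5,
    {"clan": {"mor": "v|want n|cookie"}, "elan": {"gesture": "points"}},
)

# Edit the ELAN view, then merge it back into the common store.
view = export_for(stored, "elan")
view["text"] = "want a cookie"
view["extras"]["gesture"] = "points at jar"
updated = merge_back(stored, "elan", view)
```

In this sketch the edited tool's shared fields and its own extras replace the stored ones, while the CLAN-specific data passes through unchanged, which is the property the abstract describes.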
Document type:
Conference paper
LREC 2010 : Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, May 2010, La Valette, Malta. pp.106-110, 2010


https://halshs.archives-ouvertes.fr/halshs-00495648
Contributor: Christophe Parisse
Submitted on: Monday, 28 June 2010 - 14:14:06
Last modified on: Tuesday, 11 October 2016 - 14:33:11
Document(s) archived on: Thursday, 30 September 2010 - 17:53:41

File

2010-3-Parisse-Morgenstern-LRE...
Files produced by the author(s)

Identifiers

  • HAL Id : halshs-00495648, version 1

Citation

Christophe Parisse, Aliyah Morgenstern. A multi-software integration platform and support for multimedia transcripts of language. LREC 2010 : Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, May 2010, La Valette, Malta. pp.106-110, 2010. <halshs-00495648>
