Les outils modernes pour la transcription de corpus de parole [Modern tools for the transcription of speech corpora]

Abstract : Computer tools and formats for linguistic transcription and for the annotation of linguistic corpora are reviewed. Standardization of these tools and formats will facilitate the coding, exchange, and dissemination of information.
A method for annotating corpora of spoken language, developed as part of a program to archive linguistic field recordings, is presented as an example. The method relies as far as possible on emerging standards for structured text (XML, Unicode). Data formats for both sound and annotation are discussed, along with the associated processing tools (editors, parsers, browsers).

Contributor : Michel Jacobson
Submitted on : Friday, March 10, 2006 - 5:03:15 PM
Last modification on : Tuesday, July 23, 2019 - 3:58:04 PM
Long-term archiving on : Saturday, April 3, 2010 - 10:47:30 PM


  • HAL Id : halshs-00009579, version 1


Michel Jacobson. Les outils modernes pour la transcription de corpus de parole. Revue PAROLE, Université de Mons-Hainaut, 2002, pp.213-229. ⟨halshs-00009579⟩


