Combiner parseur automatique et révision manuelle pour la constitution d'un corpus arboré de parole spontanée : retour d'expérience sur le corpus ODIL_syntaxe

Abstract : This paper describes a syntactic annotation platform (Contemplata) that integrates a parser (Stanford Parser precisely) to automatically annotate written text or oral transcriptions and then allows their manual revision by an expert, in order to ease the annotation task. This tool was used in the ODIL Project to produce a phrase-structure treebank based on a corpus of spontaneous speech. In this paper, we present the annotation process that has been implemented as well as our annotation guidelines and plan to provide a demonstration of the annotation tool during the presentation.
Complete list of metadatas

Cited literature [11 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02295494
Contributor : Ilaine Wang <>
Submitted on : Wednesday, September 25, 2019 - 10:38:31 AM
Last modification on : Thursday, September 26, 2019 - 1:26:10 AM

File

LIFT_ODIL_v2.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02295494, version 2

Citation

Ilaine Wang, Aurore Pelletier, Jakub Waszczuk, Anaıs Lefeuvre-Halftermeyer, Jean-Yves Antoine, et al.. Combiner parseur automatique et révision manuelle pour la constitution d'un corpus arboré de parole spontanée : retour d'expérience sur le corpus ODIL_syntaxe. Journées scientifiques du groupement de recherche « Linguistique informatique, formelle et de terrain », Nov 2019, Orléans, France. ⟨hal-02295494v2⟩

Share

Metrics

Record views

317

Files downloads

31