HAL-SHS - Sciences de l'Homme et de la Société
Conference paper. Year: 2012

Intonosyntactic Data Structures: The Rhapsodie Treebank of Spoken French

Abstract

In this work, we present the data structures developed for the Rhapsodie project, an intonosyntactic annotation project for spoken French. Phoneticians and syntacticians work on different base units: a time-aligned sound file for the former, and a partially ordered list of tokens for the latter. The alignment between the sound file and the tokens is partial and non-trivial. We propose to encode this data with a small set of interconnected structures: lists, constituent trees, and directed acyclic graphs (DAGs). Our query language remains simple, similar to the ANNIS Query Language, because the precedence and inclusion relations are interpreted according to the requested objects and their type of alignment: the order between prosodic units is time-based, whereas the order between syntactic units is lexeme-based.
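The type-dependent ordering described in the abstract can be illustrated with a minimal Python sketch. It is not the Rhapsodie implementation: the class names `ProsodicUnit` and `SyntacticUnit`, the functions `precedes` and `includes`, and the field names are all hypothetical, and the token order is simplified here to plain integer positions, whereas the abstract describes a partially ordered list of tokens. Comparisons that mix the two unit types are deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProsodicUnit:
    """Hypothetical prosodic unit, anchored on the sound file (seconds)."""
    label: str
    start: float
    end: float


@dataclass(frozen=True)
class SyntacticUnit:
    """Hypothetical syntactic unit, anchored on token positions (inclusive)."""
    label: str
    first: int
    last: int


def precedes(a, b) -> bool:
    """Precedence chosen by unit type: time-based or token-based."""
    if isinstance(a, ProsodicUnit) and isinstance(b, ProsodicUnit):
        # Prosodic order: a ends no later than b starts.
        return a.end <= b.start
    if isinstance(a, SyntacticUnit) and isinstance(b, SyntacticUnit):
        # Syntactic order: a's last token comes before b's first token.
        return a.last < b.first
    raise TypeError("mixed comparisons are not covered by this sketch")


def includes(a, b) -> bool:
    """Inclusion chosen by unit type: a's span contains b's span."""
    if isinstance(a, ProsodicUnit) and isinstance(b, ProsodicUnit):
        return a.start <= b.start and b.end <= a.end
    if isinstance(a, SyntacticUnit) and isinstance(b, SyntacticUnit):
        return a.first <= b.first and b.last <= a.last
    raise TypeError("mixed comparisons are not covered by this sketch")
```

With this design, a query such as "A precedes B" keeps a single surface form, and the comparison actually performed depends only on the kind of units involved.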

Domains

Linguistics
Main file
rhapsodie-intonosyntax-law2012.pdf (291.41 KB)
Origin: Files produced by the author(s)

Dates and versions

halshs-01509071, version 1 (19-09-2019)

Identifiers

  • HAL Id: halshs-01509071, version 1

Cite

Kim Gerdes, Sylvain Kahane, Anne Lacheret, Arthur Truong, Paola Pietrandrea. Intonosyntactic Data Structures : The Rhapsodie Treebank of Spoken French. Sixth Linguistic Annotation Workshop (LAW VI) held in conjunction with ACL-2012, 2012, Jeju, South Korea. ⟨halshs-01509071⟩
59 views
67 downloads
