Issues underlying a common Sign Language Corpora annotation scheme - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Issues underlying a common Sign Language Corpora annotation scheme

Résumé

Corpus-based Sign Language linguistics has emerged as a new linguistic domain, and as a consequence large-scale and controlled video data repositories are under construction for different Sign Languages. Nevertheless, as pointed by (Johnston, 2008) no unified annotation scheme is yet available, which compromises any chance of comparing or reusing corpora across research teams. Another related issue is the comparability of descriptions and formalizations between SL linguistics and mainstream linguistics. In this paper, we address the issue of the definition of a common annotation scheme for Sign Language corpora annotation, distribution, exchange and comparison. In section 2. we discuss the challenge of building inter-operable corpora for corpus-based linguistics. We also examine existing annotation schemes or strategies proposed for SL linguistics. In section 3. we propose a small set of annotation tiers, based on Frame-Semantics, as a common annotation scheme. We also propose to add text-level as well as utterance-level metadata to this common annotation scheme, in order to broaden the range of future uses of SL corpora.

Domaines

Linguistique
Fichier principal
Vignette du fichier
Balvet_SL-Workshop_Issues (2).pdf (607.05 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

halshs-01077785 , version 1 (31-10-2014)

Licence

Paternité - Partage selon les Conditions Initiales

Identifiants

  • HAL Id : halshs-01077785 , version 1

Citer

Antonio Balvet. Issues underlying a common Sign Language Corpora annotation scheme. LREC 2010, May 2010, Valetta, Malta. ⟨halshs-01077785⟩
74 Consultations
59 Téléchargements

Partager

Gmail Facebook X LinkedIn More