Macrosyntactic corpus annotation. The case of Zaar - HAL
Preprint, Working paper. Year: 2018

Macrosyntactic corpus annotation. The case of Zaar

Abstract

This paper argues for a minimal annotation that represents, simply and concisely, the interface between information structure and syntax. It uses the concept of macrosyntax, based on illocutionary units, for a new level of annotation using existing morphosyntactic tiers in Elan. One of the main assets of this annotation system is the notion of piles, which it uses to represent the oral discursive flow and to account for dysfluencies, discontinuities and ellipses. A pilot 15,000-word corpus has been annotated in Elan for a preliminary study of the information structure of illocutionary components in Zaar, a Chadic language spoken in Nigeria. Their micro- and macro-syntactic properties are represented using Universal Dependencies Grammar.
Main file
13_CAR.pdf (1 MB)
Origin: Files produced by the author(s)

Dates and versions

halshs-01701816, version 1 (06-02-2018)

Identifiers

  • HAL Id: halshs-01701816, version 1

Cite

Bernard Caron. Macrosyntactic corpus annotation. The case of Zaar. 2018. ⟨halshs-01701816⟩
59 Views
96 Downloads
Last updated on 07/04/2024
