Establishing a Language by Annotating a Corpus - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Establishing a Language by Annotating a Corpus

Marine Courtin
Sylvain Kahane

Résumé

In this paper, we show that building a treebank can be used as a way to establish a language. Annotated corpus can be used as tools when arguing that some linguistic data belongs to a separate language (rather than a dialect or variety of another established language). We provide here a case study on a treebank of Naija, a Post-creole spoken in Nigeria which presents us with significant differences from treebanks of English in terms of existing constructions and frequency of several syntactic units.

Domaines

Linguistique
Fichier principal
Vignette du fichier
courtin.pdf (986.02 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

halshs-01958330 , version 1 (17-12-2018)

Identifiants

  • HAL Id : halshs-01958330 , version 1

Citer

Marine Courtin, Bernard Caron, Kim Gerdes, Sylvain Kahane. Establishing a Language by Annotating a Corpus: The Case of Naija, a Post-creole Spoken in Nigeria. annDH 2018 Annotation in Digital Humanities, Aug 2018, Sofia, Bulgaria. pp.7-11. ⟨halshs-01958330⟩
125 Consultations
129 Téléchargements

Partager

Gmail Facebook X LinkedIn More