Fast Development of Basic NLP Tools: Towards a Lexicon and a POS Tagger for Kurmanji Kurdish - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Fast Development of Basic NLP Tools: Towards a Lexicon and a POS Tagger for Kurmanji Kurdish

Résumé

The development of basic NLP resources for minority languages is still a challenge to both formal and computational linguists. In this paper, we show how we were able to develop a medium-scale morphological lexicon for Kurmanji Kurdish in a few days time using only freely accessible resources. We also developed a preliminary POS tagger that shall be used as a pre-annotation tool for developing a POS-annotated corpus, based solely on raw text and on our morphological lexicon.
Fichier principal
Vignette du fichier
clg10kmr.pdf (76.87 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00510999 , version 1 (23-08-2010)

Licence

Paternité

Identifiants

  • HAL Id : hal-00510999 , version 1

Citer

Géraldine Walther, Benoît Sagot, Karen Fort. Fast Development of Basic NLP Tools: Towards a Lexicon and a POS Tagger for Kurmanji Kurdish. International Conference on Lexis and Grammar, Sep 2010, Belgrade, Serbia. ⟨hal-00510999⟩
437 Consultations
414 Téléchargements

Partager

Gmail Facebook X LinkedIn More