Developing a Large-Scale Lexicon for a Less-Resourced Language: General Methodology and Preliminary Experiments on Sorani Kurdish

Abstract: In this paper, we describe a general methodology for developing a large-scale lexicon for a less-resourced language, i.e., a language for which raw internet-based corpora and general-purpose grammars are virtually the only existing resources. We apply this methodology to the development of a morphological lexicon for Sorani Kurdish, an Iranian language mostly spoken in northern Iraq and north-western Iran. Although preliminary, our results demonstrate the relevance of this methodology.
Keywords: C-AFF
Document type:
Conference paper
Proceedings of the 7th SaLTMiL Workshop on Creation and use of basic lexical resources for less-resourced languages (LREC 2010 Workshop), 2010, Valletta, Malta. 2010

https://halshs.archives-ouvertes.fr/halshs-00751634
Contributor: Géraldine Walther
Submitted on: Wednesday, 14 November 2012 - 16:07:31
Last modified on: Thursday, 21 February 2019 - 12:52:02
Document(s) archived on: Friday, 15 February 2013 - 03:39:28

File

saltmil10soralex_1.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: halshs-00751634, version 1

Citation

Géraldine Walther, Benoît Sagot. Developing a Large-Scale Lexicon for a Less-Resourced Language: General Methodology and Preliminary Experiments on Sorani Kurdish. Proceedings of the 7th SaLTMiL Workshop on Creation and use of basic lexical resources for less-resourced languages (LREC 2010 Workshop), 2010, Valletta, Malta. 2010. 〈halshs-00751634〉
