Webaffix: Discovering Morphological Links on the WWW - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Communication Dans Un Congrès Année : 2002

Webaffix: Discovering Morphological Links on the WWW

Nabil Hathout
Ludovic Tanguy

Résumé

This paper presents a new language-independent method for finding morphological links between newly appeared words (i.e. absent from reference word lists). Using the WWW as a corpus, the Webaffix tool detects the occurrences of new derived lexemes based on a given suffix, proposes a base lexeme following a standard scheme (such as noun-verb), and then performs a compatibility test on the word pairs produced, using the Web again, but as a source of cooccurrences. The resulting pairs of words are used to build generic morphological databases useful for a number of NLP tasks. We develop and comment an example use of Webaffix to find new noun/verb pairs in French.

Mots clés

Domaines

Linguistique
Fichier principal
Vignette du fichier
Webaffix-lrec2002.pdf (64.35 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

halshs-01322326 , version 1 (27-05-2016)

Identifiants

  • HAL Id : halshs-01322326 , version 1

Citer

Nabil Hathout, Ludovic Tanguy. Webaffix: Discovering Morphological Links on the WWW. LREC 2002, 2002, Las Palmas, Spain. ⟨halshs-01322326⟩
91 Consultations
244 Téléchargements

Partager

Gmail Facebook X LinkedIn More