METRICC: Harnessing Comparable Corpora for Multilingual Lexicon Development - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

METRICC: Harnessing Comparable Corpora for Multilingual Lexicon Development

Résumé

Research on comparable corpora has grown in recent years bringing about the possibility of developing multilingual lexicons through the exploitation of comparable corpora to create corpus-driven multilingual dictionaries. To date, this issue has not been widely addressed. This paper focuses on the use of the mechanism of collocational networks proposed by Williams (1998) for exploiting comparable corpora. The paper first provides a description of the METRICC project, which is aimed at the automatically creation of comparable corpora and describes one of the crawlers developed for comparable corpora building, and then discusses the power of collocational networks for multilingual corpus-driven dictionary development.
Fichier principal
Vignette du fichier
EURALEX_pp389-403_Alonso_Blancafort_De_Groc_Millon_and_Williams.pdf (408.96 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

halshs-00725224 , version 1 (24-08-2012)

Identifiants

  • HAL Id : halshs-00725224 , version 1

Citer

Araceli Alonso, Helena Blancafort, Clément de Groc, Chrystel Million, Geoffrey Williams. METRICC: Harnessing Comparable Corpora for Multilingual Lexicon Development. 15th EURALEX International Congress, Aug 2012, Oslo, Norway. pp.389-403. ⟨halshs-00725224⟩
488 Consultations
674 Téléchargements

Partager

Gmail Facebook X LinkedIn More