Ambiguity rates - HAL-SHS - Humanities and Social Sciences
Other scientific publication. Year: 1996

Ambiguity rates

Abstract

We analysed a French textual corpus in order to evaluate its rate of lexical ambiguity (the average number of lexical tags per word). Since this rate theoretically depends on the tagset and on whether compounds are delimited by tagging, the experiment was repeated with eight different tagsets. The results show that, although the information content of the tags varies widely from one tagset to another, the rate of lexical ambiguity varies little: moving from the least to the most informative tagset raises the rate only from 1.6 to 2.0 tags per word.
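
As an illustration of the measure described in the abstract, the following minimal Python sketch computes an ambiguity rate as the average number of candidate tags per token. The function name, the toy lexicons, and the tag labels are hypothetical and are not taken from the paper. The sketch also assumes that a word missing from the lexicon counts as carrying a single tag.

```python
from typing import Dict, List, Set


def ambiguity_rate(tokens: List[str], lexicon: Dict[str, Set[str]]) -> float:
    """Return the average number of candidate lexical tags per token."""
    if not tokens:
        raise ValueError("cannot compute a rate for an empty token list")
    # Assumption: a token absent from the lexicon contributes exactly one tag.
    total_tags = sum(len(lexicon.get(tok, {"UNKNOWN"})) for tok in tokens)
    return total_tags / len(tokens)


# Hypothetical toy data, for illustration only.
tokens = ["la", "porte", "est", "ferme"]

# A coarse tagset: part of speech only.
coarse_lexicon = {
    "la": {"DET", "PRO"},
    "porte": {"N", "V"},
    "est": {"N", "V"},
    "ferme": {"A", "N", "V"},
}

# A finer tagset: part of speech plus some inflectional features.
fine_lexicon = {
    "la": {"DET:fs", "PRO:3fs"},
    "porte": {"N:fs", "V:P1s", "V:P3s"},
    "est": {"N:ms", "V:P3s"},
    "ferme": {"A:s", "N:fs", "V:P1s", "V:P3s"},
}

coarse_rate = ambiguity_rate(tokens, coarse_lexicon)
fine_rate = ambiguity_rate(tokens, fine_lexicon)
print(f"coarse tagset: {coarse_rate:.2f} tags per word")
print(f"fine tagset:   {fine_rate:.2f} tags per word")
```

In this toy example the finer tagset gives a higher rate because some coarse tags split into several inflected readings. The figures it prints describe only the toy data and say nothing about the corpus or the eight tagsets studied in the paper.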
Main file
ambiguity_rates.pdf (1.67 MB)
Origin: Files produced by the author(s)

Dates and versions

halshs-00276956 , version 1 (06-05-2008)

Identifiers

  • HAL Id : halshs-00276956 , version 1

Cite

Eric Laporte, Max Silberztein. Ambiguity rates: Automatic analysis of French text corpora and computation of ambiguity rates for different tagsets. 1996. ⟨halshs-00276956⟩
