Skip to Main content Skip to Navigation
Journal articles

La textométrie en question

Abstract : Considering that textometric statistical analysis of textual data is based on plain counts (which are just numeric values) of words (which are defined as rough character string tokens), this approach shows obvious limits from a linguistic point of view. Moreover, Textometry uses formal and mathematical models to analyse corpora, but results are often delivered in a very informal manner, without any quantified evaluation procedure like the ones that are applied in the Natural Language Processing field. The present paper aims at giving an in-depth understanding of the textometric methodology, so that such critical points may not be relevant anymore. Then, Textometry can meet the requirements of a linguistic-aware and scientific study of textual data.
Complete list of metadatas

Cited literature [39 references]  Display  Hide  Download

https://halshs.archives-ouvertes.fr/halshs-02902088
Contributor : Bénédicte Pincemin <>
Submitted on : Friday, July 17, 2020 - 5:53:08 PM
Last modification on : Friday, July 24, 2020 - 3:44:16 AM

File

pincemin_francaismoderne20_200...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution - NonCommercial - ShareAlike 4.0 International License

Identifiers

  • HAL Id : halshs-02902088, version 1

Citation

Bénédicte Pincemin. La textométrie en question. Le Français Moderne - Revue de linguistique Française, CILF (conseil international de la langue française), 2020, Linguistique et traitements quantitatifs, 88 (1), pp.26-43. ⟨halshs-02902088⟩

Share

Metrics

Record views

8

Files downloads

5