Skip to Main content Skip to Navigation
Journal articles

Vers un outillage informatique optimisé pour corpus langagiers oraux en vue d'une exploitation textométrique : le cas des interrogatives partielles dans ESLO

Abstract : To answer the increasing trend of corpora sharing and data format heterogeneity, we present a method for converting spoken language corpora to several tool formats in order to facilitate linguistic analysis. For this research, we take as an example the ESLO corpus for several reasons: its open-source licence, its standard format used for its construction, its size, and its sociolinguistic and micro-diacronic characteristics. Our study is based on a compilation of the ESLO corpus in order to make it compatible with the textometric tool TXM. We operate a set of operations to use all the possibilities the tool offers. Finally, we present a fine-grained and multidimensional analysis of the interrogatives utterances used in the ESLO corpus.
Document type :
Journal articles
Complete list of metadata

https://halshs.archives-ouvertes.fr/halshs-03133017
Contributor : Flora Badin <>
Submitted on : Friday, February 5, 2021 - 3:42:57 PM
Last modification on : Wednesday, June 2, 2021 - 4:26:50 PM
Long-term archiving on: : Friday, May 7, 2021 - 8:28:02 AM

File

revue_corpus_Badin_Thiberge_Li...
Files produced by the author(s)

Identifiers

  • HAL Id : halshs-03133017, version 1

Citation

Flora Badin, Loïc Liégeois, Gabriel Thiberge, Christophe Parisse. Vers un outillage informatique optimisé pour corpus langagiers oraux en vue d'une exploitation textométrique : le cas des interrogatives partielles dans ESLO. Corpus, 2021. ⟨halshs-03133017⟩

Share

Metrics

Record views

235

Files downloads

24