Skip to Main content Skip to Navigation
Journal articles

Vers un outillage informatique optimisé pour corpus langagiers oraux en vue d'une exploitation textométrique : le cas des interrogatives partielles dans ESLO

Abstract : To answer the increasing trend of corpora sharing and data format heterogeneity, we present a method for converting spoken language corpora to several tool formats in order to facilitate linguistic analysis. For this research, we take as an example the ESLO corpus for several reasons: its open-source licence, its standard format used for its construction, its size, and its sociolinguistic and micro-diacronic characteristics. Our study is based on a compilation of the ESLO corpus in order to make it compatible with the textometric tool TXM. We operate a set of operations to use all the possibilities the tool offers. Finally, we present a fine-grained and multidimensional analysis of the interrogatives utterances used in the ESLO corpus.
Document type :
Journal articles
Complete list of metadatas

https://halshs.archives-ouvertes.fr/halshs-03133017
Contributor : Flora Badin <>
Submitted on : Friday, February 5, 2021 - 3:42:57 PM
Last modification on : Sunday, February 21, 2021 - 3:15:17 AM

File

 Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed until : 2021-05-05

Please log in to resquest access to the document

Identifiers

  • HAL Id : halshs-03133017, version 1

Citation

Flora Badin, Loïc Liégeois, Gabriel Thiberge, Christophe Parisse. Vers un outillage informatique optimisé pour corpus langagiers oraux en vue d'une exploitation textométrique : le cas des interrogatives partielles dans ESLO. Corpus, 2021. ⟨halshs-03133017⟩

Share

Metrics

Record views

44