Méthodologie d’harmonisation et de traitement des données orales du CÉFC [Methodology for harmonizing and processing the oral data of the CÉFC]

Abstract : The CEFC corpus brings together data from several different sources in order to make the diversity of spoken French observable, at least in part. Solving the problems inherent in the heterogeneity of these data is therefore intrinsic to building the resource and is motivated by its objective. This article describes, step by step, the methodological approach that enables us to build a homogeneous resource by pooling these different sources, so as to provide coherent automatic annotations and to facilitate the analysis of an oral corpus of several million words.
Document type : Journal articles

https://halshs.archives-ouvertes.fr/halshs-03008795
Contributor : Carole Etienne
Submitted on : Monday, November 16, 2020 - 11:07:00 PM
Last modification on : Thursday, February 25, 2021 - 9:54:05 AM

Licence


Copyright

Identifiers

  • HAL Id : halshs-03008795, version 1

Citation

Christophe Benzitoun, Carole Etienne. Méthodologie d’harmonisation et de traitement des données orales du CÉFC. Langages, Armand Colin (Larousse jusqu'en 2003), 2020, Orféo : un corpus et une plateforme pour l'étude du français contemporain, pp.39-52. ⟨halshs-03008795⟩
