« Tu pourrais enregistrer un corpus pour moi ? » Pour une charte de qualité des corpus

Abstract : The time-consuming task of archiving and disseminating data is not a priority with most phoneticians. As a result, finding a suitable ready-made corpus is no easy task; researchers often rely on corpora of questionable value. Looking back at a century of speech recording, the legacy is not as extensive—and nowhere as tidy—as the layman would think. This paper calls for a " Corpus quality standard ". The argument (based on detailed examples) is that small-scale programs adhering to simple standards can actually go to build the databases we need. A quality standard would make data publication easier (thus fostering research) and allow for a smoother transition into the shelves of libraries, fulfilling the phoneticians' key role in documenting the languages of the world.
Complete list of metadatas

Cited literature [4 references]  Display  Hide  Download

https://halshs.archives-ouvertes.fr/halshs-01647020
Contributor : Alexis Michaud <>
Submitted on : Friday, November 24, 2017 - 6:55:33 PM
Last modification on : Tuesday, July 23, 2019 - 4:16:01 PM

File

Michaud2002_TuPourraisEnregist...
Explicit agreement for this submission

Licence


Distributed under a Creative Commons Attribution - NonCommercial - ShareAlike 4.0 International License

Identifiers

  • HAL Id : halshs-01647020, version 1

Citation

Alexis Michaud. « Tu pourrais enregistrer un corpus pour moi ? » Pour une charte de qualité des corpus. XXIVe Journées d'Etude de la Parole, Nancy (2002), Jun 2002, Nancy, France. pp. 153-156. ⟨halshs-01647020⟩

Share

Metrics

Record views

207

Files downloads

53