Skip to Main content Skip to Navigation
Conference papers

« Tu pourrais enregistrer un corpus pour moi ? » Pour une charte de qualité des corpus

Abstract : The time-consuming task of archiving and disseminating data is not a priority with most phoneticians. As a result, finding a suitable ready-made corpus is no easy task; researchers often rely on corpora of questionable value. Looking back at a century of speech recording, the legacy is not as extensive—and nowhere as tidy—as the layman would think. This paper calls for a " Corpus quality standard ". The argument (based on detailed examples) is that small-scale programs adhering to simple standards can actually go to build the databases we need. A quality standard would make data publication easier (thus fostering research) and allow for a smoother transition into the shelves of libraries, fulfilling the phoneticians' key role in documenting the languages of the world.
Complete list of metadatas

Cited literature [4 references]  Display  Hide  Download
Contributor : Alexis Michaud <>
Submitted on : Friday, November 24, 2017 - 6:55:33 PM
Last modification on : Wednesday, July 15, 2020 - 10:03:39 AM


Explicit agreement for this submission


Distributed under a Creative Commons Attribution - NonCommercial - ShareAlike 4.0 International License


  • HAL Id : halshs-01647020, version 1


Alexis Michaud. « Tu pourrais enregistrer un corpus pour moi ? » Pour une charte de qualité des corpus. XXIVe Journées d'Etude de la Parole, Nancy (2002), Jun 2002, Nancy, France. pp. 153-156. ⟨halshs-01647020⟩



Record views


Files downloads