Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the PERSEPHONE toolkit

Abstract: Automatic speech recognition tools have potential for facilitating language documentation, but in practice these tools remain little-used by linguists for a variety of reasons: the technology is still new (and evolving rapidly), user-friendly interfaces are still under development, and case studies demonstrating the practical usefulness of automatic recognition in a low-resource setting remain few. This article reports on a success story in integrating automatic transcription into the language documentation workflow, specifically for Yongning Na, a language of Southwest China. Using PERSEPHONE, an open-source toolkit, a single-speaker speech transcription tool was trained on five hours of manually transcribed speech. The experiments found that this method can achieve a remarkably low error rate (on the order of 17%), and that automatic transcriptions were useful as a canvas for the linguist. The present report is intended for linguists with little or no knowledge of speech processing. It aims to provide insights into (i) the way the tool operates and (ii) the process of collaborating with natural language processing specialists. Practical recommendations are offered on how to anticipate the requirements of this type of technology from the early stages of data collection in the field.
Document type:
Preprint, Working paper
In press: volume 12 of Language Documentation and Conservation. 2018

Cited literature: 34 references

https://halshs.archives-ouvertes.fr/halshs-01841979
Contributor: Alexis Michaud
Submitted on: Tuesday, 17 July 2018 - 17:01:57
Last modified on: Saturday, 21 July 2018 - 01:12:51

Files

Persephone_YongningNa_2018.pdf
Files produced by the author(s)

License

Distributed under a Creative Commons Attribution - NonCommercial - ShareAlike 4.0 International License

Identifiers

  • HAL Id: halshs-01841979, version 1

Citation

Alexis Michaud, Oliver Adams, Trevor Cohn, Graham Neubig, Séverine Guillaume. Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the PERSEPHONE toolkit. In press: volume 12 of Language Documentation and Conservation. 2018. 〈halshs-01841979〉

Metrics

Record views: 75

File downloads: 52