Phonetic lessons from automatic phonemic transcription: preliminary reflections on Na (Sino-Tibetan) and Tsuut’ina (Dene) data

Automatic phonemic transcription tools now reach high levels of accuracy on a single speaker with relatively small amounts of training data: on the order of 100 to 250 minutes of transcribed speech. Beyond its practical usefulness for language documentation, use of automatic transcription also yields some insights for phoneticians. The present report illustrates this by going into qualitative error analysis on two test cases, Yongning Na (Sino-Tibetan) and Tsuut’ina (Dene). Among other benefits, error analysis allows for a renewed exploration of phonetic detail: examining the output of phonemic transcription software compared with spectrographic and aural evidence. From a methodological point of view, the present report is intended as a case study in Computational Language Documentation: an interdisciplinary approach that associates fieldworkers (“diversity linguists”) and computer scientists with phoneticians/phonologists.

Keywords

Computational Language Documentation, interdisciplinarity, error analysis, machine learning, speech recognition

Domains

Linguistics

Full metadata list

Deposit format	File
Deposit type	Conference paper
Title	en Phonetic lessons from automatic phonemic transcription: preliminary reflections on Na (Sino-Tibetan) and Tsuut’ina (Dene) data
Author(s)	Alexis Michaud ¹, Oliver Adams ², Christopher Cox ³, Séverine Guillaume ¹. Affiliations: ¹ LACITO - Langues et civilisations à tradition orale (406905) - 7, rue Guy Môquet, 94800, VILLEJUIF - France; Université Sorbonne Nouvelle - Paris 3, UMR7107 (52995); Institut National des Langues et Civilisations Orientales, UMR7107 (300064); Centre National de la Recherche Scientifique, UMR7107 (441569). ² University of Melbourne (306322) - Parkville VIC 3010 - Australia. ³ University of Alberta (98298) - 116 St & 85 Ave, Edmonton, AB T6G 2R3 - Canada.
Document language	English
License	Attribution - NonCommercial - ShareAlike
Invited	No
Peer-reviewed	Yes
Date of production/writing	2019
Popular science	No
Conference or publisher URL	https://icphs2019.org/icphs2019-fullpapers/
City	Melbourne
Country	Australia
Conference title	ICPhS XIX (19th International Congress of Phonetic Sciences)
Conference start date	2019-08-05
Conference end date	2019-08-09
Collection title	Proceedings of ICPhS XIX (19th International Congress of Phonetic Sciences)
Proceedings	Yes
Publication date	2019
Audience	International
Domain(s)	Humanities and Social Sciences/Linguistics
Associated data	10.24397/pangloss-0004537#S13
ANR project(s)	Université Sorbonne Paris Cité USPC - ANR-11-IDEX-0005 IDEX - 2011
See also	http://pangloss.cnrs.fr/
Keywords	en Computational Language Documentation, interdisciplinarity, error analysis, machine learning, speech recognition

Main file

ICPhS2019_PhoneticLessons_Na_Tsuutina.pdf (2.42 MB)

Origin: Files produced by the author(s)

Contributor: Alexis Michaud

https://shs.hal.science/halshs-02059313

Submitted on: Friday, 29 March 2019 at 10:03:32

Last modified on: Tuesday, 2 April 2024 at 15:48:04

Dates and versions

halshs-02059313, version 1 (06-03-2019)

halshs-02059313, version 2 (29-03-2019)

License

Attribution - NonCommercial - ShareAlike - CC BY 4.0

Identifiers

HAL Id: halshs-02059313, version 2

Cite

Alexis Michaud, Oliver Adams, Christopher Cox, Séverine Guillaume. Phonetic lessons from automatic phonemic transcription: preliminary reflections on Na (Sino-Tibetan) and Tsuut’ina (Dene) data. ICPhS XIX (19th International Congress of Phonetic Sciences), Aug 2019, Melbourne, Australia. ⟨halshs-02059313v2⟩


Collections

CNRS UNIV-PARIS3 INALCO LACITO CAMPUS-AAR AAI USPC ASIES_ET_PACIFIQUE ANR

356 views

651 downloads

Last updated on 7 April 2024