Automatic understanding of unwritten languages. Melbourne: The University of Melbourne, 2017. ,
Evaluating phonemic transcription of low-resource tonal languages for language documentation, Proceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), pp.3356-3365, 2018. ,
URL : https://hal.archives-ouvertes.fr/halshs-01709648
Cross-lingual word embeddings for low-resource language modeling, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, vol.1, pp.937-947, 2017. ,
13 Possible business models include collaboration between several institutions and funding agencies, or crowdfunding: joint support from a great number of institutions (typically, universities), following the model of the publishing house Language Science Press, a sort of "cooperative" which was successfully set up within a few years and currently operates with only two permanent staff.
Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions, Proceedings of the International Workshop on Spoken Language Translation (IWSLT 2015). Da Nang, 2015. ,
Breaking the unwritten language barrier: The BULB Project, Procedia Computer Science, vol.81, pp.8-14, 2016. ,
URL : https://hal.archives-ouvertes.fr/halshs-01428027
De la reconnaissance automatique de la parole à l'analyse linguistique de corpus oraux, Actes des XXVIe Journées d'Etude de la Parole, pp.389-400, 2006. ,
Unsupervised language model adaptation, Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.224-227, 2003. ,
Automatic speech recognition for under-resourced languages: A survey, Speech Communication, vol.56, pp.85-100, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00953644
Machine translation for language preservation, The COLING 2012 Organizing Committee, pp.125-134, 2012. ,
Aikuma: A mobile app for collaborative language documentation, Proceedings of the 2014 Workshop on the Use of Computational Methods in the Study of Endangered Languages, pp.1-5, 2014. ,
Parallel speech collection for under-resourced language studies using the LIG-AIKUMA mobile device app, Procedia Computer Science, vol.81, pp.61-66, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01350065
Endangered sound patterns: Three perspectives on theory and description, Language Documentation & Conservation, vol.1, issue.1, pp.1-16, 2007. ,
Language documentation meets language technology, Proceedings of the First International Workshop on Computational Linguistics for Uralic Languages, pp.8-18, 2015. ,
Vers des ressources électroniques interconnectées: Lexica, les dictionnaires de la collection Pangloss, pp.48-51, 2017.
Enquête et description des langues à tradition orale. Volume I: l'enquête de terrain et l'analyse grammaticale, Société d'Études Linguistiques et Anthropologiques de France, 1971. ,
Effects of lexical frequency and lexical category on the duration of Vietnamese syllables, Proceedings of ICPhS XVIII, 2015. ,
Endangered language documentation: Bootstrapping a Chatino speech corpus, forced aligner, ASR, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), pp.4004-4011, 2016. ,
Laurent Romary & Eveline Wandl-Vogt. 2015. Going digital: Creating change in the Humanities
Phonology, tone and the functions of tone in San Juan Quiahije Chatino. Austin: University of Texas at Austin. Doctoral dissertation, 2011. ,
Using automatic alignment to analyze endangered language data: Testing the viability of untrained alignment, Las Memorias del Congreso de Idiomas Indígenas de Latinoamérica-II, vol.134, pp.2235-2246, 2006. ,
Field linguistics: A minor manual, Sprachtypologie und Universalienforschung, vol.60, issue.1, pp.12-31, 2007. ,
Mining parallel data from comparable corpora via triangulation, Proceedings of the International Conference on Asian Language Processing, pp.185-188, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00959145
Towards the automatic processing of Yongning Na (Sino-Tibetan): Developing a "light" acoustic model of the target language and testing "heavyweight" models from five national languages, Proceedings of the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages, pp.153-160, 2014. ,
URL : https://hal.archives-ouvertes.fr/halshs-00980431
The two-level tonal system of Lataddi Narua, Linguistics of the Tibeto-Burman Area, vol.39, issue.1, pp.67-104, 2016. ,
Un procédé électrique percutané d'inscription de l'accolement glottique au cours de la phonation: Glottographie de haute fréquence, pp.66-69, 1957. ,
Acoustic theory of speech production, with calculations based on X-ray studies of Russian articulations, 1960. ,
Formation des registres et mutations consonantiques dans les langues mon-khmer, Mon-Khmer Studies, vol.8, pp.1-76, 1979. ,
Corpus linguistics or computer-aided armchair linguistics, Directions in corpus linguistics: Proceedings of Nobel Symposium, vol.82, pp.35-60, 1992. ,
First applications of a new laryngograph, Medical and Biological Illustration, vol.21, pp.172-182, 1971. ,
Interdependence between tones, segments and phonation types in Shanghai Chinese, 2015. ,
Speaker diarization using deep neural network embeddings, Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4930-4934, 2017. ,
Domain-initial strengthening on French vowels and phonological contrasts: Evidence from lip articulation and spectral variation, Journal of Phonetics, vol.46, pp.128-146, 2014. ,
URL : https://hal.archives-ouvertes.fr/halshs-01402718
EasyAlign: An automatic phonetic alignment tool under Praat, Proceedings of the 12th Annual Conference of the International Speech Communication Association, pp.3233-3236, 2011. ,
Speech recognition with deep recurrent neural networks, Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.6645-6649, 2013. ,
DOI : 10.1109/icassp.2013.6638947
URL : http://learning.cs.toronto.edu/~hinton/absps/RNN13.pdf
The efficacy of human post-editing for language translation, Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp.439-448, 2013. ,
The dark side of Digital Humanities: Dispatches from two recent MLA conventions, Differences, vol.25, issue.1, pp.79-92, 2014. ,
Jeroen van den Hoven, Roberto V. Zicari & Andrej Zwitter. 2017. Will democracy survive Big Data and Artificial Intelligence?, Scientific American ,
Building tone resources for second language learners from phonetic documentation: Cherokee examples. Language Documentation & Conservation 11, pp.289-304, 2017. ,
Journal of Sino-Tibetan Linguistics 3, pp.30-55, 2009. ,
Decisions and mechanisms in exemplar-based phonology, Experimental approaches to phonology, pp.25-40, 2007. ,
Forced alignment for understudied language varieties: Testing Prosodylab-Aligner with Tongan data. Language Documentation & Conservation 12, pp.80-123, 2018. ,
SailAlign: Robust long speech-text alignment, Proceedings of the Workshop on New Tools and Methods for Very-Large Scale Phonetics Research, 2011. ,
A summary of the REVERB challenge: State-of-the-art and remaining challenges in reverberant speech processing research, EURASIP Journal on Advances in Signal Processing, vol.2016, issue.1, 2016. ,
Modeling under-resourced languages for speech recognition, Language Resources and Evaluation, vol.51, issue.4, pp.961-987, 2017. ,
DOI : 10.1007/s10579-016-9336-9
Design of the CMU sphinx-4 decoder, Proceedings of the 8th European Conference on Speech Communication and Technology, 2003. ,
Languages of China: An Ethnologue country report 19th edition. Dallas: SIL International, 2016.
The first DIHARD speech diarization challenge, 2018. ,
A descriptive grammar of Yongning Na (Mosuo), 2010. ,
Yongning Na (Mosuo): Language documentation in the Sino-Tibetan borderland, Presented at the International Conference on Sino-Tibetan Languages and Linguistics, pp.403-439, 1990. ,
Overlapping speech detection using high-level information features, (Science and Technology), vol.57, issue.1, pp.79-83, 2017. ,
Documenting and researching endangered languages: The Pangloss Collection. Language Documentation & Conservation 8, pp.119-135, 2014. ,
URL : https://hal.archives-ouvertes.fr/halshs-01003734
Tone in Yongning Na: Lexical tones and morphotonology (Studies in Diversity Linguistics 13), 2017. ,
URL : https://hal.archives-ouvertes.fr/halshs-01094049
Speech recognition for newly documented languages: Highly encouraging tests using automatically generated phonemic transcription of Yongning Na audio recordings. HimalCo-Himalayan Corpora, 2017. ,
Combining documentation and research: Ongoing work on an endangered language, Proceedings of IALP 2012 (2012 International Conference on Asian Language Processing), pp.169-172, 2012. ,
DOI : 10.1109/ialp.2012.32
URL : https://hal.archives-ouvertes.fr/halshs-00731261
The phonology of Laze: Phonemic analysis, syllabic inventory, and a short word list, Yuyanxue Luncong, vol.45, pp.196-230, 2012. ,
URL : https://hal.archives-ouvertes.fr/halshs-00582639
Tone and intonation: Introductory notes and practical recommendations. KALIPHO-Kieler Arbeiten zur Linguistik und Phonetik 3, pp.43-80, 2015. ,
URL : https://hal.archives-ouvertes.fr/halshs-01091477
Click reduction in fluent speech: A semiautomated analysis of Mangetti Dune !Xung, Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages, pp.107-115, 2017. ,
Methods for interpreting and understanding deep neural networks, Digital Signal Processing, vol.73, pp.1-15, 2017. ,
DOI : 10.1016/j.dsp.2017.10.011
URL : https://doi.org/10.1016/j.dsp.2017.10.011
Learning a language model from continuous speech, Proceedings of the Eleventh Annual Conference of the International Speech Communication Association (Interspeech 2010), pp.1053-1056, 2010. ,
Linguistic fieldwork, 2001. ,
Perception of phonetic detail in the identification of highly reduced words, Journal of Phonetics, vol.39, issue.3, pp.319-329, 2011. ,
Speech data acquisition: The underestimated challenge. KALIPHO-Kieler Arbeiten zur Linguistik und Phonetik 3, pp.1-42, 2015. ,
URL : https://hal.archives-ouvertes.fr/halshs-01026295
Ayoker & Timothy Mills, Shilluk. Journal of the International Phonetic Association, vol.41, issue.1, pp.111-125, 2011. ,
Regulating artificial intelligence systems: Risks, challenges, competencies, and strategies, Harvard Journal of Law & Technology, vol.29, issue.2, 2016. ,
DOI : 10.2139/ssrn.2609777
Language-independent and language-adaptive acoustic modeling for speech recognition, Speech Communication, vol.35, pp.31-51, 2001. ,
DOI : 10.1016/s0167-6393(00)00094-7
URL : http://www.ri.cmu.edu/pub_files/pub3/schultz_tanja_2001_3/schultz_tanja_2001_3.pdf
An investigation of deep neural networks for noise robust speech recognition, Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7398-7402, 2013. ,
DOI : 10.1109/icassp.2013.6639100
Production and perception of speaker-specific phonetic detail at word boundaries, Journal of Phonetics, vol.40, issue.2, pp.213-233, 2012. ,
DOI : 10.1016/j.wocn.2011.11.003
URL : http://eprints.gla.ac.uk/45862/1/45862.pdf
Linguistic fieldwork: A practical guide. Language Documentation & Conservation 5. 66-68, 2008. ,
Efficient speech transcription through respeaking, Proceedings of Interspeech 2013, pp.1087-1091, 2013. ,
Satoshi Nakamura & Alex Waibel, Speech Communication, 2017. ,
Towards automatic speech recognition without pronunciation dictionary, transcribed speech and text resources in the target language using cross-lingual word-to-phoneme alignment, Proceedings of the International Workshop on Spoken Language Technologies for Under-Resourced Languages, 2014. ,
DOI : 10.1016/j.csl.2014.10.001
Acoustic phonetics, 1998. ,
DOI : 10.1121/1.1327577
Untrained forced alignment of transcriptions and audio for language documentation corpora using WebMAUS, Proceedings of the Ninth International Conference on Language Resources and Evaluation, pp.3940-3947, 2014. ,
Language Documentation & Conservation 11, 2017. ,
Assessing annotated corpora as research output, Australian Journal of Linguistics, vol.36, issue.1, pp.1-21, 2016. ,
DOI : 10.1080/07268602.2016.1109428
Doing great things with small languages (Australian Research Council grant DP0984419), 2006. ,
On the acoustic and perceptual characterization of reference vowels in a cross-language perspective, Proceedings of ICPhS XVII, 2011. ,
Proposals for a representation of sounds based on their main acoustico-perceptual properties, Tones and features, pp.306-330, 2011. ,
Comparison of acoustic model adaptation techniques on non-native speech, Proceedings of the 2003 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.540-543, 2003. ,
Bringing user-centered design to the field of language archives. Language Documentation & Conservation 10, pp.641-681, 2016. ,
The FAIR Guiding Principles for scientific data management and stewardship, 2016. ,
DOI : 10.1038/sdata.2016.18
URL : http://www.nature.com/articles/sdata201618.pdf
The other N: The role of repetitions and items in the design of phonetic experiments, Proceedings of the 18th International Congress of Phonetic Sciences, 2015. ,
Defining documentary linguistics, Language documentation and description, vol.1, pp.35-51, 2003. ,
Graham Neubig gneubig@cs.cmu.edu Séverine Guillaume severine