Addressing Code-Switching in French/Algerian Arabic Speech

Abstract: This study focuses on code-switching (CS) in French/Algerian Arabic bilingual communities and investigates how speech technologies, such as automatic data partitioning, language identification, and automatic speech recognition (ASR), can serve to analyze and classify this type of bilingual speech. A preliminary study carried out using a corpus of Maghrebian broadcast data revealed a relatively high presence of CS in Algerian Arabic as compared to the neighboring countries Morocco and Tunisia. Therefore, this study focuses on code-switching produced by bilingual Algerian speakers who can be considered native speakers of both Algerian Arabic and French. A specific corpus of four hours of speech from 8 bilingual French-Algerian speakers was collected. This corpus contains read speech and conversational speech in both languages and includes stretches of code-switching. We provide a linguistic description of the code-switching stretches in terms of intra-sentential and inter-sentential switches and the speech duration in each language. We report on some initial studies to locate French, Arabic, and the code-switched stretches, using ASR system word posteriors for this pair of languages.
Document type:
Conference paper
Interspeech 2017, Sep 2019, Stockholm, Sweden. Interspeech 2017, pp.62-66, 〈10.21437/interspeech.2017-1373〉
Contributor: Djegdjiga Amazouz
Submitted on: Thursday, January 3, 2019 - 17:14:39
Last modified on: Wednesday, February 13, 2019 - 01:26:51


Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel. Addressing Code-Switching in French/Algerian Arabic Speech. Interspeech 2017, Sep 2019, Stockholm, Sweden. Interspeech 2017, pp.62-66, 〈10.21437/interspeech.2017-1373〉. 〈halshs-01969148〉


