Addressing Code-Switching in French/Algerian Arabic Speech

Abstract : This study focuses on code-switching (CS) in French/Algerian Arabic bilingual communities and investigates how speech technologies, such as automatic data partitioning, language identification and automatic speech recognition (ASR) can serve to analyze and classify this type of bilingual speech. A preliminary study carried out using a corpus of Maghrebian broadcast data revealed a relatively high presence of CS Alge-rian Arabic as compared to the neighboring countries Morocco and Tunisia. Therefore this study focuses on code switching produced by bilingual Algerian speakers who can be considered native speakers of both Algerian Arabic and French. A specific corpus of four hours of speech from 8 bilingual French Algerian speakers was collected. This corpus contains read speech and conversational speech in both languages and includes stretches of code-switching. We provide a linguistic description of the code-switching stretches in terms of intra-sentential and inter-sentential switches, the speech duration in each language. We report on some initial studies to locate French, Arabic and the code-switched stretches, using ASR system word posteriors for this pair of languages.
Document type :
Conference papers
Liste complète des métadonnées
Contributor : Djegdjiga Amazouz <>
Submitted on : Thursday, January 3, 2019 - 5:14:39 PM
Last modification on : Saturday, March 16, 2019 - 1:55:43 AM
Document(s) archivé(s) le : Thursday, April 4, 2019 - 2:51:07 PM


Files produced by the author(s)



Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel. Addressing Code-Switching in French/Algerian Arabic Speech. Interspeech 2017, Sep 2019, Stocholm, Sweden. pp.62-66, ⟨10.21437/interspeech.2017-1373⟩. ⟨halshs-01969148⟩



Record views


Files downloads