Using the First Axis of a Correspondence Analysis as an Analytical Tool: Application to Establish and Define an Orality Gradient for Genres of Medieval French Texts

Abstract: Our corpus of medieval French texts is divided into 59 discourse units (DUs), which cross text genres with spoken vs. non-spoken text chunks (as tagged with the TEI elements q and sp). A correspondence analysis (CA) performed on selected POS tags indicates that orality is the main dimension of variation across DUs. We then design several methodological paths to investigate this gradient as computed by the first CA axis. Bootstrap is used to check the stability of observations; gradient-ordered barplots provide both a synthetic and an analytic view of the correlation of any variable with the gradient; and we characterize the gradient poles (here, the more-oral and less-oral poles) not only with the POS tags used for the CA, but also with words, in order to obtain a more precise, lexical description. This methodology could be transposed to other data with a potential gradient structure.
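As an illustration of the kind of computation the abstract describes, the following is a minimal sketch, not the authors' implementation. It computes first-axis coordinates of a correspondence analysis via a singular value decomposition, orders the rows along that axis, and runs a simple bootstrap. The toy counts, the row and column labels, and the function names `ca_first_axis` and `bootstrap_first_axis` are all hypothetical, and the multinomial resampling of each row is an assumed bootstrap scheme chosen for simplicity; the paper itself may resample differently.

```python
import numpy as np

def ca_first_axis(counts):
    """Principal coordinates of rows and columns on the first CA axis."""
    N = np.asarray(counts, dtype=float)
    P = N / N.sum()                       # correspondence matrix
    r = P.sum(axis=1)                     # row masses
    c = P.sum(axis=0)                     # column masses
    expected = np.outer(r, c)
    S = (P - expected) / np.sqrt(expected)  # standardized residuals
    U, sv, Vt = np.linalg.svd(S, full_matrices=False)
    row_coords = U[:, 0] * sv[0] / np.sqrt(r)
    col_coords = Vt[0, :] * sv[0] / np.sqrt(c)
    inertia_share = sv[0] ** 2 / np.sum(sv ** 2)
    return row_coords, col_coords, inertia_share

def bootstrap_first_axis(counts, n_rep=200, seed=0):
    """Resample each row (assumed multinomial scheme) and recompute axis 1."""
    N = np.asarray(counts)
    rng = np.random.default_rng(seed)
    ref, _, _ = ca_first_axis(N)
    reps = np.empty((n_rep, N.shape[0]))
    for b in range(n_rep):
        resampled = np.vstack(
            [rng.multinomial(row.sum(), row / row.sum()) for row in N]
        )
        coords, _, _ = ca_first_axis(resampled)
        # The sign of a CA axis is arbitrary: align each replicate
        # with the reference solution before comparing.
        if np.dot(coords, ref) < 0:
            coords = -coords
        reps[b] = coords
    return reps

# Hypothetical toy table: 4 discourse units (rows) x 3 POS tags (columns).
du_labels = ["DU_a", "DU_b", "DU_c", "DU_d"]
pos_labels = ["pronoun", "noun", "verb"]
counts = np.array([
    [90, 10, 20],
    [70, 30, 25],
    [30, 70, 30],
    [10, 90, 25],
])

row_coords, col_coords, share = ca_first_axis(counts)
order = np.argsort(row_coords)            # rows ordered along the gradient
reps = bootstrap_first_axis(counts)
low, high = np.percentile(reps, [2.5, 97.5], axis=0)

for i in order:
    print(f"{du_labels[i]}: {row_coords[i]:+.3f}  [{low[i]:+.3f}, {high[i]:+.3f}]")
```

Because the orientation of a CA axis is arbitrary, only the relative order of the rows along the axis is meaningful, not which end carries positive values. The percentile intervals give a rough indication of how stable each row's position is under resampling.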
Document type:
Conference paper
Domenica Fioredistella IEZZI, Livia CELARDO, Michelangelo MISURACA. 14th International Conference on the Statistical Analysis of Textual Data / 14es Journées internationales d'Analyse statistique des Données Textuelles (JADT 2018), Jun 2018, Roma, Italy. UniversItalia, Proceedings of 14th International Conference on the Statistical Analysis of Textual Data, 2, pp.594-601, JADT'18. Proceedings of the 14th International Conference on Statistical Analysis of Textual Data. 〈http://jadt2018.uniroma2.it http://lexicometrica.univ-paris3.fr/jadt/index.htm http://www.jadt.org〉


https://halshs.archives-ouvertes.fr/halshs-01759219
Contributor: Bénédicte Pincemin
Submitted on: Friday, April 6, 2018 - 19:11:54
Last modified on: Wednesday, October 31, 2018 - 12:24:26

License


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id: halshs-01759219, version 1

Citation

Bénédicte Pincemin, Céline Guillot-Barbance, Alexei Lavrentiev. Using the First Axis of a Correspondence Analysis as an Analytical Tool: Application to Establish and Define an Orality Gradient for Genres of Medieval French Texts. Domenica Fioredistella IEZZI, Livia CELARDO, Michelangelo MISURACA. 14th International Conference on the Statistical Analysis of Textual Data / 14es Journées internationales d'Analyse statistique des Données Textuelles (JADT 2018), Jun 2018, Roma, Italy. UniversItalia, Proceedings of 14th International Conference on the Statistical Analysis of Textual Data, 2, pp.594-601, JADT'18. Proceedings of the 14th International Conference on Statistical Analysis of Textual Data. 〈http://jadt2018.uniroma2.it http://lexicometrica.univ-paris3.fr/jadt/index.htm http://www.jadt.org〉. 〈halshs-01759219〉

