Conference papers

Textometry on Audiovisual Corpora: Experiments with TXM software

Abstract : Textometry is applied to audiovisual corpora, such as transcripts of semi-directed interviews or the "Actualités françaises" newsreel archive. A workflow based on assisted or automatic transcription software is an efficient way to obtain a rich encoding. New features have been added to the TXM software: a specialized import module based on the Transcriber XML format, a utility to convert text transcripts to XML, and the MediaPlayer extension, which plays the video segment corresponding to a selected word context. This experience raises methodological considerations. It is highly relevant for textometry to take into account internal text structures (such as speech turns) and other meta-information (such as timecodes). Meta-information has to be displayed and made available for processing without being mixed with the content. Another challenge is to integrate multiple interrelated representations. A back-to-media feature is as fundamental as the back-to-text one for providing context to interpretation work.

Cited literature : 18 references
Contributor : Bénédicte Pincemin
Submitted on : Thursday, June 4, 2020 - 4:43:44 PM
Last modification on : Sunday, June 26, 2022 - 1:10:40 AM
Long-term archiving on : Thursday, December 3, 2020 - 1:57:06 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution - ShareAlike 4.0 International License


  • HAL Id : halshs-02779055, version 1


Bénédicte Pincemin, Serge Heiden, Matthieu Decorde. Textometry on Audiovisual Corpora: Experiments with TXM software. 15th International Conference on Statistical Analysis of Textual Data JADT 2020, Laboratoire d’Etudes et Recherches Appliquées en Sciences Sociales (Lerass), EA827, Université de Toulouse 3 - Paul Sabatier, Jun 2020, Toulouse, France. ⟨halshs-02779055⟩


