Conference papers

TXM: An Open-Source Software Platform for Textometry - Design and Development

Abstract : The research project "Federation and Research Developments in Textometry around the Creation of an Open-Source Platform" distributes online its platform for the textometric analysis of XML-TEI encoded corpora. The design of this platform rests on a synthesis of the features of existing textometric software, on a survey of the open-source technologies available for efficiently processing digital resources encoded in XML and Unicode, and on a state of the art of open-source full-text search engines for structured and annotated corpora. The architecture is a Java toolkit that combines a search engine (IMS CWB), a statistical computing environment (R), and a module for importing XML-TEI encoded corpora. The platform is distributed as an open-source toolkit for developers and as two applications for end users of textometry: a local application installed on a workstation (Windows or Linux) and an online web application. Still early in its development, the platform currently implements only a few essential features, but its open-source distribution already allows open community development, which should facilitate its further development and the integration of new models and methods.
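To make the import-then-count pipeline described in the abstract more concrete, here is a minimal sketch in Python. It is not TXM code: the platform itself is a Java toolkit built on IMS CWB and R, whereas this illustration uses only the Python standard library. The sample document, the function names `read_tei_words` and `frequency_list`, and the assumption that words are marked up with TEI `<w>` elements are all hypothetical choices made for the example.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Namespace used by TEI P5 documents.
TEI_NS = "http://www.tei-c.org/ns/1.0"

# Hypothetical miniature TEI document, invented for this illustration.
SAMPLE_TEI = f"""<TEI xmlns="{TEI_NS}">
  <text><body>
    <p><w>the</w> <w>cat</w> <w>sees</w> <w>the</w> <w>dog</w></p>
  </body></text>
</TEI>"""


def read_tei_words(xml_text):
    """Return the word forms found in TEI <w> elements, in document order."""
    root = ET.fromstring(xml_text)
    return [w.text for w in root.iter(f"{{{TEI_NS}}}w") if w.text]


def frequency_list(words):
    """Count occurrences of each lowercased word form."""
    return Counter(word.lower() for word in words)


# Import step (read the encoded corpus), then a basic textometric count.
words = read_tei_words(SAMPLE_TEI)
freqs = frequency_list(words)
print(freqs.most_common(2))
```

A real platform would delegate indexing and querying to a corpus search engine and the statistics to a dedicated environment, as the abstract describes; the sketch only shows the general shape of reading an encoded corpus and deriving a frequency list from it.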

Contributor : Serge Heiden
Submitted on : Wednesday, December 22, 2010 - 3:43:24 PM
Last modification on : Monday, January 24, 2022 - 6:52:05 PM
Long-term archiving on : Friday, December 2, 2016 - 12:42:43 PM


Files produced by the author(s)


  • HAL Id : halshs-00549779, version 1



Serge Heiden, Jean-Philippe Magué, Bénédicte Pincemin. TXM : Une plateforme logicielle open-source pour la textométrie - conception et développement. 10th International Conference on the Statistical Analysis of Textual Data - JADT 2010, Jun 2010, Rome, Italy. pp.1021-1032. ⟨halshs-00549779⟩


