Skip to Main content Skip to Navigation
Conference papers

TXM : Une plateforme logicielle open-source pour la textométrie - conception et développement

Abstract : The research project Federation and Research Developments in Textometry around the creation of an Open- Source Platform distributes its XML-TEI encoded corpus textometric analysis platform online. The design of this platform is based on a synthesis of features of existing textometric software. It relies on identifying the open-source software technology available and effectively processing digital resources encoded in XML and Unicode, and on a state of the art of open-source full-text search engines on structured and annotated corpora. The architecture is based on a Java toolkit component articulating a search engine (IMS CWB), a statistical computing environment (R) and a module for importing XML-TEI encoded corpora. The platform is distributed as an open-source toolkit for developers and in the form of two applications for end users of textometry: a local application to install on a workstation (Windows or Linux) and an online web application. Still early in its development, the platform implements at present only a few essential features, but its distribution in open-source already allows an open community development. This should facilitate its development and integration of new models and methods.
Complete list of metadata

Cited literature [11 references]  Display  Hide  Download
Contributor : Serge Heiden Connect in order to contact the contributor
Submitted on : Wednesday, December 22, 2010 - 3:43:24 PM
Last modification on : Tuesday, March 30, 2021 - 3:47:40 AM
Long-term archiving on: : Friday, December 2, 2016 - 12:42:43 PM


Files produced by the author(s)


  • HAL Id : halshs-00549779, version 1



Serge Heiden, Jean-Philippe Magué, Bénédicte Pincemin. TXM : Une plateforme logicielle open-source pour la textométrie - conception et développement. 10th International Conference on the Statistical Analysis of Textual Data - JADT 2010, Jun 2010, Rome, Italie. pp.1021-1032. ⟨halshs-00549779⟩



Record views


Files downloads