TXM : Une plateforme logicielle open-source pour la textométrie - conception et développement

Abstract : The research project Federation and Research Developments in Textometry around the creation of an Open- Source Platform distributes its XML-TEI encoded corpus textometric analysis platform online. The design of this platform is based on a synthesis of features of existing textometric software. It relies on identifying the open-source software technology available and effectively processing digital resources encoded in XML and Unicode, and on a state of the art of open-source full-text search engines on structured and annotated corpora. The architecture is based on a Java toolkit component articulating a search engine (IMS CWB), a statistical computing environment (R) and a module for importing XML-TEI encoded corpora. The platform is distributed as an open-source toolkit for developers and in the form of two applications for end users of textometry: a local application to install on a workstation (Windows or Linux) and an online web application. Still early in its development, the platform implements at present only a few essential features, but its distribution in open-source already allows an open community development. This should facilitate its development and integration of new models and methods.
Complete list of metadatas

Cited literature [11 references]  Display  Hide  Download

https://halshs.archives-ouvertes.fr/halshs-00549779
Contributor : Serge Heiden <>
Submitted on : Wednesday, December 22, 2010 - 3:43:24 PM
Last modification on : Tuesday, May 28, 2019 - 5:28:53 PM
Long-term archiving on : Friday, December 2, 2016 - 12:42:43 PM

File

Heiden_al_jadt2010.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : halshs-00549779, version 1

Collections

Citation

Serge Heiden, Jean-Philippe Magué, Bénédicte Pincemin. TXM : Une plateforme logicielle open-source pour la textométrie - conception et développement. 10th International Conference on the Statistical Analysis of Textual Data - JADT 2010, Jun 2010, Rome, Italie. pp.1021-1032. ⟨halshs-00549779⟩

Share

Metrics

Record views

2872

Files downloads

1509