Collaboratively Producing Interoperable Ontologies and Semantically Annotated Corpora: the symogih.org project

Francesco Beretta
Abstract: The Digital History department (Pôle histoire numérique) of the LARHRA laboratory in Lyon has been developing the symogih.org project (Système modulaire de gestion de l’information historique) for ten years. It is a method and a platform for collaboratively producing structured data and using them to semantically annotate TEI-encoded texts. The aim of the project is not only to connect individual historical research and data production with a collectively managed data repository, but also to interlink the platform’s data with those published by other data providers, e.g. authority files of national libraries, museums and other cultural heritage institutions, and to format them according to widespread standards such as the CIDOC-CRM. In this way the data will be available, interoperable and reusable for new platform-internal and external research projects, and for the public. In the first part of my talk, I will describe the method the symogih.org project has adopted to collaboratively develop and maintain an ontology for historical data which can be extended indefinitely according to the needs of present participants and of new research projects. I will then report on the ongoing process of refining the symogih.org ontology using the CIDOC-CRM modelling method. This process aims to develop a CRM extension for historical data that will be managed by a consortium and be open to any interested project and to further development according to the specific needs of participating projects. In the second part, I will give an account of a method for semantically annotating XML-encoded texts using some basic tags and properties of the TEI standard, combining them with the flexibility and richness of an ontology for historical data. The workflow integrates the corpus analysis environment TXM for exploring the text from a linguistic perspective before annotating it semantically with the project ontology. I will then outline how this method makes it possible to analyse the terminology of a historical text corpus and collaboratively manage a conceptual thesaurus.
Document type:
Conference paper
Third International Workshop on Semantic Web for Scientific Heritage, May 2017, Portoroz, Slovenia. 2017, 〈http://www.cepam.cnrs.fr/zoomathia/sw4sh/〉

https://halshs.archives-ouvertes.fr/halshs-01539489
Contributor: Francesco Beretta
Submitted on: Wednesday, 14 June 2017 - 22:10:41
Last modified on: Wednesday, 31 October 2018 - 12:24:24

Licence


Distributed under a Creative Commons Attribution - ShareAlike 4.0 International License

Identifiers

  • HAL Id : halshs-01539489, version 1

Citation

Francesco Beretta. Collaboratively Producing Interoperable Ontologies and Semantically Annotated Corpora: the symogih.org project. Third International Workshop on Semantic Web for Scientific Heritage, May 2017, Portoroz, Slovenia. 2017, 〈http://www.cepam.cnrs.fr/zoomathia/sw4sh/〉. 〈halshs-01539489〉
