Combining Rules and CRF Learning for Opinion Source Identification in Spanish Texts - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Combining Rules and CRF Learning for Opinion Source Identification in Spanish Texts

Résumé

In this work we present a system for the automatic annotation of opinions in Spanish texts. We focus mainly in the definition of a TFS-style model for the predicates of opinion and their arguments, in the creation of a lexicon of opinion predicates and in two additional variants for identifying the source of opinions. The original system extracts opinions and all its elements (predicate, source, topic and message) based on hand-coded rules, the first variant uses a CRF model for learning the source, assuming that the predicate is already tagged, and the second variant is a combined version, with the result of source recognition via the rule-based system being added as an additional attribute for training the CRF model. We found that this hybrid system performs better than each of the systems evaluated separately. This work involved the construction of several resources for Spanish: a lexicon of opinion predicates, a 13,000 word corpus with whole opinion annotations and a 40,000 word corpus with annotations of opinion predicates and sources.
Fichier principal
Vignette du fichier
76370452.pdf (209.14 Ko) Télécharger le fichier
Origine : Accord explicite pour ce dépôt
Loading...

Dates et versions

halshs-00785381 , version 1 (06-02-2013)

Identifiants

  • HAL Id : halshs-00785381 , version 1

Citer

Aiala Rosa, Dina Wonsever, Jean-Luc Minel. Combining Rules and CRF Learning for Opinion Source Identification in Spanish Texts. IBERAMIA 2012, Nov 2012, Bahia Blanca, Argentina. pp.452 - 461. ⟨halshs-00785381⟩
111 Consultations
340 Téléchargements

Partager

Gmail Facebook X LinkedIn More