Edition 1.1 of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions
Carlos Ramisch (1), Silvio Cordeiro (1), Agata Savary (2), Veronika Vincze (3), Verginica Barbu Mititelu (4), Archna Bhatia (5), Maja Buljan (6), Marie Candito, Polona Gantar (7), Voula Giouli (8), Tunga Güngör (9), Abdelati Hawwari (10), Uxoa Iñurrieta (11), Jolanta Kovalevskaitė (12), Simon Krek (13), Timm Lichte (14), Chaya Liebeskind (15), Johanna Monti (16), Carla Parra Escartín (17), Behrang Qasemizadeh (14), Renata Ramisch (18), Nathan Schneider (19), Ivelina Stoyanova (20), Ashwini Vaidya (21), Abigail Walsh (17)
1 TALEP - Traitement Automatique du Langage Ecrit et Parlé
2 BDTLN - Bases de données et traitement des langues naturelles
3 University of Szeged [Szeged]
4 Romanian Academy
5 IHMC - Florida Institute for Human and Machine Cognition [Pensacola]
6 University of Stuttgart
7 University of Ljubljana
8 ATHENA - Research and Innovation Center in Information, Communication and Knowledge Technologies
9 Boǧaziçi üniversitesi = Boğaziçi University [Istanbul]
10 GW - The George Washington University
11 UPV / EHU - University of the Basque Country = Euskal Herriko Unibertsitatea
12 VDU - Vytautas Magnus University - Vytauto Didziojo Universitetas
13 IJS - Jozef Stefan Institute [Ljubljana]
14 Heinrich Heine Universität Düsseldorf = Heinrich Heine University [Düsseldorf]
15 JCT - Jerusalem College of Technology
16 UniOr - Università di Napoli L'Orientale = University of Naples
17 DCU - Dublin City University [Dublin]
18 Interinstitutional Center for Computational Linguistics
19 GU - Georgetown University [Washington]
20 BAS - Bulgarian Academy of Sciences
21 IIT Delhi - Indian Institute of Technology Delhi
Carlos Ramisch
- Role: Author
- PersonId: 5103
- IdHAL: carlos-ramisch
- ORCID: 0000-0001-7466-9039
- IdRef: 170720802
Agata Savary
- Role: Author
- PersonId: 4644
- IdHAL: agata-savary
- IdRef: 113077661
Marie Candito
- Role: Author
- PersonId: 13596
- IdHAL: marie-candito
- IdRef: 153698616
Abstract
This paper describes the PARSEME Shared Task 1.1 on automatic identification of verbal multi-word expressions. We present the annotation methodology, focusing on changes from last year's shared task. Novel aspects include enhanced annotation guidelines, additional annotated data for most languages, corpora for some new languages, and new evaluation settings. Corpora were created for 20 languages, which are also briefly discussed. We report organizational principles behind the shared task and the evaluation metrics employed for ranking. The 17 participating systems, their methods and obtained results are also presented and analysed.
Deposit format | File |
---|---|
Deposit type | Conference paper |
Author(s) |
Carlos Ramisch (1), Silvio Cordeiro (1), Agata Savary (2), Veronika Vincze (3), Verginica Barbu Mititelu (4), Archna Bhatia (5), Maja Buljan (6), Marie Candito, Polona Gantar (7), Voula Giouli (8), Tunga Güngör (9), Abdelati Hawwari (10), Uxoa Iñurrieta (11), Jolanta Kovalevskaitė (12), Simon Krek (13), Timm Lichte (14), Chaya Liebeskind (15), Johanna Monti (16), Carla Parra Escartín (17), Behrang Qasemizadeh (14), Renata Ramisch (18), Nathan Schneider (19), Ivelina Stoyanova (20), Ashwini Vaidya (21), Abigail Walsh (17)
1 TALEP - Traitement Automatique du Langage Ecrit et Parlé (530703) - France
2 BDTLN - Bases de données et traitement des langues naturelles (394523) - France
3 University of Szeged [Szeged] (63200) - Dugonics square 13, H-6720 Szeged - Hungary
4 Romanian Academy (303967) - 125, Calea Victoriei, sector 1, RO - 010071, Bucharest - Romania
5 IHMC - Florida Institute for Human and Machine Cognition [Pensacola] (267499) - 40 S Alcaniz St, Pensacola, FL 32502 - United States
6 University of Stuttgart (63008) - Germany
7 University of Ljubljana (302844) - Kongresni trg 12, 1000 Ljubljana - Slovenia
8 ATHENA - Research and Innovation Center in Information, Communication and Knowledge Technologies (439388) - Artemidos 6 & Epidavrou, 151 25 Maroussi - Greece
9 Boǧaziçi üniversitesi = Boğaziçi University [Istanbul] (183131) - 34342 Bebek, Istanbul - Turkey
10 GW - The George Washington University (304728) - 2121 Eye Street, NW, Washington, DC 20052 - United States
11 UPV / EHU - University of the Basque Country = Euskal Herriko Unibertsitatea (37123) - Barrio Sarriena s/n, 48940 Leioa, Bizkaia - Spain
12 VDU - Vytautas Magnus University - Vytauto Didziojo Universitetas (440320) - K. Donelaičio str. 58, 44248, Kaunas - Lithuania
13 IJS - Jozef Stefan Institute [Ljubljana] (451147) - Jamova cesta 39, 1000 Ljubljana - Slovenia
14 Heinrich Heine Universität Düsseldorf = Heinrich Heine University [Düsseldorf] (335120) - Universitätsstr. 1, 40225 Düsseldorf - Germany
15 JCT - Jerusalem College of Technology (460710) - Havaad Haleumi Street 21, Jerusalem 91160 - Israel
16 UniOr - Università di Napoli L'Orientale = University of Naples (361219) - Palazzo Du Mesnil, Via Chiatamone 61/62 - 80121 Napoli - Italy
17 DCU - Dublin City University [Dublin] (300829) - Glasnevin, Dublin 9 - Ireland
18 Interinstitutional Center for Computational Linguistics (542318) - Brazil
19 GU - Georgetown University [Washington] (472411) - 37th and O Streets, N.W., Washington D.C. 20057 - United States
20 BAS - Bulgarian Academy of Sciences (302199) - 15th November #1 str, Sofia, 1040 - Bulgaria
21 IIT Delhi - Indian Institute of Technology Delhi (51173) - Hauz Khas, New Delhi - 110 016 - India
|
City | Santa Fe |
Country | United States |
Popular science | No |
Document language | English |
Source | Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018) |
Peer-reviewed | Yes |
Audience | International |
Invited | No |
Proceedings | No |
Conference title | Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018) |
Pages/Identifier | 222-240 |
Conference start date | 2018-08-25 |
ANR project(s) | |
Domain(s) | |
Commercial publisher | |
Origin: Publisher files allowed on an open archive