Literal readings of multiword expressions: as scarce as hen's teeth

Abstract : Multiword expressions can have both idiomatic and literal occurrences. Distinguishing these two cases is considered one of the major challenges in MWE processing. We suggest that literal readings should be considered in both semantic and syntactic terms, which motivates their study in a treebank. We propose heuristics to automatically pre-identify candidate sentences that might contain literal readings of verbal VMWEs, and we apply them to an existing Polish treebank. We also perform a linguistic study of the literal readings extracted by the different heuristics. The results suggest that literal readings constitute a rare phenomenon. We also identify some properties that may distinguish them from their idiomatic counterparts.
Complete list of metadatas

Cited literature [11 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01694995
Contributor : Agata Savary <>
Submitted on : Tuesday, February 6, 2018 - 5:09:07 PM
Last modification on : Monday, December 2, 2019 - 4:01:48 PM
Long-term archiving on : Wednesday, May 2, 2018 - 2:21:09 PM

File

W17-7610.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01694995, version 1

Citation

Agata Savary, Silvio Cordeiro. Literal readings of multiword expressions: as scarce as hen's teeth. Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16), Jan 2018, Prague, Czech Republic, Jan 2018, Prague, Czech Republic. pp.64 - 72. ⟨hal-01694995⟩

Share

Metrics

Record views

551

Files downloads

152