Literal readings of multiword expressions: as scarce as hen's teeth

Abstract : Multiword expressions can have both idiomatic and literal occurrences. Distinguishing these two cases is considered one of the major challenges in MWE processing. We suggest that literal readings should be considered in both semantic and syntactic terms, which motivates their study in a treebank. We propose heuristics to automatically pre-identify candidate sentences that might contain literal readings of verbal VMWEs, and we apply them to an existing Polish treebank. We also perform a linguistic study of the literal readings extracted by the different heuristics. The results suggest that literal readings constitute a rare phenomenon. We also identify some properties that may distinguish them from their idiomatic counterparts.
Complete list of metadatas

Cited literature [11 references]  Display  Hide  Download
Contributor : Agata Savary <>
Submitted on : Tuesday, February 6, 2018 - 5:09:07 PM
Last modification on : Monday, December 2, 2019 - 4:01:48 PM
Long-term archiving on : Wednesday, May 2, 2018 - 2:21:09 PM


Publisher files allowed on an open archive


  • HAL Id : hal-01694995, version 1


Agata Savary, Silvio Cordeiro. Literal readings of multiword expressions: as scarce as hen's teeth. Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT 16), Jan 2018, Prague, Czech Republic, Jan 2018, Prague, Czech Republic. pp.64 - 72. ⟨hal-01694995⟩



Record views


Files downloads