From built examples to attested examples: a syntax-based query system for non-specialists

Abstract : Using queries to explore corpora is now routine practice, not only among researchers in various fields who take an empirical approach to discourse, but also among non-specialists who use search engines or concordancers for language learning purposes. While keyword-based queries are quite common, non-specialists are less likely to explore syntactic constructions. Syntax-based queries usually require regular expressions that combine grammatical words with morphosyntactic tags, meaning that users need to master both the query language of the tool and the tagset of the annotated corpus. However, non-specialists such as language learners may prefer to focus on the output rather than spend time and effort mastering a query language. To address this shortcoming, we propose a methodology that includes a syntactic parser and uses common similarity measures to compare sequences of automatically produced morphosyntactic tags.
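
As a rough illustration of comparing sequences of morphosyntactic tags, the sketch below ranks a few corpus sentences by how closely their tag sequences match the tag sequence of a user-built example. It is not the authors' implementation. The abstract does not name the similarity measures used, so a normalized Levenshtein similarity is assumed here as one common choice. The tags are written by hand in a Universal-POS-like style instead of being produced by a parser, and the function names (`edit_distance`, `similarity`, `rank`) and sample sentences are hypothetical.

```python
from typing import List, Sequence, Tuple


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two tag sequences (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, tag_a in enumerate(a, start=1):
        curr = [i]
        for j, tag_b in enumerate(b, start=1):
            cost = 0 if tag_a == tag_b else 1
            curr.append(min(prev[j] + 1,          # delete tag_a
                            curr[j - 1] + 1,      # insert tag_b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Similarity in [0, 1]: 1.0 means identical tag sequences."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


def rank(query_tags: Sequence[str],
         corpus: List[Tuple[str, List[str]]]) -> List[Tuple[float, str]]:
    """Sort corpus sentences from most to least similar to the query."""
    scored = [(similarity(query_tags, tags), text) for text, tags in corpus]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hand-written tags (assumption): a real system would obtain them
# automatically from a tagger or parser.
query = ["DET", "NOUN", "AUX", "VERB"]  # e.g. "the cat is sleeping"
corpus = [
    ("She reads books", ["PRON", "VERB", "NOUN"]),
    ("A dog was barking", ["DET", "NOUN", "AUX", "VERB"]),
    ("The old man is walking", ["DET", "ADJ", "NOUN", "AUX", "VERB"]),
]

if __name__ == "__main__":
    for score, text in rank(query, corpus):
        print(f"{score:.2f}  {text}")
```

With this kind of ranking, the user supplies only a built example sentence and never writes a regular expression or tag pattern; the attested sentences whose structure is closest come first.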
Document type : Conference paper

https://halshs.archives-ouvertes.fr/halshs-01399523
Contributor : Ilaine Wang
Submitted on : Saturday, November 19, 2016 - 10:42:36 AM
Last modification on : Tuesday, July 23, 2019 - 4:16:13 PM
Long-term archiving on : Tuesday, March 21, 2017 - 4:02:24 AM

Identifiers

  • HAL Id : halshs-01399523, version 1

Citation

Ilaine Wang, Sylvain Kahane, Isabelle Tellier. From built examples to attested examples: a syntax-based query system for non-specialists. PACLIC30, Jong-Bok Kim, Oct 2016, Seoul, South Korea. ⟨halshs-01399523⟩