A Statistical Approach to Persian Light Verb Constructions
Abstract
This article presents the linguistic bases of Persian light verb constructions and shows the corpus-based construction of lists of collocates for some common Persian verbs. The proposed methods of corpus construction are language-independent, and the good results obtained on a relatively small corpus of 20 million words confirm the power of association measures based on the hypergeometric distribution. The resulting lists show a gradation of lexicalization and the semantic homogeneity of some light verb subcategorization schemes, which could be the reason for their wide usage.
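As an illustration of what an association measure based on the hypergeometric distribution can look like, here is a minimal sketch. The abstract does not say which exact measure is used, so the one-sided tail probability below (as in Fisher's exact test) is an assumption, and the function names, parameter names, and toy counts are hypothetical.

```python
from math import comb, log10


def hypergeom_tail(k, K, n, N):
    """Return P(X >= k) for X ~ Hypergeometric(N, K, n).

    Assumed reading of the parameters (not taken from the paper):
      N: total number of (word, verb) pairs in the corpus
      K: pairs containing the candidate collocate
      n: pairs containing the verb
      k: pairs containing both
    """
    upper = min(K, n)
    total = comb(N, n)
    # math.comb returns 0 when the lower index exceeds the upper one,
    # so impossible configurations contribute nothing to the sum.
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / total


def association_score(k, K, n, N):
    """Negative log10 of the tail probability: larger means stronger association."""
    return -log10(hypergeom_tail(k, K, n, N))


# Toy counts only. Exact binomials become expensive at realistic corpus
# sizes, where a log-space or library implementation would be preferable.
print(hypergeom_tail(2, 3, 4, 10))
print(association_score(8, 10, 12, 1000))
```

Ranking the candidate collocates of a verb by such a score, from highest to lowest, is one way to obtain collocate lists of the kind the abstract describes.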
Domains
Linguistics
Origin: Files produced by the author(s)