No more rage against the machine: how the corpus-based identification of machine-translationese can lead to student empowerment - HAL-SHS - Sciences de l'Homme et de la Société Accéder directement au contenu
Article Dans Une Revue The Journal of specialised translation (JoSTrans) Année : 2020

No more rage against the machine: how the corpus-based identification of machine-translationese can lead to student empowerment

Rudy Loock

Résumé

The aim of this article is to show how a linguistic analysis of a corpus of machine-translated texts, both quantitative and qualitative, can empower translation trainees by helping them define their added value over machine translation (MT) systems. In particular the aim is to show that MT, even when providing grammatically correct output, does not comply with linguistic usage, thus failing to provide natural-sounding translations as expected in today’s market for specialised translation. Following two avenues left open for future research in Loock (2018), this article provides the results of a corpus analysis of EN-FR machine-translated texts using 3 MT systems: DeepL (NMT) and the European Commission’s eTranslation in both its SMT and NMT versions. The quantitative results show that the linguistic characteristics of machine-translated texts differ from French original texts, with an almost systematic over-representation of a series of linguistic features, possibly but partially due to source language interference, while the qualitative analysis of a sample reveals finer-grained results (e.g. variability of results depending on (N)MT tool, frequency of adverb deletion). It is then explained how such results, leading to the identification of ‘machine-translationese’, are meant to be used in an educational setting to improve translator education, by (i) making students aware of the gap between machine-translated texts and original texts, and (ii) providing them with information on what to focus on during the post-editing process.

Domaines

Linguistique
Fichier non déposé

Dates et versions

halshs-02913980 , version 1 (11-08-2020)

Identifiants

  • HAL Id : halshs-02913980 , version 1

Citer

Rudy Loock. No more rage against the machine: how the corpus-based identification of machine-translationese can lead to student empowerment. The Journal of specialised translation (JoSTrans), 2020, 34, pp.150-170. ⟨halshs-02913980⟩
423 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More