Journal article, Computers in Industry, Year: 2016

Natural Language Processing for aviation safety reports: from classification to interactive analysis

Abstract

In this paper we describe the NLP techniques designed and used in a collaboration between the CLLE-ERSS research laboratory and the CFH / Safety Data company to manage and analyse aviation incident reports. These reports are written every time anything abnormal occurs during a civil air flight. Although most of them describe routine problems, they are a valuable source of information about possible sources of greater danger. The texts are written in plain language, show a wide range of linguistic variation (from a telegraphic style crowded with acronyms to standard prose) and exist in different languages, even within a single company or country (although our main focus is on English and French). In addition to their variety, their sheer quantity (e.g. 600 per month for a large airline) clearly requires advanced NLP and text mining techniques in order to extract useful information from them. Although this context and these objectives seem to suggest that standard NLP techniques can be applied in a straightforward manner, innovative techniques are required to handle the specifics of aviation report text and the complex classification systems. We present several tools that aim to improve access to this data (classification and information retrieval) and to help aviation safety experts in their analyses (data/text mining and interactive analysis). Some of these tools are currently under test or in use at both the national and international levels, by airlines as well as by regulatory authorities (DGAC, EASA, ICAO).

Domains

Linguistics
Main file
Preprint.pdf (765.44 KB)
Origin: files produced by the author(s)

Dates and versions

halshs-01322238, version 1 (26-05-2016)

Cite

Ludovic Tanguy, Nikola Tulechki, Assaf Urieli, Eric Hermann, Céline Raynal. Natural Language Processing for aviation safety reports: from classification to interactive analysis. Computers in Industry, 2016, 78, pp.80-95. ⟨10.1016/j.compind.2015.09.005⟩. ⟨halshs-01322238⟩
