HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Journal articles

Анализ корпусов текстов террористической и антиправовой направленности

Abstract : The purpose of the study in the development of a technique of creation and automatic analysis of special corpora for their subsequent application as the training datasets and detecting the differentiating characters in problems of text classification. The method is to use the analysis tools provided by the TXM platform expanded with new procedures of calculation of additional characteristics of texts, such as combinations of letters, pseudo-stems, noun phrases and verb phrases. As a results, it is shown that the developed extenders of the case TXM platform allow to solve effectively problems of the analysis of texts of special subject, the created corpus of extremist subject can be used as the training selection for problems of classification of texts, the conclusion about use of combinations of letters as the universal differentiating characters along with classical linguistic characteristics of texts is drawn.
Complete list of metadata

Contributor : Alexei Lavrentiev Connect in order to contact the contributor
Submitted on : Tuesday, August 13, 2019 - 1:52:29 PM
Last modification on : Friday, January 7, 2022 - 9:52:02 AM

Links full text



Alexei Lavrentiev, Ivan Smirnov, Margarita Suvorova, Fedor Solovyev, Alina Fokina, et al.. Анализ корпусов текстов террористической и антиправовой направленности. Voprosy kiberbezopasnosti, NPO Eshelon, 2019, pp.54-60. ⟨10.21681/2311-3456-2019-4-54-60⟩. ⟨halshs-02266136⟩



Record views