O. A. Kovaleva, A. V. Samokhvalov, M. A. Liashkov, S. Yu. Pchelintsev, “The quality improvement method for detecting attacks on web applications using pre-trained natural language models”, Izv. Saratov Univ. Math. Mech. Inform., 24:3 (2024), 442

Izvestiya of Saratov University. Mathematics. Mechanics. Informatics

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive
	Impact factor

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Izv. Saratov Univ. Math. Mech. Inform.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Izvestiya of Saratov University. Mathematics. Mechanics. Informatics, 2024, Volume 24, Issue 3, Pages 442–451
DOI: https://doi.org/10.18500/1816-9791-2024-24-3-442-451 (Mi isu1042)

Scientific Part
Computer Sciences

The quality improvement method for detecting attacks on web applications using pre-trained natural language models

O. A. Kovaleva, A. V. Samokhvalov, M. A. Liashkov, S. Yu. Pchelintsev

Derzhavin Tambov State University, 33 Internationalnaya St., Tambov 392036, Russia

Full-text PDF (5973 kB)

References:

PDF

HTML

DOI: https://doi.org/10.18500/1816-9791-2024-24-3-442-451

Abstract: This paper explores the use of deep learning techniques to improve the performance of web application firewalls (WAFs), describes a specific method for improving the performance of web application firewalls, and presents the results of its testing on publicly available CSIC 2010 data. Most web application firewalls work on the basis of rules that have been compiled by experts. When running, firewalls inspect HTTP requests exchanged between client and server to detect attacks and block potential threats. Manual drafting of rules requires experts' time, and distributed ready-made rule sets do not take into account the specifics of particular user applications, therefore they allow many false positives and miss many network attacks. In recent years, the use of pretrained language models has led to significant improvements in a diverse set of natural language processing tasks as they are able to perform knowledge transfer. The article describes the adaptation of these approaches to the field of information security, i.e. the use of a pretrained language model as a feature extractor to match an HTTP request with a feature vector. These vectors are then used to train the classifier. We offer a solution that consists of two stages. In the first step, we create a deep pre-trained language model based on normal HTTP requests to the web application. In the second step, we use this model as a feature extractor and train a one-class classifier. Both steps are performed for each application. The experimental results show that the proposed approach significantly outperforms the classical Mod-Security approaches based on rules configured using OWASP CRS and does not require the involvement of a security expert to define trigger rules.

Key words: firewalls, HTTP request analysis, pre-trained language models.

Received: 28.01.2023
Accepted: 02.02.2023

Bibliographic databases:

Document Type: Article

UDC: 004.032.2

Language: Russian

Citation: O. A. Kovaleva, A. V. Samokhvalov, M. A. Liashkov, S. Yu. Pchelintsev, “The quality improvement method for detecting attacks on web applications using pre-trained natural language models”, Izv. Saratov Univ. Math. Mech. Inform., 24:3 (2024), 442–451

Citation in format AMSBIB

\Bibitem{KovSamLia24}

\by O.~A.~Kovaleva, A.~V.~Samokhvalov, M.~A.~Liashkov, S.~Yu.~Pchelintsev

\paper The quality improvement method for detecting attacks on web applications using~pre-trained natural language models

\jour Izv. Saratov Univ. Math. Mech. Inform.

\yr 2024

\vol 24

\issue 3

\pages 442--451

\mathnet{http://mi.mathnet.ru/isu1042}

\crossref{https://doi.org/10.18500/1816-9791-2024-24-3-442-451}

\edn{https://elibrary.ru/OJWHMC}

Linking options:

https://www.mathnet.ru/eng/isu1042

https://www.mathnet.ru/eng/isu/v24/i3/p442

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Izvestiya of Saratov University. Mathematics. Mechanics. Informatics

Что такое QR-код?

Registration to the website

Logotypes