HTTP-request classification in automatic web application crawling
A. V. Lapkina, A. A. Petukhov Lomonosov Moscow State University
Abstract:
The problem of automatic request classification, as well as the problem of determining the routing rules for requests on the server side, is directly connected with the analysis of the user interface of dynamic web pages. This problem can be solved at the browser level, since the browser contains complete information about the possible requests arising from interaction between the user and the web application. In this paper, we suggest using data from the request execution context in the web client to extract classification features. A request context, or request trace, is a collection of additional identification data that can be obtained by observing the execution of the web page's JavaScript code and the changes in user interface elements resulting from their activation. Such data include, for example, the position and style of the element that caused the client request, the JavaScript function call stack, and the changes in the page's DOM tree after the request was initiated. In this study, the Chrome Developer Tools Protocol is used to solve the problem at the browser level and to automate the collection of request traces.
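The following is a minimal sketch, not the authors' implementation, of how part of such a request trace can be collected through the Chrome DevTools Protocol. It assumes Chrome was started with --remote-debugging-port=9222 and that the Python websockets package is available; the Network.requestWillBeSent event exposes an initiator field that carries the JavaScript call stack behind script-initiated requests, one of the trace features mentioned in the abstract.

```python
import asyncio
import json
import urllib.request

import websockets


async def collect_request_traces(debug_host: str = "http://localhost:9222") -> None:
    # Pick the first page target exposed by the DevTools HTTP endpoint.
    targets = json.loads(urllib.request.urlopen(f"{debug_host}/json").read())
    page = next(t for t in targets if t["type"] == "page")

    async with websockets.connect(page["webSocketDebuggerUrl"]) as ws:
        # Enable network instrumentation so request events are delivered.
        await ws.send(json.dumps({"id": 1, "method": "Network.enable"}))

        while True:
            message = json.loads(await ws.recv())
            if message.get("method") != "Network.requestWillBeSent":
                continue
            params = message["params"]
            # For script-initiated requests the initiator holds the JS call
            # stack that triggered the request.
            frames = params.get("initiator", {}).get("stack", {}).get("callFrames", [])
            print(params["request"]["url"])
            for frame in frames:
                print("  at", frame["functionName"] or "<anonymous>",
                      f'{frame["url"]}:{frame["lineNumber"]}')


if __name__ == "__main__":
    asyncio.run(collect_request_traces())
```

Other trace components described in the paper, such as element positions and DOM tree changes, would require additional protocol domains (e.g. DOM and Runtime) and are not shown here.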
Keywords:
request classification, application crawling, dynamic web application, Chrome DevTools.
Citation:
A. V. Lapkina, A. A. Petukhov, “HTTP-request classification in automatic web application crawling”, Proceedings of ISP RAS, 33:3 (2021), 77–86
Linking options:
https://www.mathnet.ru/eng/tisp600
https://www.mathnet.ru/eng/tisp/v33/i3/p77