M. G. Konovalov, R. V. Razumchik, “Controlling a bounded two-dimensional Markov chain with a given invariant measure”, Inform. Primen., 16:2 (2022), 109

Informatika i Ee Primeneniya [Informatics and its Applications]

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive
	Impact factor

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Inform. Primen.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Informatika i Ee Primeneniya [Informatics and its Applications], 2022, Volume 16, Issue 2, Pages 109–117
DOI: https://doi.org/10.14357/19922264220214 (Mi ia793)

Controlling a bounded two-dimensional Markov chain with a given invariant measure

M. G. Konovalov, R. V. Razumchik

Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Full-text PDF (386 kB)

References:

PDF

HTML

DOI: https://doi.org/10.14357/19922264220214

Abstract: Consideration is given to the two-dimensional discrete-time Markov chain (random walk) with the bounded continuous state space (rectangle). Upon each transition, depending on its current position and if not on the boundary, the chain moves in one of four possible directions (north, south, east, or west). Having selected a direction, the length of the jump within the admissible interval is determined by the random variable. Assuming that some (reference) distribution on the state space is given, one seeks to solve the inverse control problem, i. e., to find such a control strategy (probabilities of choosing either direction) which brings the stationary distribution of the chain close (in a certain sense) to the reference distribution. The solution based on the policy gradient method is proposed. Illustrative examples are provided.

Keywords: Markov chain control, continuous state space, policy gradient, unmanned air vehicles.

Funding agency	Grant number
Russian Foundation for Basic Research	20-07-00804
The research was partially supported by the Russian Foundation for Basic Research (project No. 20-07-00804).

Received: 17.04.2022

Bibliographic databases:

Document Type: Article

Language: Russian

Citation: M. G. Konovalov, R. V. Razumchik, “Controlling a bounded two-dimensional Markov chain with a given invariant measure”, Inform. Primen., 16:2 (2022), 109–117

Citation in format AMSBIB

\Bibitem{KonRaz22}

\by M.~G.~Konovalov, R.~V.~Razumchik

\paper Controlling a~bounded two-dimensional Markov chain with~a~given invariant measure

\jour Inform. Primen.

\yr 2022

\vol 16

\issue 2

\pages 109--117

\mathnet{http://mi.mathnet.ru/ia793}

\crossref{https://doi.org/10.14357/19922264220214}

\mathscinet{http://mathscinet.ams.org/mathscinet-getitem?mr=4531989}

Linking options:

https://www.mathnet.ru/eng/ia793

https://www.mathnet.ru/eng/ia/v16/i2/p109

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Statistics & downloads:
Abstract page:	284
Full-text PDF :	42
References:	15

Что такое QR-код?

Registration to the website

Logotypes