|
This article is cited in 2 scientific papers (total in 2 papers)
The solution of the problem of bluff detection in the game «I-doubt-it» based on reinforcement learning
S. A. Knyazyatov, G. G. Malinetskiy
Abstract:
In this paper we consider the construction of an algorithm based on reinforcement learning for the problem of recognizing and using a bluff on the example of a card game «I-doubt-it». The constructed algorithm has the 'intellectual ability' to restructure its behavior strategy and to evaluate possible moves based on previous experience.This class of algorithms used to make decisions in rapidly changing environments. The method and results of comparing algorithms among themselves, the results of games of the best algorithms with a real opponent are obtained. The effect of 'overfitting' is detected, increasing the number of training batches, in some cases, does not improve, but worsens the quality of the algorithm.
Keywords:
reinforcement learning, mathematical modeling, $Q$-learning, SARSA($\lambda$)
method, bluff detection algorithm, bluff imitation, neural networks, high-speed
decision making.
Citation:
S. A. Knyazyatov, G. G. Malinetskiy, “The solution of the problem of bluff detection in the game «I-doubt-it» based on reinforcement learning”, Keldysh Institute preprints, 2018, 170, 21 pp.
Linking options:
https://www.mathnet.ru/eng/ipmp2529 https://www.mathnet.ru/eng/ipmp/y2018/p170
|
|