Bayesian distillation of deep learning models
A. V. Grabovoy (a), V. V. Strijov (b)
(a) Moscow Institute of Physics and Technology, Dolgoprudnyi, Moscow oblast, 141701 Russia
(b) Dorodnicyn Computing Centre, Russian Academy of Sciences, Moscow, 119333 Russia
Abstract:
We study the problem of reducing the complexity of approximating models and consider methods based on the distillation of deep learning models. The concepts of trainer and student are introduced, and the student model is assumed to have fewer parameters than the trainer model. A Bayesian approach to student model selection is suggested: a method is proposed for assigning the prior distribution of the student parameters based on the posterior distribution of the trainer model parameters. Since the trainer and student parameter spaces do not coincide, we propose a mechanism that reduces the trainer parameter space to the student parameter space by changing the trainer model structure. A theoretical analysis of the proposed reduction mechanism is carried out. A computational experiment is performed on synthetic and real data, with the FashionMNIST sample used as the real data.
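As a minimal illustration of the prior-assignment step described above, the sketch below assumes the trainer posterior is approximated by a diagonal Gaussian over the weights of a small linear model, and models the reduction as plain index selection over those weights; the linear model, the regularization weight, and all variable names are illustrative assumptions, not the paper's actual construction.

import torch

# Trainer posterior, approximated (assumption) by a diagonal Gaussian
# over the weights of a 4-input linear model.
mu_trainer = torch.tensor([1.0, -0.5, 2.0, 0.3])
sigma_trainer = torch.full((4,), 0.1)

# Reduction of the parameter space (assumption: modeled here as index
# selection): the student keeps 2 of the 4 trainer weights, so its
# prior is the marginal of the trainer posterior on those coordinates.
keep = torch.tensor([0, 2])
mu_prior = mu_trainer[keep]
sigma_prior = sigma_trainer[keep]

# Student: a 2-parameter linear model fitted by MAP, i.e. a squared
# data term plus a Gaussian log-prior centred on the reduced posterior.
torch.manual_seed(0)
X = torch.randn(64, 2)
y = X @ mu_prior + 0.05 * torch.randn(64)
w = torch.zeros(2, requires_grad=True)
opt = torch.optim.SGD([w], lr=0.1)
for _ in range(200):
    opt.zero_grad()
    data_term = ((X @ w - y) ** 2).mean()
    prior_term = (((w - mu_prior) / sigma_prior) ** 2).mean()
    loss = data_term + 1e-3 * prior_term  # 1e-3 is an arbitrary prior weight
    loss.backward()
    opt.step()
print(w.detach())  # close to mu_prior when the data agree with the prior

In the paper itself, the reduction is obtained by changing the trainer model structure; the index selection above only stands in for that mechanism.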
Keywords:
model selection, Bayesian inference, model distillation, local transformation, probability space transformation.
Citation:
A. V. Grabovoy, V. V. Strijov, “Bayesian distillation of deep learning models”, Avtomat. i Telemekh., 2021, no. 11, 16–29; Autom. Remote Control, 82:11 (2021), 1846–1856
Linking options:
https://www.mathnet.ru/eng/at15826
https://www.mathnet.ru/eng/at/y2021/i11/p16