Avtomatika i Telemekhanika
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor
Guidelines for authors
Submit a manuscript

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Avtomat. i Telemekh.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Avtomatika i Telemekhanika, 2021, Issue 11, Pages 16–29
DOI: https://doi.org/10.31857/S0005231021110027
(Mi at15826)
 

This article is cited in 2 scientific papers (total in 2 papers)

Topical issue (end)

Bayesian distillation of deep learning models

A. V. Grabovoya, V. V. Strijovb

a Moscow Institute of Physics and Technology, Dolgoprudnyi, Moscow oblast, 141701 Russia
b Dorodnicyn Computing Centre, Russian Academy of Sciences, Moscow, 119333 Russia
References:
Abstract: We study the problem of reducing the complexity of approximating models and consider methods based on distillation of deep learning models. The concepts of trainer and student are introduced. It is assumed that the student model has fewer parameters than the trainer model. A Bayesian approach to the student model selection is suggested. A method is proposed for assigning an a priori distribution of student parameters based on the a posteriori distribution of trainer model parameters. Since the trainer and student parameter spaces do not coincide, we propose a mechanism for the reduction of the trainer model parameter space to the student model parameter space by changing the trainer model structure. A theoretical analysis of the proposed reduction mechanism is carried out. A computational experiment was carried out on synthesized and real data. The FashionMNIST sample was used as real data.
Keywords: model selection, Bayesian inference, model distillation, local transformation, probability space transformation.
Funding agency Grant number
Ministry of Science and Higher Education of the Russian Federation 13/1251/2018
Russian Foundation for Basic Research 19-07-01155
19-07-00875
This paper contains results of the project “Mathematical Methods for Big Data Mining” carried out as part of the implementation of the Competence Center Program of the National Technological Initiative “Big Data Storage and Analysis Center” supported by the Ministry of Science and Higher Education of the Russian Federation under the Agreement of Lomonosov Moscow State University with the Fund for Support of Projects of the National Technology Initiative of December 11, 2018, no. 13/1251/2018. This work was supported by the Russian Foundation for Basic Research, projects nos. 19-07-01155 and 19-07-00875.
Presented by the member of Editorial Board: A. A. Lazarev

Received: 20.01.2021
Revised: 25.06.2021
Accepted: 30.06.2021
English version:
Automation and Remote Control, 2021, Volume 82, Issue 11, Pages 1846–1856
DOI: https://doi.org/10.1134/S0005117921110023
Bibliographic databases:
Document Type: Article
Language: Russian
Citation: A. V. Grabovoy, V. V. Strijov, “Bayesian distillation of deep learning models”, Avtomat. i Telemekh., 2021, no. 11, 16–29; Autom. Remote Control, 82:11 (2021), 1846–1856
Citation in format AMSBIB
\Bibitem{GraStr21}
\by A.~V.~Grabovoy, V.~V.~Strijov
\paper Bayesian distillation of deep learning models
\jour Avtomat. i Telemekh.
\yr 2021
\issue 11
\pages 16--29
\mathnet{http://mi.mathnet.ru/at15826}
\crossref{https://doi.org/10.31857/S0005231021110027}
\transl
\jour Autom. Remote Control
\yr 2021
\vol 82
\issue 11
\pages 1846--1856
\crossref{https://doi.org/10.1134/S0005117921110023}
\isi{https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=Publons&SrcAuth=Publons_CEL&DestLinkType=FullRecord&DestApp=WOS_CPL&KeyUT=000733324700002}
\scopus{https://www.scopus.com/record/display.url?origin=inward&eid=2-s2.0-85121557620}
Linking options:
  • https://www.mathnet.ru/eng/at15826
  • https://www.mathnet.ru/eng/at/y2021/i11/p16
  • This publication is cited in the following 2 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Avtomatika i Telemekhanika
    Statistics & downloads:
    Abstract page:120
    Full-text PDF :1
    References:44
    First page:18
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024