M. A. Ryndin, D. Y. Turdakov, “Domain adaptation by proactive labeling”, Proceedings of ISP RAS, 31:5 (2019), 145

Loading [MathJax]/jax/output/SVG/config.js

Proceedings of the Institute for System Programming of the RAS

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Proceedings of ISP RAS:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Proceedings of the Institute for System Programming of the RAS, 2019, Volume 31, Issue 5, Pages 145–152
DOI: https://doi.org/10.15514/ISPRAS-2019-31(5)-11 (Mi tisp460)

This article is cited in 1 scientific paper (total in 1 paper)

Domain adaptation by proactive labeling

M. A. Ryndin^a, D. Y. Turdakov^ba

^a Ivannikov Institute for System Programming of RAS
^b Lomonosov Moscow State University

Full-text PDF (965 kB) Citations (1)

References:

PDF

HTML

DOI: https://doi.org/10.15514/ISPRAS-2019-31(5)-11

Abstract: Getting tagged data is an expensive and time-consuming process. There are several approaches to how to reduce the number of examples needed for training. For example, the methods used in active learning are aimed at choosing only the most difficult examples for marking. Using active learning allows to achieve results similar to supervised learning, using much less labeled data. However, such methods are often dispersive and highly dependent on the choice of the initial approximation, and the optimal strategies for choosing examples for marking up either depend on the type of classifier or are computationally complex. Another approach is domain adaptation. Most of the approaches in this area are unsupervised and are based on approximating the distribution of data in domains by solving the problem of optimal transfer or extraction of domain-independent features. Supervised learning approaches are not resistant to changes in the distribution of the target variable. This is one of the reasons why the task of semis-supervised domain adaptation is posed: there are labeled data in the source domain, a lot of unlabeled data in the target domain and the ability to get labels for some of the data from the target domain. In this work, we show how proactive labeling can help transfer knowledge from one source domain to a different but relative target domain. We propose to use a machine learning model trained on source domain as a free fallible oracle. This oracle can determine complexity of a training example to make several decisions. First, this example should be added to training dataset. Second, do we have enough knowldge learnt from source to label this example ourself or we need to call a trusted expert? We present an algorithm that utilize this ideas and one of its features is ability to work with any classifier that has probabilistic interpretation of its outputs. Experimental evaluation on Amazon review dataset establish the effectiveness of proposed method.

Keywords: domain adaptation, proactive learning.

Document Type: Article

Language: Russian

Citation: M. A. Ryndin, D. Y. Turdakov, “Domain adaptation by proactive labeling”, Proceedings of ISP RAS, 31:5 (2019), 145–152

Citation in format AMSBIB

\Bibitem{RynTur19}

\by M.~A.~Ryndin, D.~Y.~Turdakov

\paper Domain adaptation by proactive labeling

\jour Proceedings of ISP RAS

\yr 2019

\vol 31

\issue 5

\pages 145--152

\mathnet{http://mi.mathnet.ru/tisp460}

\crossref{https://doi.org/10.15514/ISPRAS-2019-31(5)-11}

Linking options:

https://www.mathnet.ru/eng/tisp460

https://www.mathnet.ru/eng/tisp/v31/i5/p145

This publication is cited in the following 1 articles:

O. V. Belyaeva, A. I. Perminov, I. S. Kozlov, “Ispolzovanie sinteticheskikh dannykh dlya tonkoi nastroiki modelei segmentatsii dokumentov”, Trudy ISP RAN, 32:4 (2020), 189–202

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Proceedings of the Institute for System Programming of the RAS

Statistics & downloads:
Abstract page:	153
Full-text PDF :	44
References:	27

Registration to the website

Logotypes