E. P. Ofitserov, “Motif based sequence classification”, Chebyshevskii Sb., 19:1 (2018), 187

Chebyshevskii Sbornik

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Chebyshevskii Sb.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Chebyshevskii Sbornik, 2018, Volume 19, Issue 1, Pages 187–199
DOI: https://doi.org/10.22405/2226-8383-2018-19-1-187-199 (Mi cheb631)

This article is cited in 1 scientific paper (total in 1 paper)

Motif based sequence classification

E. P. Ofitserov

Tula State University

Full-text PDF (646 kB) Citations (1)

References:

PDF

HTML

DOI: https://doi.org/10.22405/2226-8383-2018-19-1-187-199

Abstract: Sequence classification problems often arise in such areas as bioinformatics and natural language processing. In the last few year best results in this field were achieved by the deep learning methods, especially by architectures based on recurrent neural networks (RNN). However, the common problem of such models is a lack of interpretability, i.e., extraction of key features from data that affect the most the model's decision. Meanwhile, using of less complicated neural network leads to decreasing predictive performance thus limiting usage of state-of-art machine learning methods in many subject areas. In this work we propose a novel interpretable deep learning architecture based on extraction of principal sets of short substrings — sequence motifs. The presence of extracted motif in the input sequence is a marker for a certain class. The key component of proposed solution is differential alignment algorithm developed by us, which provides a smooth analog of classical string comparison methods such as Levenshtein edit distance, and Smith–Waterman local alignment. Unlike previous works devoted to the motif based classification, which used CNN for shift-invariant searching, ours model provide a way to shift and gap invariant extraction of motifs.

Keywords: sequence classification, machine learning, neural network, motif extraction.

Bibliographic databases:

Document Type: Article

UDC: 004.032.26, 004.424.62

Language: Russian

Citation: E. P. Ofitserov, “Motif based sequence classification”, Chebyshevskii Sb., 19:1 (2018), 187–199

Citation in format AMSBIB

\Bibitem{Ofi18}

\by E.~P.~Ofitserov

\paper Motif based sequence classification

\jour Chebyshevskii Sb.

\yr 2018

\vol 19

\issue 1

\pages 187--199

\mathnet{http://mi.mathnet.ru/cheb631}

\crossref{https://doi.org/10.22405/2226-8383-2018-19-1-187-199}

\elib{https://elibrary.ru/item.asp?id=36312686}

Linking options:

https://www.mathnet.ru/eng/cheb631

https://www.mathnet.ru/eng/cheb/v19/i1/p187

This publication is cited in the following 1 articles:

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Statistics & downloads:
Abstract page:	314
Full-text PDF :	207
References:	39

Что такое QR-код?

Registration to the website

Logotypes