Matematicheskaya Biologiya i Bioinformatika
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Mat. Biolog. Bioinform.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Matematicheskaya Biologiya i Bioinformatika, 2016, Volume 11, Issue 1, Pages 14–23
DOI: https://doi.org/10.17537/2016.11.14
(Mi mbb248)
 

Bioinformatics

Number of overlaps in patterns

E. I. Furletovaa, M. A. Roytbergabc

a Institute of Mathematical Problems of Biology, Russian Academy of Science, Pushchino, Moscow Region, Russia
b Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, Russia
c Higher School of Economics, Moscow, Russia
References:
Abstract: The aim of the paper is to estimate the number of overlaps in the given pattern. The pattern is a set of words of same length $m$ in an alphabet $A$. We present theoretical and experimental bounds for overlaps number in two types of patterns. Firstly, we considered random patterns which relate to uniform probability model, i.e. all letters in the alphabet and, correspondently, all words of same length are equiprobable. We proved that the average number of overlaps $P$ for random patterns consisting of $n$ words of length $m$ linearly depends on pattern size $n$ and is independent of length of pattern words. In performed computer experiments the ratio $P/n$ ranged from $0.33$ till $1.06$; the theoretical evaluations of the ratio for the patterns do not exceed $1.67$. The secondly, we studied the patterns described by position weight matrices (PWM) from the data base HOCOMOCO and various cut-offs. For such patterns the ratio $P/n$ in experiments ranged from $0.004$ till $1$, for most of the patterns it is smaller then $0.1$.
Key words: overlap, pattern, pattern occurrence in a sequence.
Funding agency Grant number
Russian Foundation for Basic Research 14-04-32220_мол_а
14-01-93106_НЦНИЛ_а
16-04-01640_а
Received 19.11.2015, Published 27.01.2016
Document Type: Article
UDC: 510.52:519.21
Language: Russian
Citation: E. I. Furletova, M. A. Roytberg, “Number of overlaps in patterns”, Mat. Biolog. Bioinform., 11:1 (2016), 14–23
Citation in format AMSBIB
\Bibitem{FurRoi16}
\by E.~I.~Furletova, M.~A.~Roytberg
\paper Number of overlaps in patterns
\jour Mat. Biolog. Bioinform.
\yr 2016
\vol 11
\issue 1
\pages 14--23
\mathnet{http://mi.mathnet.ru/mbb248}
\crossref{https://doi.org/10.17537/2016.11.14}
Linking options:
  • https://www.mathnet.ru/eng/mbb248
  • https://www.mathnet.ru/eng/mbb/v11/i1/p14
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Statistics & downloads:
    Abstract page:148
    Full-text PDF :49
    References:27
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024