Vestnik Sankt-Peterburgskogo Universiteta. Seriya 10. Prikladnaya Matematika. Informatika. Protsessy Upravleniya
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Vestnik Sankt-Peterburgskogo Universiteta. Seriya 10. Prikladnaya Matematika. Informatika. Protsessy Upravleniya, 2023, Volume 19, Issue 1, Pages 43–50
DOI: https://doi.org/10.21638/11701/spbu10.2023.104
(Mi vspui565)
 

Applied mathematics

Finding the presence of borrowings in scientific works based on Markov chains

R. R. Saakyana, I. A. Shpekhtb, G. A. Petrosyana

a Vanadzor State University named after H. Tumanyan, 36, ul. Tigran Mets, Vanadzor, 2001, Republic of Armenia
b Academy of Marketing and Social Information Technologies — IMSIT, 5, Zipovskaya ul., Krasnodar, 350010, Russian Federation
References:
Abstract: The study aims to develop optimal approaches to the search for borrowings in scientific works. The article discusses the stages of searching for the presence of borrowings, such as preprocessing, rough filtering of texts, searching for similar texts, and searching for borrowings. The main focus is on the description of approaches and techniques that can be effectively implemented at each stage. For example, for the preprocessing stage, it may be converting text characters from uppercase to lowercase, removing punctuation marks, and removing stop words. For the stage of rough text filtering, it is filters by topic and word frequency. It may be calculating the importance of words in the context of the text and representing the word as a vector in multidimensional space to determine the proximity measure for the stage of finding similar texts. Finally, it is a search for an exact match, paraphrases and a measure of similarity of expressions for the stage of finding borrowings. The scientific novelty lies in using Markov chains to find the similarity of texts for the second and third stages of the search for borrowings proposed by authors. As a result, the example shows the technique of using Markov chains for text representation, searching for the most frequently occurring words, building a graph of a Markov chain of words, and the prospects for using Markov chains of texts for rough filtering and searching for similar texts.
Keywords: search for borrowings, algorithms for finding borrowings, Markov chains, originality checker software.
Received: November 13, 2022
Accepted: January 19, 2023
Document Type: Article
UDC: 004.021
MSC: 68W05
Language: Russian
Citation: R. R. Saakyan, I. A. Shpekht, G. A. Petrosyan, “Finding the presence of borrowings in scientific works based on Markov chains”, Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr., 19:1 (2023), 43–50
Citation in format AMSBIB
\Bibitem{SaaShpPet23}
\by R.~R.~Saakyan, I.~A.~Shpekht, G.~A.~Petrosyan
\paper Finding the presence of borrowings in scientific works based on Markov chains
\jour Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr.
\yr 2023
\vol 19
\issue 1
\pages 43--50
\mathnet{http://mi.mathnet.ru/vspui565}
\crossref{https://doi.org/10.21638/11701/spbu10.2023.104}
Linking options:
  • https://www.mathnet.ru/eng/vspui565
  • https://www.mathnet.ru/eng/vspui/v19/i1/p43
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Вестник Санкт-Петербургского университета. Серия 10. Прикладная математика. Информатика. Процессы управления
    Statistics & downloads:
    Abstract page:24
    Full-text PDF :8
    References:9
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024