Program Systems: Theory and Applications
Program Systems: Theory and Applications, 2019, Volume 10, Issue 4, Pages 181–199
DOI: https://doi.org/10.25209/2079-3316-2019-10-4-181-199
(Mi ps358)
 

This article is cited in 3 scientific papers (total in 3 papers)

Artificial Intelligence, Intelligent Systems, Neural Networks

PaRuS — syntax annotated Russian corpus

N. A. Vlasova, I. V. Trofimov, Yu. P. Serdyuk, E. A. Suleymanova, I. N. Vozdvizhenskiy

Ailamazyan Program Systems Institute of Russian Academy of Sciences
Abstract: We present PaRuS (Parsed Russian Sentences), a new annotated corpus of Russian. The corpus contains over 2.5 billion tokens and is intended for computational linguistics tasks that rely on machine learning. PaRuS is a collection of annotated sentences in literary Russian. The linguistic annotation comprises morphological features in MULTEXT-East format and syntactic information in SynTagRus notation. We describe the methodology of corpus creation and PaRuS_pipe, the hybrid linguistic pipeline developed to annotate the sentences. We also discuss the quality of the linguistic annotation in PaRuS and evaluate the PaRuS_pipe morphological analyzer following the MorphoRuEval-2017 competition methodology.
Key words and phrases: computational linguistics, corpus linguistics, Russian, language corpus, annotation, morphology, syntax.
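The abstract states that morphological features are recorded in MULTEXT-East format. As a rough illustration of what such a positional tag encodes, the sketch below decodes a noun tag into named features. The position table is an assumption based on the MULTEXT-East morphosyntactic specification for Russian nouns and should be checked against that specification; the function name `decode_noun_msd` and the sample tag are hypothetical and are not taken from PaRuS or PaRuS_pipe.

```python
# Minimal sketch: decode a MULTEXT-East-style positional tag (MSD) for a noun.
# ASSUMPTION: the attribute order and value codes below follow the
# MULTEXT-East Russian noun specification; verify before relying on them.
NOUN_POSITIONS = [
    ("Type", {"c": "common", "p": "proper"}),
    ("Gender", {"m": "masculine", "f": "feminine", "n": "neuter", "c": "common"}),
    ("Number", {"s": "singular", "p": "plural"}),
    ("Case", {"n": "nominative", "g": "genitive", "d": "dative",
              "a": "accusative", "v": "vocative", "l": "locative",
              "i": "instrumental"}),
    ("Animate", {"n": "no", "y": "yes"}),
]


def decode_noun_msd(tag):
    """Return a dict of named features for a noun tag such as 'Ncfsgn'."""
    if not tag or tag[0] != "N":
        raise ValueError(f"not a noun tag: {tag!r}")
    features = {"CATEGORY": "Noun"}
    # Each character after the category letter fills one attribute position.
    for (name, values), code in zip(NOUN_POSITIONS, tag[1:]):
        if code == "-":  # a hyphen marks an attribute that is not specified
            continue
        features[name] = values.get(code, f"unknown({code})")
    return features


if __name__ == "__main__":
    # Hypothetical tag: common feminine singular genitive inanimate noun.
    print(decode_noun_msd("Ncfsgn"))
```

The positional layout keeps tags compact, which matters at the scale of billions of tokens, at the cost of requiring a lookup table like the one above to read them.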
Funding: Russian Foundation for Basic Research, grant no. 19-07-00779
Received: 19.11.2019
Accepted: 26.12.2019
Document Type: Article
UDC: 004.89:81'322.2
BBK: Ш111:З813
MSC: Primary 68T50; Secondary 91F20
Language: Russian
Citation: N. A. Vlasova, I. V. Trofimov, Yu. P. Serdyuk, E. A. Suleymanova, I. N. Vozdvizhenskiy, “PaRuS — syntax annotated Russian corpus”, Program Systems: Theory and Applications, 10:4 (2019), 181–199
Citation in format AMSBIB
\Bibitem{VlaTroSer19}
\by N.~A.~Vlasova, I.~V.~Trofimov, Yu.~P.~Serdyuk, E.~A.~Suleymanova, I.~N.~Vozdvizhenskiy
\paper PaRuS — syntax annotated Russian corpus
\jour Program Systems: Theory and Applications
\yr 2019
\vol 10
\issue 4
\pages 181--199
\mathnet{http://mi.mathnet.ru/ps358}
\crossref{https://doi.org/10.25209/2079-3316-2019-10-4-181-199}
Linking options:
  • https://www.mathnet.ru/eng/ps358
  • https://www.mathnet.ru/eng/ps/v10/i4/p181