Sistemy i Sredstva Informatiki [Systems and Means of Informatics]
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Sistemy i Sredstva Inform.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Sistemy i Sredstva Informatiki [Systems and Means of Informatics], 2019, Volume 29, Issue 3, Pages 77–91
DOI: https://doi.org/10.14357/08696527190307
(Mi ssi656)
 

This article is cited in 8 scientific papers (total in 8 papers)

Supracorpora databases in linguistic projects

A. Yu. Egorovaa, I. M. Zatsmana, O. S. Mamonovab

a Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
b Faculty of Foreign Languages and Area Studies, M. V. Lomonosov Moscow State University, 1 Leninskie Gory, Bldg. 13-14, Moscow 119991, Russian Federation
Full-text PDF (504 kB) Citations (8)
References:
Abstract: The paper considers the task of providing linguistic studies with means of supracorpora databases containing aligned parallel texts (each includes the original text and its translation) as well as bilingual annotations of the researched linguistic items and their translation equivalents formed on the basis of parallel texts. Each annotation, formed by a linguist, fixes a translation model of a linguistic item. The experience of implementing several linguistic projects at Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences showed that not all translation models that linguists extract from parallel texts during linguistic annotation are described in bilingual dictionaries and handbooks. Thus, supracorpora databases allow researchers to create new knowledge about the translation equivalents of the researched linguistic items. It is extracted by linguists when comparing and annotating the sentences of the original text and their translations. The main aim of the paper is to describe the functions of supracorpora databases that provide linguists with new knowledge in the process of annotation.
Keywords: supracorpora database, linguistic annotation, linguistic unit, corpus linguistics, translation models.
Funding agency Grant number
Russian Foundation for Basic Research 18-07-00192_а
The work was fullfilled at the Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences with partial support of the Russian Foundation for Basic Research (project 18-07-00192).
Received: 23.07.2019
Document Type: Article
Language: Russian
Citation: A. Yu. Egorova, I. M. Zatsman, O. S. Mamonova, “Supracorpora databases in linguistic projects”, Sistemy i Sredstva Inform., 29:3 (2019), 77–91
Citation in format AMSBIB
\Bibitem{EgoZatMam19}
\by A.~Yu.~Egorova, I.~M.~Zatsman, O.~S.~Mamonova
\paper Supracorpora databases in~linguistic projects
\jour Sistemy i Sredstva Inform.
\yr 2019
\vol 29
\issue 3
\pages 77--91
\mathnet{http://mi.mathnet.ru/ssi656}
\crossref{https://doi.org/10.14357/08696527190307}
Linking options:
  • https://www.mathnet.ru/eng/ssi656
  • https://www.mathnet.ru/eng/ssi/v29/i3/p77
  • This publication is cited in the following 8 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Системы и средства информатики
    Statistics & downloads:
    Abstract page:176
    Full-text PDF :79
    References:16
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024