A. Yu. Egorova, I. M. Zatsman, O. S. Mamonova, “Supracorpora databases in linguistic projects”, Sistemy i Sredstva Inform., 29:3 (2019), 77

Sistemy i Sredstva Informatiki [Systems and Means of Informatics]

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive
	Impact factor

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Sistemy i Sredstva Inform.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Sistemy i Sredstva Informatiki [Systems and Means of Informatics], 2019, Volume 29, Issue 3, Pages 77–91
DOI: https://doi.org/10.14357/08696527190307 (Mi ssi656)

This article is cited in 8 scientific papers (total in 8 papers)

Supracorpora databases in linguistic projects

A. Yu. Egorova^a, I. M. Zatsman^a, O. S. Mamonova^b

^a Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
^b Faculty of Foreign Languages and Area Studies, M. V. Lomonosov Moscow State University, 1 Leninskie Gory, Bldg. 13-14, Moscow 119991, Russian Federation

Full-text PDF (504 kB) Citations (8)

References:

PDF

HTML

DOI: https://doi.org/10.14357/08696527190307

Abstract: The paper considers the task of providing linguistic studies with means of supracorpora databases containing aligned parallel texts (each includes the original text and its translation) as well as bilingual annotations of the researched linguistic items and their translation equivalents formed on the basis of parallel texts. Each annotation, formed by a linguist, fixes a translation model of a linguistic item. The experience of implementing several linguistic projects at Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences showed that not all translation models that linguists extract from parallel texts during linguistic annotation are described in bilingual dictionaries and handbooks. Thus, supracorpora databases allow researchers to create new knowledge about the translation equivalents of the researched linguistic items. It is extracted by linguists when comparing and annotating the sentences of the original text and their translations. The main aim of the paper is to describe the functions of supracorpora databases that provide linguists with new knowledge in the process of annotation.

Keywords: supracorpora database, linguistic annotation, linguistic unit, corpus linguistics, translation models.

Funding agency	Grant number
Russian Foundation for Basic Research	18-07-00192_а
The work was fullfilled at the Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences with partial support of the Russian Foundation for Basic Research (project 18-07-00192).

Received: 23.07.2019

Document Type: Article

Language: Russian

Citation: A. Yu. Egorova, I. M. Zatsman, O. S. Mamonova, “Supracorpora databases in linguistic projects”, Sistemy i Sredstva Inform., 29:3 (2019), 77–91

Citation in format AMSBIB

\Bibitem{EgoZatMam19}

\by A.~Yu.~Egorova, I.~M.~Zatsman, O.~S.~Mamonova

\paper Supracorpora databases in~linguistic projects

\jour Sistemy i Sredstva Inform.

\yr 2019

\vol 29

\issue 3

\pages 77--91

\mathnet{http://mi.mathnet.ru/ssi656}

\crossref{https://doi.org/10.14357/08696527190307}

Linking options:

https://www.mathnet.ru/eng/ssi656

https://www.mathnet.ru/eng/ssi/v29/i3/p77

This publication is cited in the following 8 articles:

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Statistics & downloads:
Abstract page:	193
Full-text PDF :	97
References:	30

Что такое QR-код?

Registration to the website

Logotypes