Zapiski Nauchnykh Seminarov POMI
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Zap. Nauchn. Sem. POMI:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Zapiski Nauchnykh Seminarov POMI, 2021, Volume 499, Pages 248–266 (Mi znsl7052)  

II

Robust word vectors: context-informed embeddings for noisy texts

T. Khakhulina, V. Logachevab, V. Malykhcbd

a Skolkovo Institute of Science and Technology, Nobelya Ulitsa, 3, 121205, Moscow, Russia
b Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow Region
c Steklov Institute of Mathematics at St. Petersburg, nab. r. Fontanki, 27, 191023, St. Petersburg
d Institute for Systems Analysis, Federal Research Center “Computer Science and Control” of Russian Academy of Sciences, pr. 60-letiya Oktyabrya, 9, 117312, Moscow
References:
Abstract: We suggest a new language-independent architecture of robust word vectors (RoVe). It is designed to alleviate the issue of typos and misspellings, common in almost any user-generated content, which hinder automatic text processing. Our model is morphologically motivated, which allows it to deal with unseen word forms in morphologically rich languages. We present the results on a number of natural language processing (NLP) tasks and languages for a variety of related architectures and show that the proposed architecture is robust to typos.
Key words and phrases: word vectors, distributed representations, natural language processing.
Funding agency Grant number
PAO Sberbank 0000000007417F630002
National Technological Initiative
This work was supported by the National Technology Initiative and PAO Sberbank project ID 0000000007417F630002.
Received: 14.01.2019
Document Type: Article
UDC: 004.85
Language: English
Citation: T. Khakhulin, V. Logacheva, V. Malykh, “Robust word vectors: context-informed embeddings for noisy texts”, Investigations on applied mathematics and informatics. Part I, Zap. Nauchn. Sem. POMI, 499, POMI, St. Petersburg, 2021, 248–266
Citation in format AMSBIB
\Bibitem{KhaLogMal21}
\by T.~Khakhulin, V.~Logacheva, V.~Malykh
\paper Robust word vectors: context-informed embeddings for noisy texts
\inbook Investigations on applied mathematics and informatics. Part~I
\serial Zap. Nauchn. Sem. POMI
\yr 2021
\vol 499
\pages 248--266
\publ POMI
\publaddr St.~Petersburg
\mathnet{http://mi.mathnet.ru/znsl7052}
Linking options:
  • https://www.mathnet.ru/eng/znsl7052
  • https://www.mathnet.ru/eng/znsl/v499/p248
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Записки научных семинаров ПОМИ
    Statistics & downloads:
    Abstract page:89
    Full-text PDF :44
    References:20
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024