Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika"
Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika", 2023, Volume 12, Issue 1, Pages 28–45
DOI: https://doi.org/10.14529/cmse230102
(Mi vyurv291)
 

This article is cited in 1 scientific paper (total in 1 paper)

A method for creating structural models of text documents using neural networks

D. V. Berezkin, I. A. Kozlov, P. A. Martynyuk, A. M. Panfilkin

Bauman Moscow State Technical University (2nd Baumanskaya St. 5/1, Moscow, 105005, Russian Federation)
Abstract: The article describes modern BERT-based neural network models and considers their application to Natural Language Processing (NLP) tasks such as question answering and named entity recognition. It presents a method for automatically creating structural models of text documents. The proposed method is hybrid: it jointly uses several NLP models. It builds a structural model of a document by extracting sentences that correspond to various aspects of the document. Information is extracted with the BERT Question Answering model, using questions prepared separately for each aspect. The answers are filtered with the BERT Named Entity Recognition model and used to generate the contents of each field of the structural model. The article proposes two algorithms for generating field contents: the Exclusive answer choosing algorithm, used for short fields, and the Generalizing answer forming algorithm, used for voluminous fields. The article also describes a software implementation of the proposed method and discusses the results of experiments conducted to evaluate its quality.
Keywords: information extraction, neural network, named entity recognition, question-answering system.
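The pipeline outlined in the abstract (per-aspect questions, question answering, entity-based filtering, field generation) can be illustrated with a minimal sketch. This is not the authors' implementation: the names `Answer`, `QAModel`, `AnswerFilter`, `exclusive_choice`, `generalizing_answer` and `build_structural_model` are hypothetical, the two models are passed in as plain callables rather than real BERT models, and the selection rules (highest score for short fields, score-ordered concatenation of distinct answers for voluminous fields) are assumptions, since the abstract does not specify how the two algorithms work.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set


@dataclass
class Answer:
    text: str
    score: float  # confidence reported by the QA model


# Hypothetical interfaces standing in for the BERT models named in the abstract.
QAModel = Callable[[str, str], Answer]     # (question, document) -> Answer
AnswerFilter = Callable[[str, str], bool]  # (answer text, field name) -> keep?


def exclusive_choice(answers: List[Answer]) -> str:
    """Assumed rule for short fields: keep the single highest-scoring answer."""
    if not answers:
        return ""
    return max(answers, key=lambda a: a.score).text


def generalizing_answer(answers: List[Answer]) -> str:
    """Assumed rule for voluminous fields: join distinct answers, best first."""
    distinct: List[str] = []
    for answer in sorted(answers, key=lambda a: a.score, reverse=True):
        if answer.text not in distinct:
            distinct.append(answer.text)
    return "; ".join(distinct)


def build_structural_model(
    document: str,
    questions: Dict[str, List[str]],  # field name -> questions for that aspect
    short_fields: Set[str],           # fields handled by exclusive_choice
    qa: QAModel,
    keep_answer: AnswerFilter,
) -> Dict[str, str]:
    """Fill every field of the structural model from filtered QA answers."""
    model: Dict[str, str] = {}
    for field, field_questions in questions.items():
        # Ask every question prepared for this aspect of the document.
        answers = [qa(question, document) for question in field_questions]
        # Drop empty answers and answers rejected by the entity-based filter.
        answers = [a for a in answers if a.text and keep_answer(a.text, field)]
        if field in short_fields:
            model[field] = exclusive_choice(answers)
        else:
            model[field] = generalizing_answer(answers)
    return model
```

Passing the models as callables keeps the sketch independent of any particular library: a real question-answering model and a named-entity-based filter could be wrapped to match `QAModel` and `AnswerFilter` without changing the field-generation logic.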
Funding agency: Ministry of Science and Higher Education of the Russian Federation
This paper is a part of the research work carried out within the Bauman Deep Analytics project of the Priority 2030 program.
Received: 03.11.2022
Document Type: Article
UDC: 004.89
Language: English
Citation: D. V. Berezkin, I. A. Kozlov, P. A. Martynyuk, A. M. Panfilkin, “A method for creating structural models of text documents using neural networks”, Vestn. YuUrGU. Ser. Vych. Matem. Inform., 12:1 (2023), 28–45
Citation in format AMSBIB
\Bibitem{BerKozMar23}
\by D.~V.~Berezkin, I.~A.~Kozlov, P.~A.~Martynyuk, A.~M.~Panfilkin
\paper A method for creating structural models of text documents using neural networks
\jour Vestn. YuUrGU. Ser. Vych. Matem. Inform.
\yr 2023
\vol 12
\issue 1
\pages 28--45
\mathnet{http://mi.mathnet.ru/vyurv291}
\crossref{https://doi.org/10.14529/cmse230102}
Linking options:
  • https://www.mathnet.ru/eng/vyurv291
  • https://www.mathnet.ru/eng/vyurv/v12/i1/p28