Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika"
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Vestn. YuUrGU. Ser. Vych. Matem. Inform.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika", 2022, Volume 11, Issue 4, Pages 51–66
DOI: https://doi.org/10.14529/cmse220404
(Mi vyurv287)
 

Developing intelligent assistants to searchfor content on websites of a certain genre

V. D. Rublev, E. A. Sidorova

A.P. Ershov Institute of Informatics Systems, Siberian Branch of the Russian Academy of Sciences (Lavrentieva Avenue 6, Novosibirsk, 630090 Russia)
Abstract: This paper discusses an approach to automatic generation of intelligent assistants, which provide information search on the content of a website. A feature of the approach is to use genre models, developed for a given type of resource (educational, informational, etc.), on the basis of which the genre structuring and subsequent thematic clustering of the content of the target website is performed. The resulting genre structures allow us to define more precisely the boundaries of thematic clusters related to the topic of the user's search query. The search quality evaluation for the Russian-language websites showed an F-score of 87.8% and originality of 80.9%, which exceeds the Yandex search engine results by 1.1% and 9.1%, respectively. In order to predict user information needs, a method for refining the resulting sample is proposed. It allows a user to get information implicitly, based on current and previous queries, about what the user was not satisfied with in the previous search results. A model of user's search intentions has been developed and its computational component includes a method for evaluating query closeness based on the FRiS function. Based on the proposed methods, a chatbot was created on the Telegram messenger platform to search the websites of educational institutions. The experiments showed that the user needs the average of 1.75 qualifying questions to find the necessary information.
Keywords: information retrieval, intelligent assistant, website genre model, thematic analysis, information retrieval system, user search intent model.
Received: 06.11.2022
Document Type: Article
UDC: 004.912
Language: English
Citation: V. D. Rublev, E. A. Sidorova, “Developing intelligent assistants to searchfor content on websites of a certain genre”, Vestn. YuUrGU. Ser. Vych. Matem. Inform., 11:4 (2022), 51–66
Citation in format AMSBIB
\Bibitem{RubSid22}
\by V.~D.~Rublev, E.~A.~Sidorova
\paper Developing intelligent assistants to searchfor content on websites of a certain genre
\jour Vestn. YuUrGU. Ser. Vych. Matem. Inform.
\yr 2022
\vol 11
\issue 4
\pages 51--66
\mathnet{http://mi.mathnet.ru/vyurv287}
\crossref{https://doi.org/10.14529/cmse220404}
Linking options:
  • https://www.mathnet.ru/eng/vyurv287
  • https://www.mathnet.ru/eng/vyurv/v11/i4/p51
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika"
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2025