Computer Research and Modeling
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computer Research and Modeling:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Computer Research and Modeling, 2012, Volume 4, Issue 4, Pages 693–706
DOI: https://doi.org/10.20537/2076-7633-2012-4-4-693-706
(Mi crm522)
 

This article is cited in 14 scientific papers (total in 14 papers)

MATHEMATICAL MODELING AND NUMERICAL SIMULATION

Regularization, robustness and sparsity of probabilistic topic models

K. V. Vorontsova, A. A. Potapenkob

a RUKONT-PhysTech Laboratory, CMAM department, MIPT, 9 Institutskii per., Dolgoprudny, Moscow Region, 141700, Russia
b CMC department, Moscow State University, Leninskie gory, Moscow, 119991, Russia
References:
Abstract: We propose a generalized probabilistic topic model of text corpora which can incorporate heuristics of Bayesian regularization, sampling, frequent parameters update, and robustness in any combinations. Well- known models PLSA, LDA, CVB0, SWB, and many others can be considered as special cases of the proposed broad family of models. We propose the robust PLSA model and show that it is more sparse and performs better that regularized models like LDA.
Keywords: text analysis, topic modeling, probabilistic latent semantic analysis, EM-algorithm, latent Dirichlet allocation, Gibbs sampling, Bayesian regularization, perplexity, robusteness.
Received: 06.09.2012
Document Type: Article
UDC: 004.852
Language: Russian
Citation: K. V. Vorontsov, A. A. Potapenko, “Regularization, robustness and sparsity of probabilistic topic models”, Computer Research and Modeling, 4:4 (2012), 693–706
Citation in format AMSBIB
\Bibitem{VorPot12}
\by K.~V.~Vorontsov, A.~A.~Potapenko
\paper Regularization, robustness and sparsity of probabilistic topic models
\jour Computer Research and Modeling
\yr 2012
\vol 4
\issue 4
\pages 693--706
\mathnet{http://mi.mathnet.ru/crm522}
\crossref{https://doi.org/10.20537/2076-7633-2012-4-4-693-706}
Linking options:
  • https://www.mathnet.ru/eng/crm522
  • https://www.mathnet.ru/eng/crm/v4/i4/p693
  • This publication is cited in the following 14 articles:
    1. Ravil I. Mukhamediev, Marina Yelis, Kirill Yakunin, Yelena Popova, Yan Kuchin, Adilkhan Symagulov, Nadiya Yunicheva, Elena Zaitseva, Vitaly Levashenko, Elena Muhamedijeva, Viktors Gopejenko, Rustam Mussabayev, “Exploring the health care system's representation in the media through hierarchical topic modeling”, Cogent Engineering, 11:1 (2024)  crossref
    2. Antonina Pinchuk, Svetlana Karepova, Dmitry Tikhomirov, “Text Mining technologies in sociological analysis (using the example of studying students`ideas about the mission of a modern university)”, Sociologicheskaja nauka i social'naja praktika, 12:1 (2024), 62  crossref
    3. M. M. Gayanova, E. Yu. Sazonova, O. N. Smetanina, A. K. Sulejmanov, “Selection of Tools for Preprocessing and Thematic Modeling of Scientific Articles from the Data Lake”, Pattern Recognit. Image Anal., 33:3 (2023), 313  crossref
    4. Sergei Dosko, Vladimir Utencov, Aleksey Spasenov, Igor Lukashin, Kirill Kucherov, Lecture Notes on Data Engineering and Communications Technologies, 119, Advances in Artificial Systems for Power Engineering II, 2022, 170  crossref
    5. Wei Jiek Chong, Hui Na Chua, May Fen Gan, 2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET), 2022, 1  crossref
    6. Kirill Yakunin, Maksat Kalimoldayev, Ravil I. Mukhamediev, Rustam Mussabayev, Vladimir Barakhnin, Yan Kuchin, Sanzhar Murzakhmetov, Timur Buldybayev, Ulzhan Ospanova, Marina Yelis, Akylbek Zhumabayev, Viktors Gopejenko, Zhazirakhanym Meirambekkyzy, Alibek Abdurazakov, “KazNewsDataset: Single Country Overall Digital Mass Media Publication Corpus”, Data, 6:3 (2021), 31  crossref
    7. Kirill Yakunin, Ravil Mukhamediev, Yan Kuchin, Rustam Musabayev, Timur Buldybayev, Sanzhar Murzakhmetov, “Classification of negative publication in mass media using topic modeling”, J. Phys.: Conf. Ser., 1727:1 (2021), 012019  crossref
    8. Kirill Yakunin, Ravil I. Mukhamediev, Elena Zaitseva, Vitaly Levashenko, Marina Yelis, Adilkhan Symagulov, Yan Kuchin, Elena Muhamedijeva, Margulan Aubakirov, Viktors Gopejenko, “Mass Media as a Mirror of the COVID-19 Pandemic”, Computation, 9:12 (2021), 140  crossref
    9. Kirill Yakunin, Ravil I. Mukhamediev, Marina Yelis, Adilkhan Symagulov, Yan Kuchin, Elena Muhamedijeva, Jan Rabcan, Aubakirov Margulan, 2021 International Conference on Information and Digital Technologies (IDT), 2021, 260  crossref
    10. Yakunin Kirill, Ionescu George Mihail, Murzakhmetov Sanzhar, Mussabayev Rustam, Filatova Olga, Mukhamediev Ravil, “Propaganda Identification Using Topic Modelling”, Procedia Computer Science, 178 (2020), 205  crossref
    11. Ravil I. Mukhamediev, Kirill Yakunin, Rustam Mussabayev, Timur Buldybayev, Yan Kuchin, Sanzhar Murzakhmetov, Marina Yelis, “Classification of Negative Information on Socially Significant Topics in Mass Media”, Symmetry, 12:12 (2020), 1945  crossref
    12. V B Barakhnin, R I Mukhamedyev, R R Mussabaev, O Yu Kozhemyakina, A Issayeva, Ya I Kuchin, S B Murzakhmetov, K O Yakunin, “Methods to identify the destructive information”, J. Phys.: Conf. Ser., 1405:1 (2019), 012004  crossref
    13. E. V. Tutubalina, “Sovmestnaya veroyatnostnaya tematicheskaya model dlya identifikatsii problemnykh vyskazyvanii, svyazannykh narusheniem funktsionalnosti produktov”, Trudy ISP RAN, 27:4 (2015), 111–128  mathnet  crossref  elib
    14. Maria Saburova, Archil Maysuradze, Communications in Computer and Information Science, 518, Knowledge Engineering and Semantic Web, 2015, 168  crossref
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computer Research and Modeling
    Statistics & downloads:
    Abstract page:328
    Full-text PDF :136
    References:42
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2025