|
This article is cited in 1 scientific paper (total in 1 paper)
Artificial Intelligence, Knowledge and Data Engineering
Comparative analysis of scientific journals collections
F. V. Krasnova, M. E. Shvartsmanbc, A. V. Dimentovb a Gazprom Neft Science and Technology Center
b National Electronic Information Consortium
c Russian State Library
Abstract:
The authors developed an approach to comparative analysis of scientific journals collections based on the analysis of co-authors graph and the text model. The use of time series of co-authorship graphs metrics allowed the authors to analyze trends in the development of journal authors. The text model was built using machine learning techniques. The journals content was classified to determine the authenticity degree of various journals and different issues of a single journal via a text model. The authors developed a metric of Content Authenticity Ratio, which allows quantifying the authenticity of journal collections in comparison. Comparative thematic analysis of journals collections was carried out using the thematic model with additive regularization. Based on the created thematic model, the authors constructed thematic profiles of the journals archives in a single thematic basis. The approach developed by the authors was applied to archives of two journals on the Rheumatology for the period 2000–2018. As a benchmark for comparing the co-author’s metrics, public data sets from the SNAP research laboratory at Stanford University were used. As a result, the authors adapted the existing examples of the effective functioning of the authors collaborations in order to improve the work of journals editorial staff. Quantitative comparison of large volumes of texts and metadata of scientific articles was carried out. As a result of the experiment conducted using the developed methods, it was shown that the content authenticity of the selected journals is 89%, co-authorships in one of the journals have a pronounced centrality, which is a distinctive feature of the policy editor. The clarity and consistency of the results confirm the effectiveness of the approach proposed by the authors. The code developed in the course of the experiment in the Python programming language can be used for comparative analysis of other collections of journals in the Russian language.
Keywords:
comparative thematic analysis, comparative text model, deep text analysis, social network analysis, graph metrics.
Received: 22.05.2019
Citation:
F. V. Krasnov, M. E. Shvartsman, A. V. Dimentov, “Comparative analysis of scientific journals collections”, Tr. SPIIRAN, 18:3 (2019), 767–793
Linking options:
https://www.mathnet.ru/eng/trspy1063 https://www.mathnet.ru/eng/trspy/v18/i3/p767
|
Statistics & downloads: |
Abstract page: | 207 | Full-text PDF : | 120 |
|