St. Petersburg Polytechnical University Journal. Computer Science. Telecommunication and Control Systems
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Computing, Telecommunication and Control:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


St. Petersburg Polytechnical University Journal. Computer Science. Telecommunication and Control Systems, 2015, Issue 5(229), Pages 79–87
DOI: https://doi.org/10.5862/JCSTCS.229.8
(Mi ntitu128)
 

Intellectual Systems and Technologies

Clustering of the external web environment of universities using a modified LSH algorithm

V. N. Korelin, I. S. Blekanov, S. L. Sergeev

St. Petersburg State University
Abstract: The paper is dedicated to cluster analysis of external web sites of large universities (web sites that refer to universities and web sites that are referred by universities). Web sites in Russia, the USA and the UK that have highest webometric ranking in their region were chosen as the subject of the study. The goal of the research is to identify a group of sites for each university that have the same kind of activity. The found clusters have been analyzed to determine the impact of group size and the number of groups on webometric ranking of university sites. To achieve the goal of the research, the authors developed a clustering algorithm based on the probabilistic method of reducing the dimension of multidimensional data (Locality-Sensitive Hashing, or LSH). An experiment that was conducted using the test data showed that the developed algorithm has good clustering quality and fast speed performance during massive dataset mining. The main results of the research are presented.
Keywords: webometrics, external web sites of universities, clustering, locality-sensitive hashing, min hashing, external web sites clustering, hyperlinks analysis.
Funding agency Grant number
Russian Foundation for Basic Research 15-01-06105_а
Document Type: Article
UDC: 025.4, 004
Language: Russian
Citation: V. N. Korelin, I. S. Blekanov, S. L. Sergeev, “Clustering of the external web environment of universities using a modified LSH algorithm”, St. Petersburg Polytechnical University Journal. Computer Science. Telecommunication and Control Sys, 2015, no. 5(229), 79–87
Citation in format AMSBIB
\Bibitem{KorBleSer15}
\by V.~N.~Korelin, I.~S.~Blekanov, S.~L.~Sergeev
\paper Clustering of the external web environment of universities using a modified LSH algorithm
\jour St. Petersburg Polytechnical University Journal. Computer Science. Telecommunication and Control Sys
\yr 2015
\issue 5(229)
\pages 79--87
\mathnet{http://mi.mathnet.ru/ntitu128}
\crossref{https://doi.org/10.5862/JCSTCS.229.8}
Linking options:
  • https://www.mathnet.ru/eng/ntitu128
  • https://www.mathnet.ru/eng/ntitu/y2015/i5/p79
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Computing, Telecommunication and Control
    Statistics & downloads:
    Abstract page:138
    Full-text PDF :54
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024