|
Modelirovanie i Analiz Informatsionnykh Sistem, 2013, Volume 20, Number 2, Pages 80–91
(Mi mais299)
|
|
|
|
Unified Classification Model for Geotagging Websites
A. N. Volkov Yandex LLC,
Leo Tolstoy St., 16, Moscow, 119021, Russia
Abstract:
The paper presents a novel approach to finding regional scopes (geotagging) of websites. Unlike the traditional approaches, which generally involve training a separate classification model for each class (region), the proposed method is based on training a single model which is used for all regions of the same type (e.g. cities). This approach is made possible by the usage of “relative” features which indicate how a selected region matches up to other regions for a given website. The classification system uses a variety of features of different nature that have not been yet used together for machine-learning based regional classification of websites. The evaluation demonstrates the advantage of our “one model per region type” method versus the traditional “one model per region” approach. A separate experiment demonstrates the ability of the proposed classifier to successfully detect regions which were not present in the training set (which is impossible for traditional approaches).
Keywords:
geotagging, classification models, machine learning.
Received: 12.12.2012
Citation:
A. N. Volkov, “Unified Classification Model for Geotagging Websites”, Model. Anal. Inform. Sist., 20:2 (2013), 80–91
Linking options:
https://www.mathnet.ru/eng/mais299 https://www.mathnet.ru/eng/mais/v20/i2/p80
|
Statistics & downloads: |
Abstract page: | 227 | Full-text PDF : | 112 | References: | 57 |
|