Proceedings of the Institute for System Programming of the RAS
Proceedings of the Institute for System Programming of the RAS, 2015, Volume 27, Issue 3, Pages 291–302
DOI: https://doi.org/10.15514/ISPRAS-2015-27(3)-20
(Mi tisp152)
 

Combined classifier for website messages filtration

Veniamin Tarasov, Ekaterina Mezenceva, Danila Karbaev

Volga Region State University of Telecommunications and Informatics
Abstract: The paper describes a new approach to filtering website messages using a combined classifier. Information security standards for internet resources require user data protection; however, the increasing volume of spam messages in interactive sections of websites poses a special problem. Spam messages vary significantly in content, but their common feature is that they are usually of little interest to the majority of recipients. Many filtering approaches are based on the naive Bayesian classifier, an effective method for automatically constructing high-performance anti-spam filters. Unlike many email filtering solutions, the proposed approach is based on an effective combination of the Bayes and Fisher methods, which allows us to build an accurate and stable spam filter. In this paper we consider the organization of a combined classifier according to defined optimization criteria based on statistical methods, probability calculations and decision rules, including the optimization criteria for grading messages. Classifiers normally accept a compromise between acceptable levels of false-positive and false-negative errors and use threshold values for decision-making, which may vary. To obtain more reliable spam detection results, we need to analyze the sets of results produced by the various filters and the subsets where they overlap. The approach we suggest is a classifier organization that combines the Bayes and Fisher methods to improve filtration quality, based on the analysis of the subsets and set overlaps identified by both methods (spam, non-spam, false triggering and spam leaks).
Keywords: combined classifier, spam filter, optimization criterion.
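The combination described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact decision rule, so the agreement-based `combine` function, the 0.5 default thresholds, the add-one smoothing, and every name below (`WordStats`, `bayes_score`, `fisher_score`, `chi2_survival`) are assumptions made for illustration. The Fisher score uses the widely known chi-square combining of per-word probabilities; the paper's own formulation may differ.

```python
import math
import re
from collections import Counter

LABELS = ("spam", "ham")


def tokenize(text):
    """Return the set of distinct lowercase word tokens in a message."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


class WordStats:
    """Per-class message and word counts (hypothetical helper)."""

    def __init__(self):
        self.docs = Counter()
        self.words = {label: Counter() for label in LABELS}

    def train(self, text, label):
        self.docs[label] += 1
        self.words[label].update(tokenize(text))

    def word_prob(self, word, label):
        # P(word | label) with add-one smoothing, so it is never 0 or 1.
        return (self.words[label][word] + 1) / (self.docs[label] + 2)


def bayes_score(stats, text):
    """Naive Bayes posterior P(spam | message), computed in log space."""
    total = sum(stats.docs.values())
    # Smoothed class priors (assumption) avoid log(0) for an untrained class.
    logp = {c: math.log((stats.docs[c] + 1) / (total + 2)) for c in LABELS}
    for word in tokenize(text):
        for c in LABELS:
            logp[c] += math.log(stats.word_prob(word, c))
    top = max(logp.values())
    weight = {c: math.exp(logp[c] - top) for c in LABELS}
    return weight["spam"] / (weight["spam"] + weight["ham"])


def chi2_survival(x, dof):
    """P(X >= x) for a chi-square variable with an even number of dof."""
    m = x / 2.0
    term = total = math.exp(-m)
    for i in range(1, dof // 2):
        term *= m / i
        total += term
    return min(total, 1.0)


def fisher_score(stats, text):
    """Fisher-method spam indicator in [0, 1]; 0.5 means no evidence."""
    tokens = tokenize(text)
    ln_spam = ln_ham = 0.0
    for word in tokens:
        ps = stats.word_prob(word, "spam")
        ph = stats.word_prob(word, "ham")
        f = ps / (ps + ph)  # per-word spam probability
        ln_spam += math.log(f)
        ln_ham += math.log(1.0 - f)
    dof = 2 * len(tokens)
    spam_evidence = 1.0 - chi2_survival(-2.0 * ln_ham, dof)
    ham_evidence = 1.0 - chi2_survival(-2.0 * ln_spam, dof)
    return (1.0 + spam_evidence - ham_evidence) / 2.0


def combine(bayes_p, fisher_p, bayes_threshold=0.5, fisher_threshold=0.5):
    """Assumed rule: accept a label only where both methods agree."""
    by_bayes = bayes_p >= bayes_threshold
    by_fisher = fisher_p >= fisher_threshold
    if by_bayes and by_fisher:
        return "spam"
    if not by_bayes and not by_fisher:
        return "ham"
    return "review"  # the two result sets do not overlap for this message
```

In this sketch, raising either threshold trades spam leaks for fewer false triggerings, and messages on which the two methods disagree are routed to a separate `"review"` outcome rather than being forced into one class.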
Document Type: Article
Language: English
Citation: Veniamin Tarasov, Ekaterina Mezenceva, Danila Karbaev, “Combined classifier for website messages filtration”, Proceedings of ISP RAS, 27:3 (2015), 291–302
Citation in format AMSBIB
\Bibitem{TarMezKar15}
\by Veniamin~Tarasov, Ekaterina~Mezenceva, Danila~Karbaev
\paper Combined classifier for website messages filtration
\jour Proceedings of ISP RAS
\yr 2015
\vol 27
\issue 3
\pages 291--302
\mathnet{http://mi.mathnet.ru/tisp152}
\crossref{https://doi.org/10.15514/ISPRAS-2015-27(3)-20}
\elib{https://elibrary.ru/item.asp?id=23832949}
Linking options:
  • https://www.mathnet.ru/eng/tisp152
  • https://www.mathnet.ru/eng/tisp/v27/i3/p291