Abstract
Most of current spam email detection systems use keywords in a blacklist to detect spam emails. However these keywords can be written as misspellings, for example "baank", "ba-nk" and "bankk" instead of "bank". Moreover, misspellings are changed from time to time and hence spam email detection system needs to constantly update the blacklist to detect spam emails containing such misspellings. However it is impossible to predict all possible misspellings for a given keyword to add those to the blacklist. We present a possibility theory-based approach to spam email detection to solve this problem. We consider every keyword in the blacklist along with its misspellings as a fuzzy set and propose a possibility function. This function will be used to calculate a possibility score for an unknown email. Using a proposed if-then rule and this core, we can decide whether or not this unknown email is spam. Experimental results are also presented
Original language | English |
---|---|
Title of host publication | 2007 IEEE International Conference on Granular Computing (GRC 2007) |
Editors | T.Y Lin, X Hu |
Place of Publication | United States |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 571-575 |
Number of pages | 5 |
ISBN (Print) | 9780769530321 |
DOIs | |
Publication status | Published - 2007 |
Event | IEEE International Conference on Granular Computing - San Jose, United States Duration: 2 Nov 2007 → 4 Nov 2007 |
Conference
Conference | IEEE International Conference on Granular Computing |
---|---|
Country/Territory | United States |
City | San Jose |
Period | 2/11/07 → 4/11/07 |