Using Decision Tree Algorithms in Detecting Spam Emails Written in Malay: A Comparison Study

Salim, Mohammad (2022) Using Decision Tree Algorithms in Detecting Spam Emails Written in Malay: A Comparison Study. ITM Web of Conferences, 42.

Text (Research Article)
itmconf_icacs2022_01001.pdf - Published Version

Official URL: https://www.itm-conferences.org/articles/itmconf/a...

Abstract

Email has become the fastest and most economical form of communication. However, over the past few years, the growing number of email users has led to a dramatic increase in spam. Various anti-spam techniques have been developed to minimize, if not eliminate, the spam problem. In this paper, we study how different decision tree algorithms differ in their effectiveness at classifying emails and combating spam. We chose Universiti Utara Malaysia emails as a case study. To find the best possible classification accuracy, we compared the performance of six algorithms: Random Forest, LMT, Decision Stump, J48, Random Tree, and REP Tree. The experimental results showed that the Decision Stump algorithm was the most effective at classifying the emails: its F-measure, precision, and recall scores were higher than those of the other algorithms compared.
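For readers unfamiliar with the three scores named in the abstract, the following is a minimal sketch of how precision, recall, and F-measure are conventionally computed for a binary spam classifier. It is not the authors' evaluation code: the function name `precision_recall_f1`, the label strings "spam" and "ham", and the toy label lists are all hypothetical and chosen only for illustration.

```python
def precision_recall_f1(y_true, y_pred, positive="spam"):
    """Compute precision, recall, and F-measure (F1) for one positive class."""
    pairs = list(zip(y_true, y_pred))
    # True positives: spam correctly flagged as spam.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    # False positives: legitimate mail wrongly flagged as spam.
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # False negatives: spam that slipped through as legitimate.
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Hypothetical toy labels, not data from the study.
y_true = ["spam", "spam", "ham", "ham", "spam", "ham"]
y_pred = ["spam", "ham", "ham", "spam", "spam", "ham"]

precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

Comparing several classifiers, as the study does, would amount to computing these three scores for each algorithm's predictions on the same held-out emails.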

Item Type: Article
Subjects: Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Depositing User: ePrints deposit
Date Deposited: 26 May 2022 13:39
Last Modified: 26 May 2022 13:39
URI: http://eprints.tiu.edu.iq/id/eprint/914
