KLASIFIKASI SPAM EMAIL MENGGUNAKAN ALGORITMA PRINCIPAL COMPONENT ANALYSIS (PCA) DAN DECISION TREE

SHOLIHAH, SITI PEBSYA ROISATUN and Stiawan, Deris (2020) KLASIFIKASI SPAM EMAIL MENGGUNAKAN ALGORITMA PRINCIPAL COMPONENT ANALYSIS (PCA) DAN DECISION TREE. Undergraduate thesis, Sriwijaya University.

[thumbnail of RAMA_56201_09011281520102.pdf] Text
RAMA_56201_09011281520102.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (4MB) | Request a copy
[thumbnail of RAMA_56201_09011281520102_TURNITIN.pdf] Text
RAMA_56201_09011281520102_TURNITIN.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (8MB) | Request a copy
[thumbnail of RAMA_56201_09011281520102_0003047905_01_front_ref.pdf]
Preview
Text
RAMA_56201_09011281520102_0003047905_01_front_ref.pdf - Accepted Version
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Preview
[thumbnail of RAMA_56201_09011281520102_0003047905_02.pdf] Text
RAMA_56201_09011281520102_0003047905_02.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (873kB) | Request a copy
[thumbnail of RAMA_56201_09011281520102_0003047905_03.pdf] Text
RAMA_56201_09011281520102_0003047905_03.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (933kB) | Request a copy
[thumbnail of RAMA_56201_09011281520102_0003047905_04.pdf] Text
RAMA_56201_09011281520102_0003047905_04.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_56201_09011281520102_0003047905_05.pdf] Text
RAMA_56201_09011281520102_0003047905_05.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (191kB) | Request a copy
[thumbnail of RAMA_56201_09011281520102_0003047905_06_ref.pdf] Text
RAMA_56201_09011281520102_0003047905_06_ref.pdf - Bibliography
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (309kB) | Request a copy
[thumbnail of RAMA_56201_09011281520102_0003047905_07_lamp.pdf] Text
RAMA_56201_09011281520102_0003047905_07_lamp.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (532kB) | Request a copy

Abstract

Email spam is a topic of problem that will continue to increase because it is easy and cheap to send email, which can be annoying and time-consuming for users. For that reason, the classification of spam emails is still challenging because there are still a lot of spam emails. This research was conducted on two email spam datasets, namely the Spambase dataset obtained from UCI Machine Learning, and the Emails dataset obtained from Kaggle. Spam classification is done using the Decision Tree algorithm. The classification process is carried out after the pre-processing stage, namely by doing text mining (Email dataset only), separating data, scaling data, and applying the Principal Component Analysis (PCA) algorithm as a sign of the number of features in the dataset based on the value that is important the influence of each feature. . The results of the classification using the Decision Tree Algorithm are 93.16% for the Spambase dataset and 94.24% for the Emails dataset. Meanwhile, the application of PCA to the Decision Tree resulted in a value of 90% for the Spambase dataset and 89.53% for the Emails dataset.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: Spam Email, Klasifikasi, Decision Tree, Principal Component Analysis (PCA)
Subjects: T Technology > T Technology (General) > T10.5-11.9 Communication of technical information
T Technology > T Technology (General) > T57.6-57.97 Operations research. Systems analysis
T Technology > TA Engineering (General). Civil engineering (General) > TA174.A385 Engineering design--Data processing. Manufacturing processes--Data processing. Computer integrated manufacturing systems. Manufacturing processes--Automation. CAD/CAM systems.
Divisions: 09-Faculty of Computer Science > 56201-Computer Systems (S1)
Depositing User: Users 8193 not found.
Date Deposited: 24 Sep 2020 06:42
Last Modified: 24 Sep 2020 06:42
URI: http://repository.unsri.ac.id/id/eprint/35599

Actions (login required)

View Item View Item