DETEKSI ANOMALI FILE PDF MALWARE PADA LAYANAN AGREGATOR GARBA RUJUKAN DIGITAL (GARUDA) DENGAN ALGORITMA DECISION TREE

YUNINGSIH, NOVI and Stiawan, Deris and Septian, Tri Wanda (2022) DETEKSI ANOMALI FILE PDF MALWARE PADA LAYANAN AGREGATOR GARBA RUJUKAN DIGITAL (GARUDA) DENGAN ALGORITMA DECISION TREE. Undergraduate thesis, Sriwijaya University.

[thumbnail of RAMA_56201_09011281823133.pdf] Text
RAMA_56201_09011281823133.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (26MB) | Request a copy
[thumbnail of RAMA_56201_09011281823133_TURNITIN.pdf] Text
RAMA_56201_09011281823133_TURNITIN.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (9MB) | Request a copy
[thumbnail of RAMA_56201_09011281823133_0003047905_0028098902_01_front_ref.pdf]
Preview
Text
RAMA_56201_09011281823133_0003047905_0028098902_01_front_ref.pdf - Submitted Version
Available under License Creative Commons Public Domain Dedication.

Download (6MB) | Preview
[thumbnail of RAMA_56201_09011281823133_0003047905_0028098902_02.pdf] Text
RAMA_56201_09011281823133_0003047905_0028098902_02.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (6MB) | Request a copy
[thumbnail of RAMA_56201_09011281823133_0003047905_0028098902_03.pdf] Text
RAMA_56201_09011281823133_0003047905_0028098902_03.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (5MB) | Request a copy
[thumbnail of RAMA_56201_09011281823133_0003047905_0028098902_04.pdf] Text
RAMA_56201_09011281823133_0003047905_0028098902_04.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (7MB) | Request a copy
[thumbnail of RAMA_56201_09011281823133_0003047905_0028098902_05.pdf] Text
RAMA_56201_09011281823133_0003047905_0028098902_05.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (421kB) | Request a copy
[thumbnail of RAMA_56201_09011281823133_0003047905_0028098902_06_ref.pdf] Text
RAMA_56201_09011281823133_0003047905_0028098902_06_ref.pdf - Bibliography
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_56201_09011281823133_0003047905_0028098902_07_lamp.pdf] Text
RAMA_56201_09011281823133_0003047905_0028098902_07_lamp.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (460kB) | Request a copy

Abstract

Portable Document Format (PDF) is a document exchange media that is very vulnerable to malicious attacks, namely Malware PDF. One of the services that most often use PDF files as a medium is a scientific publication service Garba Rujukan Digital (GARUDA). Therefore, research was conducted using static analysis methods for each PDF and data extraction using PDFiD. Based on these research, it found an oddity or anomaly to some PDF files so that the dataset is divided into three classes, namely PDF benign, PDF anomaly, and PDF malware. The generated dataset in this research is a dataset with imbalanced conditions and used Synthetic Minority Oversampling Technique (SMOTE) and NearMiss to balance the data. To classify malware PDF file attacks used one of the well-known machine learning methods, Decision Tree Algorithm. Classification divided into two types, classification with the original dataset (imbalanced dataset conditions) and classification with balanced dataset conditions. Then to validate the accuracy of the classification model used cross validation method, Stratified K-Fold Cross Validation. Based on classification results, the best performance obtained by the average percentage of accuracy 99.83%, precision 99.83%, recall 99.83%, F1-score 99.84%, TNR (true negative rate) 99.92%, AUC (area under curve) 99.88%, and FPR (false positive rate) 0.001 and FNR (false negative rate) 0.002.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: ALGORITMA DECISION TREE
Subjects: Q Science > Q Science (General) > Q300-390 Cybernetics > Q325.5 Machine learning
Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation.
Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA76.9.A25 Computer security. Systems and Data Security.
T Technology > T Technology (General) > T57.6-57.97 Operations research. Systems analysis > T57.85 Network systems theory Including network analysis Cf. TS157.5+ Scheduling
Divisions: 09-Faculty of Computer Science > 56201-Computer Systems (S1)
Depositing User: Novi Yuningsih
Date Deposited: 03 Jan 2023 02:12
Last Modified: 03 Jan 2023 02:12
URI: http://repository.unsri.ac.id/id/eprint/85028

Actions (login required)

View Item View Item