YUNINGSIH, NOVI and Stiawan, Deris and Septian, Tri Wanda (2022) DETEKSI ANOMALI FILE PDF MALWARE PADA LAYANAN AGREGATOR GARBA RUJUKAN DIGITAL (GARUDA) DENGAN ALGORITMA DECISION TREE. Undergraduate thesis, Sriwijaya University.
Text
RAMA_56201_09011281823133.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (26MB) | Request a copy |
|
Text
RAMA_56201_09011281823133_TURNITIN.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (9MB) | Request a copy |
|
Preview |
Text
RAMA_56201_09011281823133_0003047905_0028098902_01_front_ref.pdf - Submitted Version Available under License Creative Commons Public Domain Dedication. Download (6MB) | Preview |
Text
RAMA_56201_09011281823133_0003047905_0028098902_02.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (6MB) | Request a copy |
|
Text
RAMA_56201_09011281823133_0003047905_0028098902_03.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (5MB) | Request a copy |
|
Text
RAMA_56201_09011281823133_0003047905_0028098902_04.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (7MB) | Request a copy |
|
Text
RAMA_56201_09011281823133_0003047905_0028098902_05.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (421kB) | Request a copy |
|
Text
RAMA_56201_09011281823133_0003047905_0028098902_06_ref.pdf - Bibliography Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
|
Text
RAMA_56201_09011281823133_0003047905_0028098902_07_lamp.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (460kB) | Request a copy |
Abstract
Portable Document Format (PDF) is a document exchange media that is very vulnerable to malicious attacks, namely Malware PDF. One of the services that most often use PDF files as a medium is a scientific publication service Garba Rujukan Digital (GARUDA). Therefore, research was conducted using static analysis methods for each PDF and data extraction using PDFiD. Based on these research, it found an oddity or anomaly to some PDF files so that the dataset is divided into three classes, namely PDF benign, PDF anomaly, and PDF malware. The generated dataset in this research is a dataset with imbalanced conditions and used Synthetic Minority Oversampling Technique (SMOTE) and NearMiss to balance the data. To classify malware PDF file attacks used one of the well-known machine learning methods, Decision Tree Algorithm. Classification divided into two types, classification with the original dataset (imbalanced dataset conditions) and classification with balanced dataset conditions. Then to validate the accuracy of the classification model used cross validation method, Stratified K-Fold Cross Validation. Based on classification results, the best performance obtained by the average percentage of accuracy 99.83%, precision 99.83%, recall 99.83%, F1-score 99.84%, TNR (true negative rate) 99.92%, AUC (area under curve) 99.88%, and FPR (false positive rate) 0.001 and FNR (false negative rate) 0.002.
Item Type: | Thesis (Undergraduate) |
---|---|
Uncontrolled Keywords: | ALGORITMA DECISION TREE |
Subjects: | Q Science > Q Science (General) > Q300-390 Cybernetics > Q325.5 Machine learning Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation. Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA76.9.A25 Computer security. Systems and Data Security. T Technology > T Technology (General) > T57.6-57.97 Operations research. Systems analysis > T57.85 Network systems theory Including network analysis Cf. TS157.5+ Scheduling |
Divisions: | 09-Faculty of Computer Science > 56201-Computer Systems (S1) |
Depositing User: | Novi Yuningsih |
Date Deposited: | 03 Jan 2023 02:12 |
Last Modified: | 03 Jan 2023 02:12 |
URI: | http://repository.unsri.ac.id/id/eprint/85028 |
Actions (login required)
View Item |