SARI, NADYA ANDRIANI PUSPITA and Utami, Alvi Syahrini and Rizqie, M. Qurhanul (2023) KLASIFIKASI HATE SPEECH DAN ABUSIVE LANGUAGE PADA TEKS MENGGUNAKAN METODE LONG SHORT-TERM MEMORY (LSTM). Undergraduate thesis, Sriwijaya University.
Text
RAMA_55201_09021182025015.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (2MB) | Request a copy |
|
Text
RAMA_55201_09021182025015_TURNITIN.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (4MB) | Request a copy |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_01_front_ref.pdf - Accepted Version Available under License Creative Commons Public Domain Dedication. Download (807kB) |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_02.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (356kB) | Request a copy |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_03.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (130kB) | Request a copy |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_04.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (880kB) | Request a copy |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_05.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (142kB) | Request a copy |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_06.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (8kB) | Request a copy |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_08_lamp.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (4kB) | Request a copy |
|
Text
RAMA_55201_09021182025015_0022127804_0203128701_07_ref.pdf - Bibliography Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (85kB) | Request a copy |
Abstract
Social media is an online platform that allows users to share content, interact with other users, create statuses, and also leave comments. Users can freely make comments that contain hate speech and abusive language. One platform that is widely used to make these comments is Twitter. This research aims to classify hate speech and abusive language in text. The method used is Long Short Term Memory (LSTM) and Word2Vec as word embedding. The data used is multilabel class and taken from Kaggle with a total data of 13,169 tweets which are then divided into 80% training data and 20% test data. After manually searching for random hyperparameters 10 times for each hyperparameter, the best results were obtained for the LSTM model with a dropout configuration of 0.2, hidden unit 256, recurrent dropout in the LSTM layer 0.2, epochs 15, and batch size 32. After the research, the average hamming loss value was 0.153.
Item Type: | Thesis (Undergraduate) |
---|---|
Uncontrolled Keywords: | Hamming Loss, Klasifikasi Multilabel, Long Short Term Memory, Twitter, Word2Vec, Hate Speech, Abusive Language |
Subjects: | P Language and Literature > P Philology. Linguistics > P98-98.5 Computational linguistics. Natural language processing |
Divisions: | 09-Faculty of Computer Science > 55201-Informatics (S1) |
Depositing User: | Nadya Andriani Puspita Sari |
Date Deposited: | 02 Jan 2024 13:05 |
Last Modified: | 02 Jan 2024 13:05 |
URI: | http://repository.unsri.ac.id/id/eprint/137346 |
Actions (login required)
View Item |