KLASIFIKASI HATE SPEECH DAN ABUSIVE LANGUAGE PADA TEKS MENGGUNAKAN METODE LONG SHORT-TERM MEMORY (LSTM)

SARI, NADYA ANDRIANI PUSPITA and Utami, Alvi Syahrini and Rizqie, M. Qurhanul (2023) KLASIFIKASI HATE SPEECH DAN ABUSIVE LANGUAGE PADA TEKS MENGGUNAKAN METODE LONG SHORT-TERM MEMORY (LSTM). Undergraduate thesis, Sriwijaya University.

[thumbnail of RAMA_55201_09021182025015.pdf] Text
RAMA_55201_09021182025015.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (2MB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_TURNITIN.pdf] Text
RAMA_55201_09021182025015_TURNITIN.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (4MB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_01_front_ref.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_01_front_ref.pdf - Accepted Version
Available under License Creative Commons Public Domain Dedication.

Download (807kB)
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_02.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_02.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (356kB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_03.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_03.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (130kB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_04.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_04.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (880kB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_05.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_05.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (142kB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_06.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_06.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (8kB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_08_lamp.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_08_lamp.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (4kB) | Request a copy
[thumbnail of RAMA_55201_09021182025015_0022127804_0203128701_07_ref.pdf] Text
RAMA_55201_09021182025015_0022127804_0203128701_07_ref.pdf - Bibliography
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (85kB) | Request a copy

Abstract

Social media is an online platform that allows users to share content, interact with other users, create statuses, and also leave comments. Users can freely make comments that contain hate speech and abusive language. One platform that is widely used to make these comments is Twitter. This research aims to classify hate speech and abusive language in text. The method used is Long Short Term Memory (LSTM) and Word2Vec as word embedding. The data used is multilabel class and taken from Kaggle with a total data of 13,169 tweets which are then divided into 80% training data and 20% test data. After manually searching for random hyperparameters 10 times for each hyperparameter, the best results were obtained for the LSTM model with a dropout configuration of 0.2, hidden unit 256, recurrent dropout in the LSTM layer 0.2, epochs 15, and batch size 32. After the research, the average hamming loss value was 0.153.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: Hamming Loss, Klasifikasi Multilabel, Long Short Term Memory, Twitter, Word2Vec, Hate Speech, Abusive Language
Subjects: P Language and Literature > P Philology. Linguistics > P98-98.5 Computational linguistics. Natural language processing
Divisions: 09-Faculty of Computer Science > 55201-Informatics (S1)
Depositing User: Nadya Andriani Puspita Sari
Date Deposited: 02 Jan 2024 13:05
Last Modified: 02 Jan 2024 13:05
URI: http://repository.unsri.ac.id/id/eprint/137346

Actions (login required)

View Item View Item