KLASIFIKASI MULTILABEL KOMENTAR PADA TWITTER MENGGUNAKAN LONG SHORT TERM MEMORY (Multilabel Classification of Twitter Comments Using Long Short Term Memory)

ARMANDO, FADEL and Abdiansah, Abdiansah and Kurniati, Rizki (2023) KLASIFIKASI MULTILABEL KOMENTAR PADA TWITTER MENGGUNAKAN LONG SHORT TERM MEMORY. Undergraduate thesis, Sriwijaya University.

Abstract

Twitter is one of the most popular social media platforms, and its users can freely post comments in the form of tweets. These tweets can contain abusive language and toxic comments: comments that are rude, disrespectful, unreasonable, or outright humiliating to someone. Toxic comments cause serious problems on social media, and some people avoid taking part in debates that have become unfair and unhealthy. A single toxic comment can carry several labels at once, so this research performs multilabel classification of comments. The method used is Long Short Term Memory (LSTM) with Word2Vec as the word embedding. The dataset consists of 2,682 tweets, split into 80% training data and 20% test data. After hyperparameter tuning with random search, the best LSTM model used a dropout of 0.2, 128 hidden units, a recurrent dropout of 0.3 in the LSTM layer, 20 epochs, and a batch size of 64. Based on the research results, the average Hamming loss is 0.138.
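
For readers unfamiliar with the metric reported in the abstract, the sketch below shows how Hamming loss is commonly computed for multilabel predictions: the fraction of individual label slots that are predicted incorrectly, averaged over all samples and labels. The function name `hamming_loss`, the toy label matrices, and the choice of four labels are assumptions made for illustration only; they are not taken from the thesis, and the thesis's own evaluation code may differ.

```python
from typing import Sequence


def hamming_loss(
    y_true: Sequence[Sequence[int]], y_pred: Sequence[Sequence[int]]
) -> float:
    """Return the fraction of label slots that were predicted incorrectly."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equally long")
    n_labels = len(y_true[0])
    wrong = 0
    for true_row, pred_row in zip(y_true, y_pred):
        if len(true_row) != n_labels or len(pred_row) != n_labels:
            raise ValueError("every row must have the same number of labels")
        # Count the label positions where prediction and ground truth differ.
        wrong += sum(t != p for t, p in zip(true_row, pred_row))
    return wrong / (len(y_true) * n_labels)


# Hypothetical toy data: 3 tweets, 4 binary labels each (not from the thesis).
y_true = [[1, 0, 0, 1], [0, 1, 0, 0], [1, 1, 0, 0]]
y_pred = [[1, 0, 1, 1], [0, 1, 0, 0], [0, 1, 0, 0]]

loss = hamming_loss(y_true, y_pred)
print(f"Hamming loss: {loss:.3f}")
```

In the toy data, 2 of the 12 label slots are wrong, so the loss is 2/12. Read the same way, a Hamming loss of 0.138 means that roughly 13.8% of individual label decisions on the test data were incorrect; lower is better, and 0 means every label of every tweet was predicted correctly.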

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: Hamming Loss, Long Short Term Memory, Multilabel Classification, Twitter, Word2Vec
Subjects: P Language and Literature > P Philology. Linguistics > P98-98.5 Computational linguistics. Natural language processing
Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation.
Divisions: 09-Faculty of Computer Science > 55201-Informatics (S1)
Depositing User: Fadel Armando
Date Deposited: 21 Aug 2023 02:23
Last Modified: 21 Aug 2023 02:23
URI: http://repository.unsri.ac.id/id/eprint/127461
