AUTOMATIC TEXT SUMMARIZATION OF SYNOPSES OF AVIATION ACCIDENT REPORTS IN INDONESIA USING THE TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) ALGORITHM

Hilman, Muhammad Farhan and Passarella, Rossi (2023) AUTOMATIC TEXT SUMMARIZATION PADA SINOPSIS LAPORAN KECELAKAAN LALU LINTAS PENERBANGAN DI INDONESIA MENGGUNAKAN ALGORITMA TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF). Undergraduate thesis, Sriwijaya University.

Files (all Text, available under License Creative Commons Public Domain Dedication):

RAMA_56201_09011281924148.pdf - Accepted Version (2MB). Restricted to Repository staff only.
RAMA_56201_09011281924148_TURNITIN.pdf - Accepted Version (5MB). Restricted to Repository staff only.
RAMA_56201_09011281924148_0011067806_01_front_ref.pdf - Accepted Version (1MB).
RAMA_56201_09011281924148_0011067806_02.pdf - Accepted Version (160kB). Restricted to Repository staff only.
RAMA_56201_09011281924148_0011067806_03.pdf - Accepted Version (171kB). Restricted to Repository staff only.
RAMA_56201_09011281924148_0011067806_04.pdf - Accepted Version (614kB). Restricted to Repository staff only.
RAMA_56201_09011281924148_0011067806_05.pdf - Accepted Version (11kB). Restricted to Repository staff only.
RAMA_56201_09011281924148_0011067806_06_ref.pdf - Bibliography (79kB). Restricted to Repository staff only.
RAMA_56201_09011281924148_0011067806_07_lamp.pdf - Accepted Version (525kB). Restricted to Repository staff only.

Abstract

Automatic text summarization is a branch of natural language processing (NLP) that aims to compress a long text into a shorter representation that users can read and understand easily. Applying the Term Frequency-Inverse Document Frequency (TF-IDF) algorithm to automatic text summarization makes it possible to compute a score and weight for each sentence in a document, and thereby to identify the important sentences in a text. In this study, automatic text summarization using the TF-IDF algorithm is applied to a dataset of synopses from KNKT final reports on aviation accidents in Indonesia. The experiment was run on 142 synopses, and the summaries produced by the TF-IDF algorithm were then evaluated with ROUGE against human-written summaries and against summaries produced by a website (https://www.scribbr.com/text-summarizer/). Against the human summaries, the best ROUGE scores were ROUGE-1 0.746, ROUGE-2 0.727 and ROUGE-L 0.746, with average scores of ROUGE-1 0.475, ROUGE-2 0.265 and ROUGE-L 0.453. Against the website summaries, the best ROUGE scores were ROUGE-1 0.719, ROUGE-2 0.6 and ROUGE-L 0.719, with average scores of ROUGE-1 0.499, ROUGE-2 0.279 and ROUGE-L 0.478.
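
As an illustration of the approach the abstract describes, the following is a minimal sketch of TF-IDF sentence scoring and ROUGE-N overlap in Python. It is not the thesis implementation, which is not available in this record. The function names, the naive sentence splitter, the choice to treat each sentence as a "document" for the IDF term, the unsmoothed log(N/df) weighting, and the use of mean term weight as the sentence score are all assumptions made for this example. ROUGE-L is omitted, and the record does not state which ROUGE variant (recall, precision or F1) the reported scores use.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Naive splitter: break on whitespace that follows '.', '!' or '?'."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters and digits."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_sentence_scores(sentences):
    """Score each sentence as the mean TF-IDF weight of its distinct terms.

    Assumption: each sentence counts as one 'document' for the IDF term,
    and IDF is the unsmoothed log(N / df).
    """
    tokenized = [tokenize(s) for s in sentences]
    n = len(tokenized)
    # Document frequency: number of sentences that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        if not tokens:
            scores.append(0.0)
            continue
        tf = Counter(tokens)
        total = sum(
            (count / len(tokens)) * math.log(n / df[term])
            for term, count in tf.items()
        )
        scores.append(total / len(tf))
    return scores


def summarize(text, k=2):
    """Return the k highest-scoring sentences in their original order."""
    sentences = split_sentences(text)
    scores = tfidf_sentence_scores(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return " ".join(sentences[i] for i in sorted(ranked[:k]))


def rouge_n(candidate, reference, n=1):
    """ROUGE-N recall, precision and F1 from clipped n-gram overlap."""
    def ngrams(text):
        tokens = tokenize(text)
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    overlap = sum((cand & ref).values())  # multiset intersection clips counts
    recall = overlap / max(sum(ref.values()), 1)
    precision = overlap / max(sum(cand.values()), 1)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return {"recall": recall, "precision": precision, "f1": f1}
```

In this sketch, calling summarize(synopsis, k=2) selects two sentences from a synopsis, and rouge_n(summary, reference, n=1) compares the result with a reference summary. Terms that occur in every sentence receive zero weight, so sentences made up only of common words rank last.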

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: Automatic text summarization, Natural language processing (NLP), Term Frequency - Inverse Document Frequency (TF-IDF), ROUGE
Subjects: P Language and Literature > P Philology. Linguistics > P98-98.5 Computational linguistics. Natural language processing
Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA76.9.B45 Big data. Machine learning. Quantitative research. Metaheuristics.
Q Science > QA Mathematics > QA8.9-QA10.3 Computer science. Artificial intelligence. Computational complexity. Data structures (Computer science). Mathematical Logic and Formal Languages
Divisions: 09-Faculty of Computer Science > 56201-Computer Systems (S1)
Depositing User: Muhammad Farhan Hilman
Date Deposited: 23 Nov 2023 01:12
Last Modified: 23 Nov 2023 01:12
URI: http://repository.unsri.ac.id/id/eprint/130867
