ANSWER FINDING PADA INDONESIA QUESTION ANSWERING SYSTEM MENGGUNAKAN REGULAR EXPRESSION DAN COSINE SIMILARITY

ALFADHILAH, NABILAH ISYRAQ and Yusliani, Novi and Rodiah, Desty (2024) ANSWER FINDING PADA INDONESIA QUESTION ANSWERING SYSTEM MENGGUNAKAN REGULAR EXPRESSION DAN COSINE SIMILARITY. Undergraduate thesis, Sriwijaya University.

[thumbnail of RAMA_55201_09021381823145.pdf] Text
RAMA_55201_09021381823145.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (5MB) | Request a copy
[thumbnail of RAMA_55201_09023181823145_TURNITIN.pdf] Text
RAMA_55201_09023181823145_TURNITIN.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (5MB) | Request a copy
[thumbnail of RAMA_55201_09021381823145_0008118205_0021128905_01_front_ref.pdf] Text
RAMA_55201_09021381823145_0008118205_0021128905_01_front_ref.pdf - Accepted Version
Available under License Creative Commons Public Domain Dedication.

Download (2MB)
[thumbnail of RAMA_55201_09021381823145_0008118205_0021128905_02.pdf] Text
RAMA_55201_09021381823145_0008118205_0021128905_02.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (448kB) | Request a copy
[thumbnail of RAMA_55201_09021381823145_0008118205_0021128905_03.pdf] Text
RAMA_55201_09021381823145_0008118205_0021128905_03.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (399kB) | Request a copy
[thumbnail of RAMA_55201_09021381823145_0008118205_0021128905_04.pdf] Text
RAMA_55201_09021381823145_0008118205_0021128905_04.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (397kB) | Request a copy
[thumbnail of RAMA_55201_09021381823145_0008118205_0021128905_05.pdf] Text
RAMA_55201_09021381823145_0008118205_0021128905_05.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (679kB) | Request a copy
[thumbnail of RAMA_55201_09021381823145_0008118205_0021128905_06_ref.pdf] Text
RAMA_55201_09021381823145_0008118205_0021128905_06_ref.pdf - Bibliography
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (123kB) | Request a copy
[thumbnail of RAMA_55201_09021381823145_0008118205_0021128905_07_lamp.pdf] Text
RAMA_55201_09021381823145_0008118205_0021128905_07_lamp.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (943kB) | Request a copy

Abstract

A Question Answering System (QAS) is an automated system designed to provide accurate answers to questions posed using natural language. This research aims to determine the ability of the Indonesian QAS built to perform answer finding or predict the answers submitted to the system using regular expression and cosine similarity methods. The steps taken include filtering the system's documents that have the same type of interrogative word as the question submitted to the system using regular expression, after which the similarity value is calculated using cosine similarity. The system documents that have the highest similarity value to the question asked will be taken as the result of the system's answer finding. The system documents used consist of a collection of questions and answers divided into three types of interrogative words: 'what', 'who', and 'where', all themed around Indonesian history. The number of system documents is 150 pairs of questions and answers, consisting of 50 pairs for the 'what' type, 50 pairs for the 'who' type, and 50 pairs for the 'where' type. The system documents were obtained from websites and then documented in .sql file format. The system testing was conducted based on three scenarios, differentiated by the three types of interrogative words, in predicting the answers to 30 test questions for each scenario. The system resulted in an average precision(0.7), recall(0.7), and F-1 score(0.7) in predicting correct answers. In predicting answers, the system still predicted some incorrect answers, which is due to the number of words in the system documents being greater than the number of words in the questions asked.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: Answer Finding, Cosine Similarity, Indonesia Question Answering System, Regular Expression
Subjects: Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation.
Divisions: 09-Faculty of Computer Science > 55201-Informatics (S1)
Depositing User: Nabilah Isyraq Alfadhilah
Date Deposited: 24 Jul 2024 13:28
Last Modified: 24 Jul 2024 13:28
URI: http://repository.unsri.ac.id/id/eprint/153205

Actions (login required)

View Item View Item