KLASTERISASI ONLINE SHOP BERDASARKAN CAPTION DENGAN ALGORITMA JARO WINKLER DISTANCE DAN K-NEAREST NEIGHBOR

JUSADI, HIKMAH and Abdiansah, Abdiansah and Yusliani, Novi (2020) KLASTERISASI ONLINE SHOP BERDASARKAN CAPTION DENGAN ALGORITMA JARO WINKLER DISTANCE DAN K-NEAREST NEIGHBOR. Undergraduate thesis, Sriwijaya University.

[thumbnail of RAMA_55201_09021181320059.pdf] Text
RAMA_55201_09021181320059.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (3MB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_TURNITIN.pdf] Text
RAMA_55201_09021181320059_TURNITIN.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (9MB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_01_front_ref.pdf]
Preview
Text
RAMA_55201_09021181320059_0001108401_0008118205_01_front_ref.pdf - Accepted Version
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Preview
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_02.pdf] Text
RAMA_55201_09021181320059_0001108401_0008118205_02.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (638kB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_03.pdf] Text
RAMA_55201_09021181320059_0001108401_0008118205_03.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (618kB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_04.pdf] Text
RAMA_55201_09021181320059_0001108401_0008118205_04.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (792kB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_05.pdf] Text
RAMA_55201_09021181320059_0001108401_0008118205_05.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (454kB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_06.pdf] Text
RAMA_55201_09021181320059_0001108401_0008118205_06.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (215kB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_06_ref.pdf] Text
RAMA_55201_09021181320059_0001108401_0008118205_06_ref.pdf - Bibliography
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (237kB) | Request a copy
[thumbnail of RAMA_55201_09021181320059_0001108401_0008118205_07_lamp.pdf] Text
RAMA_55201_09021181320059_0001108401_0008118205_07_lamp.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (364kB) | Request a copy

Abstract

Captions on uploaded photos and videos on the online shop account in Instagram are not equipped with category filters, so the customers are often confused about determining the online shop category based on the caption. In this study the Jaro Winkler Distance and K-Nearest Neighbor algorithms were used to cluster 10 captions data from three different categories. Before the clustering process, pre-processing of text was carried out, namely case folding, tokenizing, and stopword removal which aimed to restore the standard form of text documents, break the document into words, and remove common words that has a high frequency of appearance. Jaro Winkler Distance is used to calculate the string similarity value between test data and training data and K-Nearest Neighbor is used for the clustering process based on predetermined K values, namely 3, 53,103, 153, 203, 249, and 449. The K value with the highest accuracy result is K 249. Based on the analysis conducted from seven tests of 450 training data and 10 test data, the results obtained Precision 0.419, Recall 0.567, F-Measure 0.398, and 66% accuracy value.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: Clustering, Caption, Instagram, Jaro Winkler Distance, K-Nearest Neighbor
Subjects: Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation.
Divisions: 09-Faculty of Computer Science > 55201-Informatics (S1)
Depositing User: Users 9933 not found.
Date Deposited: 15 Jan 2021 04:57
Last Modified: 15 Jan 2021 04:57
URI: http://repository.unsri.ac.id/id/eprint/40228

Actions (login required)

View Item View Item