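JUSADI, HIKMAH and Abdiansah, Abdiansah and Yusliani, Novi (2020) KLASTERISASI ONLINE SHOP BERDASARKAN CAPTION DENGAN ALGORITMA JARO WINKLER DISTANCE DAN K-NEAREST NEIGHBOR [Clustering of Online Shops Based on Captions Using the Jaro Winkler Distance and K-Nearest Neighbor Algorithms]. Undergraduate thesis, Sriwijaya University.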
Text
RAMA_55201_09021181320059.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (3MB) | Request a copy |
Text
RAMA_55201_09021181320059_TURNITIN.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (9MB) | Request a copy |
Text
RAMA_55201_09021181320059_0001108401_0008118205_01_front_ref.pdf - Accepted Version Available under License Creative Commons Public Domain Dedication. Download (1MB) | Preview |
Text
RAMA_55201_09021181320059_0001108401_0008118205_02.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (638kB) | Request a copy |
Text
RAMA_55201_09021181320059_0001108401_0008118205_03.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (618kB) | Request a copy |
Text
RAMA_55201_09021181320059_0001108401_0008118205_04.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (792kB) | Request a copy |
Text
RAMA_55201_09021181320059_0001108401_0008118205_05.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (454kB) | Request a copy |
Text
RAMA_55201_09021181320059_0001108401_0008118205_06.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (215kB) | Request a copy |
Text
RAMA_55201_09021181320059_0001108401_0008118205_06_ref.pdf - Bibliography Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (237kB) | Request a copy |
Text
RAMA_55201_09021181320059_0001108401_0008118205_07_lamp.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (364kB) | Request a copy |
Abstract
Captions on photos and videos uploaded by online shop accounts on Instagram are not equipped with category filters, so customers are often confused when determining an online shop's category from its captions. In this study, the Jaro Winkler Distance and K-Nearest Neighbor algorithms were used to cluster 10 caption data items from three different categories. Before clustering, the text was pre-processed by case folding, tokenizing, and stopword removal, which respectively restore the text to a standard form, break the document into words, and remove common words that have a high frequency of appearance. Jaro Winkler Distance is used to calculate the string similarity between the test data and the training data, and K-Nearest Neighbor is used for the clustering process based on predetermined K values, namely 3, 53, 103, 153, 203, 249, and 449. The K value with the highest accuracy is K = 249. Based on seven tests with 450 training data items and 10 test data items, the results were a Precision of 0.419, a Recall of 0.567, an F-Measure of 0.398, and an accuracy of 66%.
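The pipeline described in the abstract (pre-processing, Jaro Winkler similarity, then a K-Nearest Neighbor vote) can be illustrated with a minimal Python sketch. It is not the thesis implementation: the stopword list, the category names, the toy training captions, and the choice to compare whole pre-processed captions as single strings are all assumptions made for illustration, as are the function names `preprocess`, `jaro`, `jaro_winkler`, and `knn_label`. The prefix scaling factor 0.1 and the four-character prefix cap are the conventional Jaro Winkler defaults, which the abstract does not specify.

```python
import re
from collections import Counter

# Hypothetical stopword list; the thesis's actual list is not given in the abstract.
STOPWORDS = {"the", "a", "and", "for", "of", "in"}

def preprocess(caption):
    """Case folding, tokenizing, and stopword removal."""
    tokens = re.findall(r"[a-z0-9]+", caption.lower())
    return " ".join(t for t in tokens if t not in STOPWORDS)

def jaro(s1, s2):
    """Jaro similarity in [0, 1]; 1.0 only for identical strings."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters match only if they are equal and within this window.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order / 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3

def jaro_winkler(s1, s2, p=0.1):
    """Jaro similarity boosted by the common prefix (at most 4 characters)."""
    sim = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return sim + prefix * p * (1 - sim)

def knn_label(test_caption, training, k=3):
    """Majority label among the k training captions most similar to the test caption."""
    query = preprocess(test_caption)
    ranked = sorted(
        training,
        key=lambda item: jaro_winkler(query, preprocess(item[0])),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

# Toy training data with hypothetical categories (the thesis used 450 training captions).
TRAINING = [
    ("Floral summer dress for sale", "fashion"),
    ("New denim jacket and summer dress", "fashion"),
    ("Spicy fried chicken delivery", "food"),
    ("Fresh chocolate cake delivery", "food"),
    ("Smartphone case and charger", "gadget"),
    ("Wireless earphone and charger", "gadget"),
]

print(knn_label("Summer dress sale", TRAINING, k=3))
```

With only six training captions, small K values are the only sensible choice here; the thesis reports testing much larger K values against its 450 training captions.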
Item Type: | Thesis (Undergraduate) |
---|---|
Uncontrolled Keywords: | Clustering, Caption, Instagram, Jaro Winkler Distance, K-Nearest Neighbor |
Subjects: | Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation. |
Divisions: | 09-Faculty of Computer Science > 55201-Informatics (S1) |
Date Deposited: | 15 Jan 2021 04:57 |
Last Modified: | 15 Jan 2021 04:57 |
URI: | http://repository.unsri.ac.id/id/eprint/40228 |