RAHMADANI, MUHAMMAD WAHYU and Abdiansah, Abdiansah and Marieska, Mastura Diana (2022) PARALLEL PROCESSING FOR STEMMING AND POS-TAGGING IN INDONESIAN TEXT. Undergraduate thesis, Sriwijaya University.
Text
RAMA_55201_09021181823166.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
|
Text
RAMA_55201_09021181823166_TURNITIN.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (7MB) | Request a copy |
|
Preview |
Text
RAMA_55201_09021181823166_0001108401_0021038607_01_front_ref.pdf - Accepted Version Available under License Creative Commons Public Domain Dedication. Download (1MB) | Preview |
Text
RAMA_55201_09021181823166_0001108401_0021038607_02.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
|
Text
RAMA_55201_09021181823166_0001108401_0021038607_03.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
|
Text
RAMA_55201_09021181823166_0001108401_0021038607_04.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
|
Text
RAMA_55201_09021181823166_0001108401_0021038607_05.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
|
Text
RAMA_55201_09021181823166_0001108401_0021038607_06_ref.pdf - Bibliography Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
|
Text
RAMA_55201_09021181823166_0001108401_0021038607_07_lamp.pdf - Accepted Version Restricted to Repository staff only Available under License Creative Commons Public Domain Dedication. Download (1MB) | Request a copy |
Abstract
Stemming and POS-Tagging is part of the pre-processing of raw text data in the natural language processing field which aims to produce more structured data and as an initial step that greatly affects processing performance before being processed at a further stage. In its application for Indonesian text, the efficiency level of process performance for these two stages is still low, especially for large data sizes. Parallel processing method using the python multiprocessing module was applied in this study to see the reduction in processing time for the Stemming and POS Tagging process and also to observe the impact of implementing this parallel processing method on the devices used. Results showed that the highest reduction was 78.26% for the POS-Tagging process using test data size range of 10 MB – 70 MB and 63.28% for the Stemming process using test data size range of 7 MB – 25 MB. Processor allocation in parallel processing and data size affect device performance in terms of increasing device temperature and memory consumption.
Actions (login required)
View Item |