PARALLEL PROCESSING FOR STEMMING AND POS-TAGGING IN INDONESIAN TEXT

RAHMADANI, MUHAMMAD WAHYU and Abdiansah, Abdiansah and Marieska, Mastura Diana (2022) PARALLEL PROCESSING FOR STEMMING AND POS-TAGGING IN INDONESIAN TEXT. Undergraduate thesis, Sriwijaya University.

[thumbnail of RAMA_55201_09021181823166.pdf] Text
RAMA_55201_09021181823166.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_55201_09021181823166_TURNITIN.pdf] Text
RAMA_55201_09021181823166_TURNITIN.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (7MB) | Request a copy
[thumbnail of RAMA_55201_09021181823166_0001108401_0021038607_01_front_ref.pdf]
Preview
Text
RAMA_55201_09021181823166_0001108401_0021038607_01_front_ref.pdf - Accepted Version
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Preview
[thumbnail of RAMA_55201_09021181823166_0001108401_0021038607_02.pdf] Text
RAMA_55201_09021181823166_0001108401_0021038607_02.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_55201_09021181823166_0001108401_0021038607_03.pdf] Text
RAMA_55201_09021181823166_0001108401_0021038607_03.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_55201_09021181823166_0001108401_0021038607_04.pdf] Text
RAMA_55201_09021181823166_0001108401_0021038607_04.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_55201_09021181823166_0001108401_0021038607_05.pdf] Text
RAMA_55201_09021181823166_0001108401_0021038607_05.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_55201_09021181823166_0001108401_0021038607_06_ref.pdf] Text
RAMA_55201_09021181823166_0001108401_0021038607_06_ref.pdf - Bibliography
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_55201_09021181823166_0001108401_0021038607_07_lamp.pdf] Text
RAMA_55201_09021181823166_0001108401_0021038607_07_lamp.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy

Abstract

Stemming and POS-Tagging is part of the pre-processing of raw text data in the natural language processing field which aims to produce more structured data and as an initial step that greatly affects processing performance before being processed at a further stage. In its application for Indonesian text, the efficiency level of process performance for these two stages is still low, especially for large data sizes. Parallel processing method using the python multiprocessing module was applied in this study to see the reduction in processing time for the Stemming and POS Tagging process and also to observe the impact of implementing this parallel processing method on the devices used. Results showed that the highest reduction was 78.26% for the POS-Tagging process using test data size range of 10 MB – 70 MB and 63.28% for the Stemming process using test data size range of 7 MB – 25 MB. Processor allocation in parallel processing and data size affect device performance in terms of increasing device temperature and memory consumption.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: parallel processing, Stemming, POS-Tagging. Indonesian text
Subjects: Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA76 Computer software
Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA76.76.I58.A3115 Computer science. Computers. Intelligent agents (Computer software)
Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA76.9.E94 Computer system performance. Computer Communication Networks. Computer science. Logic design. Operating systems (Computers).
Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA76.Z55 Apache Hadoop (Computer file) Electronic data processing--Distributed processing. File organization (Computer science) Data mining. Streaming technology (Telecommunications)
Divisions: 09-Faculty of Computer Science > 55201-Informatics (S1)
Depositing User: S.Kom Muhammad Wahyu Rahmadani
Date Deposited: 19 Jan 2023 05:19
Last Modified: 19 Jan 2023 05:19
URI: http://repository.unsri.ac.id/id/eprint/86639

Actions (login required)

View Item View Item