Extraction of Event Sentence Information in the Covid-19 Distribution Location Detection System based on the Indonesian Language Corpus

Fathoni, Fathoni (2022) Extraction of Event Sentence Information in the Covid-19 Distribution Location Detection System based on the Indonesian Language Corpus. 2022 9th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI), 1 (1). pp. 383-388. ISSN 22505479

Extraction_of_Event_Sentence_Information_in_the_Covid-19_Distribution_Location_Detection_System_based_on_the_Indonesian_Language_Corpus.pdf - Published Version

Abstract

In Indonesia, data on people affected by COVID-19 are collected manually through health centers, government and private hospitals, and health clinics across the country, and through rapid tests carried out at certain times and locations. Such a surveillance system is slow, expensive, and requires many health personnel. In addition, Indonesia's geography, a vast territory made up of many large and small islands, calls for another strategy, such as the use of information technology, to make it easier to find and complete data on people affected by COVID-19. Twitter datasets have been widely used by researchers to detect the spread of disease in a region or country. Sentence extraction is a necessary step in analyzing a news sentence, whether short or very long, to obtain the meaning and essence of the news it contains. The primary information can be identified from keywords generated using extraction and abstraction techniques. The initial stage of the research focuses on building a corpus of Twitter data and a vocabulary corpus. The first step is to collect natural-language datasets from Indonesian-language tweets on Twitter. The next step is to extract event sentence information in four steps: creating standard sentence formats, simplifying sentences, identifying essential words, and determining the input and target words in sentences.
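
The four extraction steps named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not give the actual rules or corpora, so the word lists (EVENT_WORDS, STOPWORDS, LOCATION_MARKERS), the function names, and the rule that a target word is the token following a location preposition are all assumptions made for illustration.

```python
import re

# Hypothetical vocabularies; the paper's real corpora are not given in the abstract.
EVENT_WORDS = {"positif", "covid-19", "corona", "terinfeksi", "isolasi"}
STOPWORDS = {"yang", "dan", "ini", "itu", "juga", "sudah"}
LOCATION_MARKERS = {"di", "ke", "dari"}  # Indonesian prepositions of place


def standardize(tweet):
    """Step 1 (assumed): bring a raw tweet into a standard sentence format."""
    text = tweet.lower()
    text = re.sub(r"https?://\S+", " ", text)   # drop URLs
    text = re.sub(r"[@#]\w+", " ", text)        # drop mentions and hashtags
    text = re.sub(r"[^a-z0-9\s-]", " ", text)   # drop punctuation and emoji
    return re.sub(r"\s+", " ", text).strip()    # collapse whitespace


def simplify(sentence):
    """Step 2 (assumed): simplify the sentence by removing stopwords."""
    return [tok for tok in sentence.split() if tok not in STOPWORDS]


def essential_words(tokens):
    """Step 3 (assumed): keep only tokens found in the event vocabulary."""
    return [tok for tok in tokens if tok in EVENT_WORDS]


def input_and_target(tokens):
    """Step 4 (assumed): inputs are event keywords; targets are the tokens
    that directly follow a location preposition."""
    inputs = essential_words(tokens)
    targets = [
        tokens[i + 1]
        for i, tok in enumerate(tokens[:-1])
        if tok in LOCATION_MARKERS
    ]
    return inputs, targets


if __name__ == "__main__":
    tweet = ("Tetangga saya positif COVID-19 di Palembang, "
             "sudah isolasi mandiri. https://t.co/abc #covid19 @dinkes")
    tokens = simplify(standardize(tweet))
    print(input_and_target(tokens))
```

A real system would likely replace the keyword sets with the vocabulary corpus the abstract mentions and use a gazetteer or named-entity tagger rather than a preposition rule to find locations.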

Item Type: Article
Subjects: #3 Repository of Lecturer Academic Credit Systems (TPAK) > Articles Access for TPAK (Not Open Sources)
Divisions: 09-Faculty of Computer Science > 57201-Information Systems (S1)
Depositing User: Mr. Fathoni Cholil
Date Deposited: 06 May 2023 00:31
Last Modified: 06 May 2023 00:31
URI: http://repository.unsri.ac.id/id/eprint/99393
