Soft and Hard Clustering for Abstract Scientific Paper in Indonesian

Johannes, Johannes and Ermatita, Ermatita and Sukemi, Sukemi (2019) Soft and Hard Clustering for Abstract Scientific Paper in Indonesian. In: 2019 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), 24-25 Oct. 2019, Fakultas Ilmu Komputer UPN Veteran Jakarta.


Abstract

Clustering is one way to make grouping research papers easier. Clustering is a method for classifying objects into subsets with similar attributes. Clustering methods fall into two categories: hard and soft clustering. Hard clustering groups data items such that each item is assigned to exactly one cluster; K-Means is one example. Soft clustering groups data items such that an item can belong to multiple clusters; Fuzzy C-Means (FCM) is an example. Most research papers are filed in groups associated with the researcher's area of expertise, even though some research also relates to fields outside that area. Such papers should also be filed under those other fields so that their contribution there can be recognized. In this paper we analyse the abstracts of papers written in Indonesian as the data set. Data samples were taken from three fields: information technology, health, and economics. Clustering is performed with K-Means and FCM to find out whether abstracts of scientific papers from different fields of research fall into the same cluster entirely, only partly, or into different clusters. Because abstracts are unstructured data, they must first be processed through a text mining procedure to become vector data.
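
The difference between hard and soft assignment described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two-dimensional points below are hypothetical stand-ins for the term vectors produced by text mining, the cluster centers are fixed by hand rather than learned, and the fuzzifier m = 2 is an assumed, commonly used value. The hard step picks the nearest center, as in the K-Means assignment step; the soft step uses the standard FCM membership formula.

```python
import math


def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def hard_assign(point, centers):
    """Hard clustering: index of the single nearest center.

    Ties are broken in favour of the lowest index.
    """
    return min(range(len(centers)), key=lambda j: distance(point, centers[j]))


def fuzzy_memberships(point, centers, m=2.0):
    """Soft clustering: FCM membership degree of one point in every cluster.

    u_j = 1 / sum_k (d_j / d_k) ** (2 / (m - 1)); the degrees sum to 1.
    m is the fuzzifier (assumed value 2.0 here).
    """
    dists = [distance(point, c) for c in centers]
    # A point lying exactly on a center belongs fully to that center.
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    exponent = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_j / d_k) ** exponent for d_k in dists)
        for d_j in dists
    ]


# Hypothetical data: two hand-picked centers and two "abstract" vectors.
centers = [(0.0, 0.0), (4.0, 0.0)]
doc_a = (0.5, 0.0)  # clearly closer to the first center
doc_b = (2.0, 0.0)  # halfway between both centers

for name, doc in (("doc_a", doc_a), ("doc_b", doc_b)):
    print(name, hard_assign(doc, centers), fuzzy_memberships(doc, centers))
```

Under these assumptions, the hard assignment forces doc_b into one cluster even though it sits halfway between the two centers, while the fuzzy memberships split it evenly. This is the kind of cross-field overlap the paper looks for in abstracts.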

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Hard and soft clustering
Subjects: Q Science > QA Mathematics > QA75-76.95 Calculating machines > QA75.5.A142 Computer science. Information society. Information technology.
Divisions: 09-Faculty of Computer Science > 56201-Computer Systems (S1)
Depositing User: Dr Ermatita zuhairi
Date Deposited: 15 Mar 2022 07:28
Last Modified: 15 Mar 2022 07:28
URI: http://repository.unsri.ac.id/id/eprint/66124
