INARTE: An Indonesian Dataset for Recognition Textual Entailment

Abdiansah, Abdiansah (2018) INARTE: An Indonesian Dataset for Recognition Textual Entailment. In: 2018 4th International Conference on Science and Technology (ICST), 7-8 Aug. 2018, Yogyakarta, Indonesia.

[thumbnail of 2018-ICST.pdf]
Preview
Text
2018-ICST.pdf

Download (819kB) | Preview

Abstract

Recognition Textual Entailment (RTE) try to solve variability problem that commonly encountered in natural language-based systems. The basic idea is to detect whether the meaning of a text can be inferred by another text. The need dataset in language other than English is necessary to accelerate research development in RTE. We created RTE dataset for Indonesian by retrieval text from Web and generate text-hypothesis pairs as many as possible. The subset technique is used to decide whether Text (T) entails Hypothesis (H). The initial data used 400 question-answer pairs obtained 1,577 entailment pairs, where 481 entailment pairs obtained from the accuracy above 50%.

Item Type: Conference or Workshop Item (Paper)
Subjects: Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation.
Divisions: 09-Faculty of Computer Science > 55201-Informatics (S1)
Depositing User: Abdiansah Abdiansah
Date Deposited: 12 Mar 2020 02:01
Last Modified: 12 Mar 2020 02:01
URI: http://repository.unsri.ac.id/id/eprint/28185

Actions (login required)

View Item View Item