CLINICAL NAMED ENTITY RECOGNITION PADA DATA BIOMEDIS MENGGUNAKAN VARIAN MODEL BERT UNTUK KASUS BIOMEDIS

OKTARINA, TIARA and firdaus, firdaus (2025) CLINICAL NAMED ENTITY RECOGNITION PADA DATA BIOMEDIS MENGGUNAKAN VARIAN MODEL BERT UNTUK KASUS BIOMEDIS. Undergraduate thesis, Sriwijaya University.

[thumbnail of RAMA_56201_09011182126028_cover.jpeg] Image
RAMA_56201_09011182126028_cover.jpeg - Accepted Version
Available under License Creative Commons Public Domain Dedication.

Download (477kB)
[thumbnail of RAMA_56201_09011182126028.pdf] Text
RAMA_56201_09011182126028.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (7MB) | Request a copy
[thumbnail of RAMA_56201_09011182126028_TURNITIN.pdf] Text
RAMA_56201_09011182126028_TURNITIN.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (6MB) | Request a copy
[thumbnail of RAMA_56201_09011182126028_0221017801_01_front_ref.pdf] Text
RAMA_56201_09011182126028_0221017801_01_front_ref.pdf - Accepted Version
Available under License Creative Commons Public Domain Dedication.

Download (5MB)
[thumbnail of RAMA_56201_09011182126028_0221017801_02.pdf] Text
RAMA_56201_09011182126028_0221017801_02.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (2MB) | Request a copy
[thumbnail of RAMA_56201_09011182126028_0221017801_03.pdf] Text
RAMA_56201_09011182126028_0221017801_03.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (3MB) | Request a copy
[thumbnail of RAMA_56201_09011182126028_0221017801_04.pdf] Text
RAMA_56201_09011182126028_0221017801_04.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (14MB) | Request a copy
[thumbnail of RAMA_56201_09011182126028_0221017801_05.pdf] Text
RAMA_56201_09011182126028_0221017801_05.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (411kB) | Request a copy
[thumbnail of RAMA_56201_09011182126028_0221017801_06_ref.pdf] Text
RAMA_56201_09011182126028_0221017801_06_ref.pdf - Bibliography
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (1MB) | Request a copy
[thumbnail of RAMA_56201_09011182126028_0221017801_07_lamp.pdf] Text
RAMA_56201_09011182126028_0221017801_07_lamp.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Public Domain Dedication.

Download (515kB) | Request a copy

Abstract

This study aims to develop a system capable of identifying and classifying medical entities from unstructured biomedical texts, thereby supporting Clinical analysis and health research. The author trained and evaluated three BERT-based models: BioBERT, Clinical BERT, and BlueBERT, for the task of Clinical Named Entity Recognition (CNER). Model performance was measured using precision, recall, and F1-Score metrics on three biomedical datasets: NCBI, BC2GM, and JNLPBA. The results consistently show that BioBERT delivers the best performance across most datasets. On NCBI, BioBERT achieved the highest F1-Score of 93%, outperforming Clinical BERT and BlueBERT (both 91%). In BC2GM, BioBERT also excelled with 91%, while other models reached 90%. The JNLPBA dataset proved more challenging, with BioBERT achieving the highest F1-Score of only 80%. A significant performance improvement, up to 10% in some cases, was observed in the final experiments due to the application of full fine-tuning. This research contributes to identifying the most optimal transformer model for automated information extraction applications in healthcare.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: Clinical Named Entity Recognition, Dataset Biomedic, Transformer, BioBERT, Clinical BERT, BlueBERT
Subjects: Q Science > Q Science (General) > Q334-342 Computer science. Artificial intelligence. Algorithms. Robotics. Automation.
Divisions: 09-Faculty of Computer Science > 56201-Computer Systems (S1)
Depositing User: Tiara Oktarina
Date Deposited: 21 Jul 2025 13:21
Last Modified: 21 Jul 2025 13:21
URI: http://repository.unsri.ac.id/id/eprint/179473

Actions (login required)

View Item View Item