학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Exploring Part-of-Speech Tagging Models in Malaysia's Multilingual

Resource Type: Conference
Authors: Chan, Yean Ling; Leong, Fang En; Lim, Tong Ming; Tan, Chi Wee
Source: 2024 3rd International Conference on Digital Transformation and Applications (ICDXA) Digital Transformation and Applications (ICDXA), 2024 3rd International Conference on. :132-136 Jan, 2024
Subject: Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
part-of-speech tagging
code-mixing
QTAG
semi-supervised
rule-based
Language

Online Access

Full Text (IEEE)

초록

This study delves into the realm of part-of-speech (POS) tagging within Malaysia's multilingual context. We investigate the efficacy of CRF-QTAG, semi-supervised CRF models and rule-based systems within Bahasa Rojak Analytics—a Malay-based language processing system. By analyzing these models' performances, we observed the profound impact of retraining on their accuracy. While CRF-QTAG and semi-supervised CRF models showcased substantial improvements post-retraining, the rigidity of the rule-based system led to underperformance. The study sheds light on challenges posed by linguistic nuances in code-mixed languages and the dependence on labeled data. Our findings highlight the potential of semi-supervised models in addressing data scarcity issues and adapting to linguistic evolution. Additionally, we advocate for further research aimed at refining rule-based approaches by emphasizing linguistic comprehension and rule definition for enhanced adaptability and accuracy. Addressing these challenges can potentially pave the way for more inclusive and precise language technologies tailored to Malaysia's diverse linguistic fabric.

공지

DAU Library

학술논문

요약정보

Exploring Part-of-Speech Tagging Models in Malaysia's Multilingual

Online Access

초록