Investigating Effective Transfer Learning Strategies for Natural Language Processing Tasks in Low-Resource Languages

Authors

  • Ayesha Shahid Senior Lecturer at Social Sciences University of Ankara TRNC and PhD Scholar at Cyprus International University TRNC.
  • Shanza Chaudhari Senior English Lecturer (ELT) at the University of Kyrenia, TRNC and M.Phil Scholar at Final International University, TRNC/Cyprus.
  • Dr. Anwar Ali Sanjrani Department of Computer Science and Information Technology University of Balochistan Quetta.

Keywords:

Transfer Learning, Natural Language Processing, Low-Resource Languages, mBERT, XLM-R, Urdu, Panjabi, Balochi, Pashto, Sindhi.

Abstract

This research investigates effective transfer learning strategies for Natural Language Processing (NLP) tasks in low-resource languages, focusing on the Pakistani context. Through a comprehensive methodology, including literature review, data collection, and experimentation with pre-trained models like mBERT and XLM-R, the study demonstrates the potential of transfer learning to enhance model performance for Urdu, Panjabi, Balochi, Pashto, and Sindhi. The findings reveal significant improvements in accuracy and F1 scores through fine-tuning and data augmentation. Qualitative analyses, including error inspection and user feedback, highlight the necessity for ongoing refinements in model usability, particularly concerning cultural nuances and real-time feedback mechanisms.

Downloads

Published

2024-10-28

How to Cite

Ayesha Shahid, Shanza Chaudhari, & Dr. Anwar Ali Sanjrani. (2024). Investigating Effective Transfer Learning Strategies for Natural Language Processing Tasks in Low-Resource Languages. International Journal of Communication Networks and Information Security (IJCNIS), 16(4), 1868–1880. Retrieved from https://ijcnis.org/index.php/ijcnis/article/view/7520

Issue

Section

Research Articles