Bangla Text Classification using Transformers

2020-11-09Code Available0· sign in to hype

Tanvirul Alam, Akib Khan, Firoj Alam

Code Available — Be the first to reproduce this paper.

Code

github.com/xashru/bangla-text-classification
pytorch★ 6

Abstract

Text classification has been one of the earliest problems in NLP. Over time the scope of application areas has broadened and the difficulty of dealing with new areas (e.g., noisy social media content) has increased. The problem-solving strategy switched from classical machine learning to deep learning algorithms. One of the recent deep neural network architecture is the Transformer. Models designed with this type of network and its variants recently showed their success in many downstream natural language processing tasks, especially for resource-rich languages, e.g., English. However, these models have not been explored fully for Bangla text classification tasks. In this work, we fine-tune multilingual transformer models for Bangla text classification tasks in different domains, including sentiment analysis, emotion detection, news categorization, and authorship attribution. We obtain the state of the art results on six benchmark datasets, improving upon the previous results by 5-29% accuracy across different tasks.

Tasks

Authorship Attribution Classification General Classification Sentiment Analysis text-classification Text Classification

Bangla Text Classification using Transformers

Code

Abstract

Tasks

Reproductions