How to Fine-Tune BERT for Text Classification?
2019-05-14
Chi Sun, Xipeng Qiu, Yige Xu, Xuanjing Huang
Code
- github.com/xuyige/BERT4doc-Classification (official, in paper, PyTorch, ★ 0)
- github.com/GeorgeLuImmortal/Hierarchical-BERT-Model-with-Limited-Labelled-Data (PyTorch, ★ 42)
- github.com/heraclex12/VLSP2020-Fake-News-Detection (PyTorch, ★ 18)
- github.com/bcaitech1/p4-dkt-no_caffeine_no_gain (PyTorch, ★ 16)
- github.com/Derposoft/ai-educator (★ 2)
- github.com/helmy-elrais/RoBERT_Recurrence_over_BERT (PyTorch, ★ 0)
- github.com/soarsmu/BiasFinder (PyTorch, ★ 0)
- github.com/arctic-yen/Google_QUEST_Q-A_Labeling (TensorFlow, ★ 0)
- github.com/sahil00199/KYC (PyTorch, ★ 0)
- github.com/Domminique/Deploy-BERT-for-Sentiment-Analysis-with-FastAPI- (PyTorch, ★ 0)
Abstract
Language model pre-training has proven useful for learning universal language representations. As a state-of-the-art pre-trained language model, BERT (Bidirectional Encoder Representations from Transformers) has achieved remarkable results on many language understanding tasks. In this paper, we conduct exhaustive experiments to investigate different fine-tuning methods of BERT on the text classification task and provide a general solution for BERT fine-tuning. The proposed solution obtains new state-of-the-art results on eight widely studied text classification datasets.
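One of the fine-tuning strategies the paper investigates is a layer-wise decreasing learning rate: the top transformer layer receives the base learning rate, and each lower layer's rate is scaled down by a decay factor. The sketch below illustrates that schedule in plain Python; the specific values (base rate 2e-5, decay 0.95, 12 layers) are illustrative defaults, not results quoted from this page.

```python
def layerwise_lrs(base_lr, decay, num_layers):
    """Per-layer learning rates for layer-wise decay fine-tuning.

    Index 0 is the bottom (earliest) layer; the top layer gets the
    full base rate, and each layer below it is scaled by `decay`:
        lr_l = base_lr * decay ** (top - l)
    """
    top = num_layers - 1
    return [base_lr * decay ** (top - layer) for layer in range(num_layers)]

# Illustrative values: base rate and decay factor are assumptions,
# chosen in the spirit of the paper's discriminative fine-tuning setup.
lrs = layerwise_lrs(base_lr=2e-5, decay=0.95, num_layers=12)
```

In an actual training setup, each entry of `lrs` would be attached to the parameter group of the corresponding encoder layer, so that lower layers (which hold more general features) change more slowly than task-specific upper layers.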
Benchmark Results
| Dataset | Model | Metric | Claimed (%) | Verified | Status |
|---|---|---|---|---|---|
| IMDb | BERT_large+ITPT | Accuracy | 95.79 | — | Unverified |
| IMDb | BERT_base+ITPT | Accuracy | 95.63 | — | Unverified |
| Yelp binary classification | BERT_large+ITPT | Error rate | 1.81 | — | Unverified |
| Yelp binary classification | BERT_base+ITPT | Error rate | 1.92 | — | Unverified |
| Yelp fine-grained classification | BERT_large+ITPT | Error rate | 28.62 | — | Unverified |
| Yelp fine-grained classification | BERT_base+ITPT | Error rate | 29.42 | — | Unverified |
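The "ITPT" suffix in the table denotes within-task further pre-training: before fine-tuning for classification, BERT is further pre-trained with its masked-language-modelling objective on the target-task corpus. The sketch below shows the core data-preparation step in plain Python. The 15% masking rate follows the standard BERT recipe; the `mask_tokens` helper is a simplification written for this page (it always substitutes `[MASK]`, omitting BERT's 80/10/10 mask/random/keep split).

```python
import random

def mask_tokens(tokens, mask_prob=0.15, rng=None):
    """Randomly mask tokens for a masked-language-modelling step.

    Returns the masked token sequence and a parallel label list:
    the original token at masked positions, None elsewhere
    (no loss is computed on unmasked positions).
    """
    rng = rng or random.Random(0)
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append("[MASK]")
            labels.append(tok)      # model must recover this token
        else:
            masked.append(tok)
            labels.append(None)     # position excluded from the loss
    return masked, labels

tokens = "the movie was surprisingly good".split()
masked, labels = mask_tokens(tokens)
```

Running this masking over the task's own unlabelled text, training the MLM head on the result, and only then fine-tuning the classifier is the two-stage recipe that the ITPT rows above evaluate.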