Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 1978 papers

Title	Date	Tasks	Status	Hype
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding	Dec 24, 2024	Natural Language UnderstandingScene Understanding	CodeCode Available	2
Large Language Model Safety: A Holistic Survey	Dec 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Learning Transferable Visual Models From Natural Language Supervision	Feb 26, 2021	Action RecognitionBenchmarking	CodeCode Available	2
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning	Oct 18, 2023	Natural Language Understanding	CodeCode Available	2
Selective Aggregation for Low-Rank Adaptation in Federated Learning	Oct 2, 2024	Federated LearningGeneral Knowledge	CodeCode Available	2
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding	Aug 21, 2023	Entity TypingEvent Extraction	CodeCode Available	2
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	Apr 10, 2025	Code GenerationContinual Learning	CodeCode Available	2
Spark NLP: Natural Language Understanding at Scale	Jan 26, 2021	BIG-bench Machine LearningNatural Language Understanding	CodeCode Available	2
I-BERT: Integer-only BERT Quantization	Jan 5, 2021	GPUNatural Language Inference	CodeCode Available	2
An empirical study of LLaMA3 quantization: from LLMs to MLLMs	Apr 22, 2024	Language ModellingLarge Language Model	CodeCode Available	2
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners	Sep 15, 2020	Natural Language Understanding	CodeCode Available	2
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action	Dec 28, 2023	DecoderImage Generation	CodeCode Available	2
GPT Understands, Too	Mar 18, 2021	Knowledge ProbingLanguage Modeling	CodeCode Available	2
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models	Sep 12, 2023	DiagnosticNatural Language Understanding	CodeCode Available	2
JGLUE: Japanese General Language Understanding Evaluation	Jun 1, 2022	FLUENatural Language Understanding	CodeCode Available	2
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI	Jul 19, 2023	Conversational RecommendationDiversity	CodeCode Available	2
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing	Nov 18, 2021	Language ModelingLanguage Modelling	CodeCode Available	2
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT	Apr 27, 2020	Document RankingInformation Retrieval	CodeCode Available	2
CLUE: A Chinese Language Understanding Evaluation Benchmark	Apr 13, 2020	General ClassificationMachine Reading Comprehension	CodeCode Available	2
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models	May 24, 2023	ChatbotNatural Language Understanding	CodeCode Available	2
An Empirical Study of Qwen3 Quantization	May 4, 2025	Natural Language UnderstandingQuantization	CodeCode Available	2
CleanAgent: Automating Data Standardization with LLM-based Agents	Mar 13, 2024	Code GenerationNatural Language Understanding	CodeCode Available	2
DeBERTa: Decoding-enhanced BERT with Disentangled Attention	Jun 5, 2020	Common Sense ReasoningCoreference Resolution	CodeCode Available	2
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks	Jun 19, 2024	DecoderLanguage Modeling	CodeCode Available	2
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models	Sep 21, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	2

Show:10 25 50

← PrevPage 3 of 80Next →

All datasets PDP60 STREUSLE LexGLUE DialoGLUE fewshot DialoGLUE full GLUE

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
3	BERT-large 340M	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified