Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 1978 papers

Title	Date	Tasks	Status	Hype
Mixture-of-Agents Enhances Large Language Model Capabilities	Jun 7, 2024	Language ModelingLanguage Modelling	CodeCode Available	7
DataComp-LM: In search of the next generation of training sets for language models	Jun 17, 2024	Language ModellingMMLU	CodeCode Available	7
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond	Apr 26, 2023	Language ModellingNatural Language Understanding	CodeCode Available	6
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts	Apr 13, 2024	DiversityLanguage Modeling	CodeCode Available	5
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG	Jan 15, 2025	Natural Language UnderstandingRAG	CodeCode Available	5
TaskWeaver: A Code-First Agent Framework	Nov 29, 2023	Natural Language Understanding	CodeCode Available	5
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation	Jun 25, 2024	DiversityNatural Language Understanding	CodeCode Available	5
How to Design Translation Prompts for ChatGPT: An Empirical Study	Apr 5, 2023	Machine TranslationNatural Language Understanding	CodeCode Available	5
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation	Oct 14, 2022	Natural Language UnderstandingText Generation	CodeCode Available	4
What Makes Good In-Context Examples for GPT-3?	Jan 17, 2021	Few-Shot LearningNatural Language Understanding	CodeCode Available	4
A Survey on Vision-Language-Action Models for Autonomous Driving	Jun 30, 2025	Autonomous DrivingAutonomous Vehicles	CodeCode Available	4
Decoder Tuning: Efficient Language Understanding as Decoding	Dec 16, 2022	DecoderNatural Language Understanding	CodeCode Available	4
Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective	Oct 16, 2022	Coreference ResolutionMultiple-choice	CodeCode Available	4
GLM: General Language Model Pretraining with Autoregressive Blank Infilling	Mar 18, 2021	Abstractive Text SummarizationClassification	CodeCode Available	3
Efficient Large Language Models: A Survey	Dec 6, 2023	Natural Language UnderstandingSurvey	CodeCode Available	3
Tree Search for Language Model Agents	Jul 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline	Jan 16, 2024	GSM8KMath	CodeCode Available	3
Large Language Model-Brained GUI Agents: A Survey	Nov 27, 2024	Code GenerationLanguage Modeling	CodeCode Available	3
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding	Oct 11, 2018	Citation Intent ClassificationCommon Sense Reasoning	CodeCode Available	3
Ludwig: a type-based declarative deep learning toolbox	Sep 17, 2019	DecoderDeep Learning	CodeCode Available	3
Attention Is All You Need	Jun 12, 2017	Abstractive Text SummarizationAll	CodeCode Available	3
GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents	Jun 7, 2024	Natural Language Understanding	CodeCode Available	3
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL	May 22, 2025	Natural Language UnderstandingReinforcement Learning (RL)	CodeCode Available	3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding	Oct 23, 2020	Language ModelingLanguage Modelling	CodeCode Available	3
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation	Nov 26, 2024	Natural Language UnderstandingReferring Video Object Segmentation	CodeCode Available	3

Show:10 25 50

← PrevPage 1 of 80Next →

All datasets PDP60 STREUSLE LexGLUE DialoGLUE fewshot DialoGLUE full GLUE

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
3	BERT-large 340M	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified