Natural Language Understanding

Natural Language Understanding (NLU) is a subfield of Natural Language Processing that covers tasks such as text classification, natural language inference, and story comprehension. Applications enabled by NLU range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 51–100 of 1978 papers

Title	Date	Tasks	Status	Hype
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform	May 5, 2024	Image Classification	Code Available	2
PERT: Pre-training BERT with Permuted Language Model	Mar 14, 2022	Language Modeling	Code Available	2
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models	Sep 21, 2023	Arithmetic Reasoning, GSM8K	Code Available	2
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data	Jun 26, 2024	Benchmarking, Math	Code Available	2
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models	May 23, 2024	Natural Language Understanding, Quantization	Code Available	2
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models	Aug 17, 2023	Decision Making, Hallucination	Code Available	2
PMC-LLaMA: Towards Building Open-source Language Models for Medicine	Apr 27, 2023	Language Modeling	Code Available	2
Balancing LoRA Performance and Efficiency with Simple Shard Sharing	Sep 19, 2024	Computational Efficiency, GSM8K	Code Available	2
The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA	Feb 28, 2024	Natural Language Understanding, Question Answering	Code Available	2
LoRA-Pro: Are Low-Rank Adapters Properly Optimized?	Jul 25, 2024	Code Generation, Computational Efficiency	Code Available	2
Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities	May 26, 2025	Knowledge Graphs, Natural Language Understanding	Code Available	2
MCP-Solver: Integrating Language Models with Constraint Programming Systems	Dec 31, 2024	Natural Language Understanding	Code Available	2
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	Apr 10, 2025	Code Generation, Continual Learning	Code Available	2
JGLUE: Japanese General Language Understanding Evaluation	Jun 1, 2022	FLUE, Natural Language Understanding	Code Available	2
An empirical study of LLaMA3 quantization: from LLMs to MLLMs	Apr 22, 2024	Language Modeling, Large Language Model	Code Available	2
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models	Sep 12, 2023	Diagnostic, Natural Language Understanding	Code Available	2
I-BERT: Integer-only BERT Quantization	Jan 5, 2021	GPU, Natural Language Inference	Code Available	2
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks	Jun 19, 2024	Decoder, Language Modeling	Code Available	2
GPT Understands, Too	Mar 18, 2021	Knowledge Probing, Language Modeling	Code Available	2
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing	Nov 18, 2021	Language Modeling	Code Available	2
DeBERTa: Decoding-enhanced BERT with Disentangled Attention	Jun 5, 2020	Common Sense Reasoning, Coreference Resolution	Code Available	2
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI	Jul 19, 2023	Conversational Recommendation, Diversity	Code Available	2
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment	Feb 24, 2025	Image Classification	Code Available	2
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners	Sep 15, 2020	Natural Language Understanding	Code Available	2
Autonomous GIS: the next-generation AI-powered GIS	May 10, 2023	Code Generation, Information Retrieval	Code Available	2
Large Language Model Safety: A Holistic Survey	Dec 23, 2024	Language Modeling	Code Available	2
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach	Apr 11, 2021	Machine Translation, Natural Language Understanding	Code Available	1
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models	May 24, 2023	Document Understanding, Image Captioning	Code Available	1
Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus	Apr 14, 2020	Natural Language Understanding, Semantic Role Labeling	Code Available	1
Anno-MI: A Dataset of Expert-Annotated Counselling Dialogues	Apr 27, 2022	Dialogue Generation, Natural Language Understanding	Code Available	1
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models	Nov 4, 2021	Adversarial Attack, Adversarial Robustness	Code Available	1
CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge	Sep 3, 2021	Fact Checking, Fact Verification	Code Available	1
CSKG: The CommonSense Knowledge Graph	Dec 21, 2020	Knowledge Graphs, Natural Language Understanding	Code Available	1
ConvBERT: Improving BERT with Span-based Dynamic Convolution	Aug 6, 2020	Natural Language Understanding	Code Available	1
Advances of Transformer-Based Models for News Headline Generation	Jul 9, 2020	Headline Generation, Named Entity Recognition	Code Available	1
Convolution-enhanced Evolving Attention Networks	Dec 16, 2022	Image Classification	Code Available	1
ConFiguRe: Exploring Discourse-level Chinese Figures of Speech	Sep 16, 2022	Natural Language Understanding, Sentence	Code Available	1
CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about Negation	Nov 1, 2022	Natural Language Understanding, Negation	Code Available	1
Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset	Nov 9, 2023	Math, Natural Language Understanding	Code Available	1
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation	Sep 13, 2021	Decoder, Denoising	Code Available	1
C-STS: Conditional Semantic Textual Similarity	May 24, 2023	Information Retrieval, Language Model Evaluation	Code Available	1
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning	Jul 11, 2019	Natural Language Understanding, Reinforcement Learning	Code Available	1
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs	Oct 12, 2020	Knowledge Graphs, Natural Language Understanding	Code Available	1
CLUES: Few-Shot Learning Evaluation in Natural Language Understanding	Nov 4, 2021	Few-Shot Learning, Natural Language Understanding	Code Available	1
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text	Aug 16, 2019	Diagnostic, Graph Neural Network	Code Available	1
Comparison by Conversion: Reverse-Engineering UCCA from Syntax and Lexical Semantics	Nov 2, 2020	Natural Language Understanding, Sentence	Code Available	1
AceGPT, Localizing Large Language Models in Arabic	Sep 21, 2023	Instruction Following, Language Modeling	Code Available	1
CLINE: Contrastive Learning with Semantic Negative Examples for Natural Language Understanding	Jul 1, 2021	Contrastive Learning, Natural Language Understanding	Code Available	1
Causality-aware Concept Extraction based on Knowledge-guided Prompting	May 3, 2023	Knowledge Graphs, Natural Language Understanding	Code Available	1
A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering	May 11, 2020	Natural Language Understanding, Question Answering	Code Available	1

Datasets, in the order of the results tables that follow: PDP60, STREUSLE, LexGLUE, DialoGLUE fewshot, DialoGLUE full, GLUE

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
3	BERT-large 340M	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified