Natural Language Understanding

Natural Language Understanding is an important field within Natural Language Processing that covers tasks such as text classification, natural language inference, and story comprehension. Its applications range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Showing 1–50 of 1978 papers

Title	Date	Tasks	Status	Hype
Vision Language Action Models in Robotic Manipulation: A Systematic Review	Jul 14, 2025	Dataset Generation, Natural Language Understanding	Code Available	2
State and Memory is All You Need for Robust and Reliable AI Agents	Jun 30, 2025	All, Benchmarking	Unverified	0
A Survey on Vision-Language-Action Models for Autonomous Driving	Jun 30, 2025	Autonomous Driving, Autonomous Vehicles	Code Available	4
skLEP: A Slovak General Language Understanding Benchmark	Jun 26, 2025	Natural Language Understanding, Sentence	Code Available	0
SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models	Jun 25, 2025	Code Generation, In-Context Learning	Unverified	0
Semantic similarity estimation for domain specific data using BERT and other techniques	Jun 23, 2025	Information Retrieval, Machine Translation	Unverified	0
Towards Pervasive Distributed Agentic Generative AI -- A State of The Art	Jun 16, 2025	Natural Language Understanding, Survey	Unverified	0
An Interdisciplinary Review of Commonsense Reasoning and Intent Detection	Jun 16, 2025	Intent Detection, Natural Language Understanding	Unverified	0
Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation	Jun 12, 2025	Language Modeling, Language Modelling	Unverified	0
Dialect Normalization using Large Language Models and Morphological Rules	Jun 10, 2025	Natural Language Understanding	Code Available	0
EdgeProfiler: A Fast Profiling Framework for Lightweight LLMs on Edge Using Analytical Model	Jun 6, 2025	Natural Language Understanding, Quantization	Code Available	0
Demonstrations of Integrity Attacks in Multi-Agent Systems	Jun 5, 2025	Code Generation, Natural Language Understanding	Unverified	0
ChemGraph: An Agentic Framework for Computational Chemistry Workflows	Jun 3, 2025	Computational chemistry, Graph Neural Network	Unverified	0
ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation	Jun 1, 2025	Natural Language Understanding	Unverified	0
Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization	May 30, 2025	Benchmarking, Cryptanalysis	Unverified	0
MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection	May 29, 2025	image-classification, Image Classification	Unverified	0
ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage	May 28, 2025	Language Modeling, Language Modelling	Unverified	0
StreamLink: Large-Language-Model Driven Distributed Data Engineering System	May 27, 2025	Language Modeling, Language Modelling	Unverified	0
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression	May 26, 2025	Language Modeling, Language Modelling	Code Available	1
Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities	May 26, 2025	Knowledge Graphs, Natural Language Understanding	Code Available	2
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models	May 24, 2025	Natural Language Understanding	Unverified	0
Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation	May 24, 2025	Intent Detection, Natural Language Understanding	Unverified	0
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL	May 22, 2025	Natural Language Understanding, Reinforcement Learning (RL)	Code Available	3
Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach	May 22, 2025	Decision Making, Natural Language Understanding	Unverified	0
Logic-of-Thought: Empowering Large Language Models with Logic Programs for Solving Puzzles in Natural Language	May 22, 2025	Natural Language Understanding	Code Available	0
Large Language Model-Empowered Interactive Load Forecasting	May 22, 2025	Language Modeling, Language Modelling	Unverified	0
SAE-SSV: Supervised Steering in Sparse Representation Spaces for Reliable Control of Language Models	May 22, 2025	Natural Language Understanding	Unverified	0
Transfer of Structural Knowledge from Synthetic Languages	May 21, 2025	Natural Language Understanding, Transfer Learning	Code Available	0
Evolutionary Computation and Large Language Models: A Survey of Methods, Synergies, and Applications	May 21, 2025	Evolutionary Algorithms, Natural Language Understanding	Unverified	0
Krikri: Advancing Open Large Language Models for Greek	May 19, 2025	Code Generation, Language Modeling	Unverified	0
Cross-Lingual Representation Alignment Through Contrastive Image-Caption Tuning	May 19, 2025	Natural Language Understanding, Retrieval	Code Available	0
A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs	May 19, 2025	Machine Translation, named-entity-recognition	Code Available	0
ModernGBERT: German-only 1B Encoder Model Trained from Scratch	May 19, 2025	Decoder, Natural Language Understanding	Unverified	0
Cloud-Based AI Systems: Leveraging Large Language Models for Intelligent Fault Detection and Autonomous Self-Healing	May 16, 2025	Anomaly Detection, Cloud Computing	Unverified	0
Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M	May 15, 2025	Benchmarking, Memorization	Code Available	0
Human-like Cognitive Generalization for Large Models via Brain-in-the-loop Supervision	May 14, 2025	Natural Language Understanding, Zero-Shot Learning	Unverified	0
Hakim: Farsi Text Embedding Model	May 13, 2025	Information Retrieval, Language Modeling	Unverified	0
A Social Robot with Inner Speech for Dietary Guidance	May 13, 2025	Computational Efficiency, Decision Making	Code Available	0
TUMS: Enhancing Tool-use Abilities of LLMs with Multi-structure Handlers	May 13, 2025	Natural Language Understanding, Task 2	Unverified	0
Vision-Language-Action Models: Concepts, Progress, Applications and Challenges	May 7, 2025	Autonomous Vehicles, Natural Language Understanding	Unverified	0
A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient	May 6, 2025	Natural Language Understanding	Unverified	0
An Empirical Study of Qwen3 Quantization	May 4, 2025	Natural Language Understanding, Quantization	Code Available	2
Structured Prompting and Feedback-Guided Reasoning with LLMs for Data Interpretation	May 3, 2025	Natural Language Understanding	Unverified	0
TRAVELER: A Benchmark for Evaluating Temporal Reasoning across Vague, Implicit and Explicit References	May 2, 2025	Natural Language Understanding, Question Answering	Unverified	0
Understanding LLM Scientific Reasoning through Promptings and Model's Explanation on the Answers	May 2, 2025	Natural Language Understanding, Prompt Engineering	Unverified	0
OET: Optimization-based prompt injection Evaluation Toolkit	May 1, 2025	Adversarial Robustness, Natural Language Understanding	Code Available	1
Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models	Apr 27, 2025	Key Information Extraction, Natural Language Understanding	Unverified	0
Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant	Apr 25, 2025	Natural Language Understanding, Response Generation	Code Available	0
Pushing the boundary on Natural Language Inference	Apr 25, 2025	Fact Checking, Information Retrieval	Unverified	0
A Survey of Foundation Model-Powered Recommender Systems: From Feature-Based, Generative to Agentic Paradigms	Apr 23, 2025	Natural Language Understanding, Recommendation Systems	Unverified	0


Datasets: PDP60, STREUSLE, LexGLUE, DialoGLUE fewshot, DialoGLUE full, GLUE

Benchmark Results

Dataset: PDP60
#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
3	BERT-large 340M	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

Dataset: STREUSLE
#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

Dataset: LexGLUE
#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

Dataset: DialoGLUE fewshot
#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

Dataset: DialoGLUE full
#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

Dataset: GLUE
#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified