Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 376–400 of 1978 papers

Title	Date	Tasks	Status	Hype
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text	Aug 16, 2019	DiagnosticGraph Neural Network	CodeCode Available	1
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks	Jul 29, 2019	DecoderMachine Translation	CodeCode Available	1
Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning	Jul 11, 2019	Natural Language Understandingreinforcement-learning	CodeCode Available	1
Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation	Jun 2, 2019	Common Sense ReasoningLanguage Modeling	CodeCode Available	1
Attention Is (not) All You Need for Commonsense Reasoning	May 31, 2019	AllCoreference Resolution	CodeCode Available	1
A Surprisingly Robust Trick for Winograd Schema Challenge	May 15, 2019	Common Sense ReasoningCoreference Resolution	CodeCode Available	1
Dual Supervised Learning for Natural Language Understanding and Generation	May 15, 2019	Natural Language UnderstandingText Generation	CodeCode Available	1
Benchmarking Natural Language Understanding Services for building Conversational Agents	Mar 13, 2019	BenchmarkingGeneral Classification	CodeCode Available	1
BERT for Joint Intent Classification and Slot Filling	Feb 28, 2019	General Classificationintent-classification	CodeCode Available	1
A Comprehensive Survey on Graph Neural Networks	Jan 3, 2019	BIG-bench Machine Learningimage-classification	CodeCode Available	1
Jack the Reader - A Machine Reading Framework	Jun 20, 2018	ArticlesLink Prediction	CodeCode Available	1
Improving Language Understanding by Generative Pre-Training	Jun 11, 2018	Cloze TestDocument Classification	CodeCode Available	1
Know What You Don't Know: Unanswerable Questions for SQuAD	Jun 11, 2018	Natural Language UnderstandingQuestion Answering	CodeCode Available	1
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces	May 25, 2018	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding	Apr 20, 2018	DiagnosticNatural Language Inference	CodeCode Available	1
AllenNLP: A Deep Semantic Natural Language Processing Platform	Mar 20, 2018	Natural Language UnderstandingReading Comprehension	CodeCode Available	1
Evaluating Scoped Meaning Representations	Feb 23, 2018	Natural Language UnderstandingNegation	CodeCode Available	1
Whodunnit? Crime Drama as a Case for Natural Language Understanding	Oct 31, 2017	Natural Language Understanding	CodeCode Available	1
State and Memory is All You Need for Robust and Reliable AI Agents	Jun 30, 2025	AllBenchmarking	—Unverified	0
skLEP: A Slovak General Language Understanding Benchmark	Jun 26, 2025	Natural Language UnderstandingSentence	CodeCode Available	0
SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models	Jun 25, 2025	Code GenerationIn-Context Learning	—Unverified	0
Semantic similarity estimation for domain specific data using BERT and other techniques	Jun 23, 2025	Information RetrievalMachine Translation	—Unverified	0
An Interdisciplinary Review of Commonsense Reasoning and Intent Detection	Jun 16, 2025	Intent DetectionNatural Language Understanding	—Unverified	0
Towards Pervasive Distributed Agentic Generative AI -- A State of The Art	Jun 16, 2025	Natural Language UnderstandingSurvey	—Unverified	0
Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation	Jun 12, 2025	Language ModelingLanguage Modelling	—Unverified	0

Show:10 25 50

← PrevPage 16 of 80Next →

All datasets PDP60 STREUSLE LexGLUE DialoGLUE fewshot DialoGLUE full GLUE

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
3	BERT-large 340M	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified