Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 1978 papers

Title	Date	Tasks	Status	Hype
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models	Sep 21, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	2
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models	Sep 12, 2023	DiagnosticNatural Language Understanding	CodeCode Available	2
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants	Aug 31, 2023	BelebeleCross-Lingual Transfer	CodeCode Available	2
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding	Aug 21, 2023	Entity TypingEvent Extraction	CodeCode Available	2
MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models	Aug 17, 2023	Decision MakingHallucination	CodeCode Available	2
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI	Jul 19, 2023	Conversational RecommendationDiversity	CodeCode Available	2
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models	May 24, 2023	ChatbotNatural Language Understanding	CodeCode Available	2
Autonomous GIS: the next-generation AI-powered GIS	May 10, 2023	Code GenerationInformation Retrieval	CodeCode Available	2
PMC-LLaMA: Towards Building Open-source Language Models for Medicine	Apr 27, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Scaling Transformer to 1M tokens and beyond with RMT	Apr 19, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
NusaCrowd: Open Source Initiative for Indonesian NLP Resources	Dec 19, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	2
Solving Quantitative Reasoning Problems with Language Models	Jun 29, 2022	Arithmetic ReasoningLanguage Modeling	CodeCode Available	2
JGLUE: Japanese General Language Understanding Evaluation	Jun 1, 2022	FLUENatural Language Understanding	CodeCode Available	2
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages	Apr 18, 2022	intent-classificationIntent Classification	CodeCode Available	2
PERT: Pre-training BERT with Permuted Language Model	Mar 14, 2022	Language ModelingLanguage Modelling	CodeCode Available	2
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing	Nov 18, 2021	Language ModelingLanguage Modelling	CodeCode Available	2
GPT Understands, Too	Mar 18, 2021	Knowledge ProbingLanguage Modeling	CodeCode Available	2
Learning Transferable Visual Models From Natural Language Supervision	Feb 26, 2021	Action RecognitionBenchmarking	CodeCode Available	2
Spark NLP: Natural Language Understanding at Scale	Jan 26, 2021	BIG-bench Machine LearningNatural Language Understanding	CodeCode Available	2
I-BERT: Integer-only BERT Quantization	Jan 5, 2021	GPUNatural Language Inference	CodeCode Available	2
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners	Sep 15, 2020	Natural Language Understanding	CodeCode Available	2
DeBERTa: Decoding-enhanced BERT with Disentangled Attention	Jun 5, 2020	Common Sense ReasoningCoreference Resolution	CodeCode Available	2
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT	Apr 27, 2020	Document RankingInformation Retrieval	CodeCode Available	2
CLUE: A Chinese Language Understanding Evaluation Benchmark	Apr 13, 2020	General ClassificationMachine Reading Comprehension	CodeCode Available	2
Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models	Oct 21, 2019	Data AugmentationNatural Language Understanding	CodeCode Available	2

Show:10 25 50

← PrevPage 3 of 80Next →

All datasets PDP60 STREUSLE LexGLUE DialoGLUE fewshot DialoGLUE full GLUE

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	BERT-large 340M	Accuracy	78.3	—	Unverified
3	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified