Natural Language Understanding

Natural Language Understanding is a field of Natural Language Processing that covers tasks such as text classification, natural language inference, and story comprehension. Its applications range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers


Showing 251–300 of 1978 papers

Title	Date	Tasks	Status	Hype
Semantic Change Characterization with LLMs using Rhetorics	Jul 23, 2024	Natural Language Understanding	Unverified	0
SQLfuse: Enhancing Text-to-SQL Performance through Comprehensive LLM Synergy	Jul 19, 2024	Natural Language Queries, Natural Language Understanding	Unverified	0
GeoHard: Towards Measuring Class-wise Hardness through Modelling Class Semantics	Jul 17, 2024	Natural Language Understanding	Unverified	0
CCoE: A Compact LLM with Collaboration of Experts	Jul 16, 2024	Language Modelling, Large Language Model	Unverified	0
Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models	Jul 11, 2024	Natural Language Understanding, Navigate	Code Available	0
Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024	Jul 11, 2024	Natural Language Understanding, RAG	Unverified	0
ROSA: Random Subspace Adaptation for Efficient Fine-Tuning	Jul 10, 2024	Natural Language Understanding, parameter-efficient fine-tuning	Code Available	0
TRACE: TRansformer-based Attribution using Contrastive Embeddings in LLMs	Jul 6, 2024	Attribute, Contrastive Learning	Unverified	0
Embodied AI in Mobile Robots: Coverage Path Planning with Large Language Models	Jul 2, 2024	Natural Language Understanding	Unverified	0
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning	Jul 1, 2024	image-classification, Image Classification	Code Available	1
Tree Search for Language Model Agents	Jul 1, 2024	Language Modeling, Language Modelling	Code Available	3
Data Generation Using Large Language Models for Text Classification: An Empirical Case Study	Jun 27, 2024	Diversity, Natural Language Understanding	Unverified	0
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data	Jun 26, 2024	Benchmarking, Math	Code Available	2
ViANLI: Adversarial Natural Language Inference for Vietnamese	Jun 25, 2024	Adversarial Natural Language Inference, Natural Language Inference	Unverified	0
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation	Jun 25, 2024	Diversity, Natural Language Understanding	Code Available	5
Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models	Jun 24, 2024	Logical Reasoning, Natural Language Understanding	Code Available	0
KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning	Jun 24, 2024	Hierarchical Reinforcement Learning, Knowledge Graphs	Code Available	0
UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language Understanding	Jun 24, 2024	Data Augmentation, Natural Language Understanding	Code Available	0
SuperGLEBer: German Language Understanding Evaluation Benchmark	Jun 20, 2024	Document Classification, Natural Language Understanding	Code Available	1
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks	Jun 19, 2024	Decoder, Language Modeling	Code Available	2
LangTopo: Aligning Language Descriptions of Graphs with Tokenized Topological Modeling	Jun 19, 2024	Natural Language Understanding	Unverified	0
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages	Jun 18, 2024	Cross-Lingual Transfer, Language Modeling	Unverified	0
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation	Jun 18, 2024	GPU, Natural Language Understanding	Code Available	1
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding	Jun 17, 2024	Mixture-of-Experts, Natural Language Understanding	Code Available	0
InternalInspector I^2: Robust Confidence Estimation in LLMs through Internal States	Jun 17, 2024	Benchmarking, Contrastive Learning	Unverified	0
LiLiuM: eBay's Large Language Models for e-commerce	Jun 17, 2024	Language Modeling, Language Modelling	Unverified	0
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning	Jun 17, 2024	Document AI, Model Optimization	Code Available	1
CodeGemma: Open Code Models Based on Gemma	Jun 17, 2024	Code Completion, Mathematical Reasoning	Unverified	0
DataComp-LM: In search of the next generation of training sets for language models	Jun 17, 2024	Language Modelling, MMLU	Code Available	7
Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment	Jun 17, 2024	Logical Reasoning, Math	Unverified	0
Large Language Models for Automatic Milestone Detection in Group Discussions	Jun 16, 2024	Natural Language Understanding, Semantic Similarity	Unverified	0
MMLU-SR: A Benchmark for Stress-Testing Reasoning Capability of Large Language Models	Jun 15, 2024	Mathematical Reasoning, MMLU	Unverified	0
3D-RPE: Enhancing Long-Context Modeling Through 3D Rotary Position Encoding	Jun 14, 2024	Language Modeling, Language Modelling	Unverified	0
Self-Knowledge Distillation for Learning Ambiguity	Jun 14, 2024	Knowledge Distillation, Natural Language Understanding	Unverified	0
Transformers meet Neural Algorithmic Reasoners	Jun 13, 2024	Graph Neural Network, Language Modeling	Unverified	0
Language Models are Crossword Solvers	Jun 13, 2024	Natural Language Understanding, World Knowledge	Unverified	0
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL	Jun 12, 2024	Natural Language Understanding, Text to SQL	Unverified	0
Making Task-Oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests	Jun 12, 2024	Dialogue State Tracking, Natural Language Understanding	Unverified	0
Paraphrasing in Affirmative Terms Improves Negation Understanding	Jun 11, 2024	Natural Language Inference, Natural Language Understanding	Unverified	0
MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension	Jun 10, 2024	Natural Language Understanding, Retrosynthesis	Unverified	0
Multi-Prompting Decoder Helps Better Language Understanding	Jun 10, 2024	Decoder, Natural Language Understanding	Unverified	0
GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents	Jun 7, 2024	Natural Language Understanding	Code Available	3
TLEX: An Efficient Method for Extracting Exact Timelines from TimeML Temporal Graphs	Jun 7, 2024	Natural Language Understanding	Unverified	0
Mixture-of-Agents Enhances Large Language Model Capabilities	Jun 7, 2024	Language Modeling, Language Modelling	Code Available	7
Are Large Language Models the New Interface for Data Pipelines?	Jun 6, 2024	AutoML, Explainable artificial intelligence	Unverified	0
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans	Jun 6, 2024	Common Sense Reasoning, Natural Language Understanding	Unverified	0
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions	Jun 6, 2024	Data Augmentation, Diversity	Code Available	0
RAG-based Crowdsourcing Task Decomposition via Masked Contrastive Learning with Prompts	Jun 4, 2024	Common Sense Reasoning, Contrastive Learning	Unverified	0
MACT: Model-Agnostic Cross-Lingual Training for Discourse Representation Structure Parsing	Jun 3, 2024	DRS Parsing, Natural Language Understanding	Code Available	0
Role-playing Prompt Framework: Generation and Evaluation	Jun 2, 2024	Natural Language Understanding, Text Generation	Unverified	0


Datasets: PDP60, STREUSLE, LexGLUE, DialoGLUE fewshot, DialoGLUE full, GLUE

Benchmark Results

PDP60

#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
3	BERT-large 340M	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

STREUSLE

#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

LexGLUE

#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

DialoGLUE fewshot

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

DialoGLUE full

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

GLUE

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified