Natural Language Understanding

Natural Language Understanding is an important field of Natural Language Processing which contains various tasks such as text classification, natural language inference and story comprehension. Applications enabled by natural language understanding range from question answering to automated reasoning.

Source: Find a Reasonable Ending for Stories: Does Logic Relation Help the Story Cloze Test?

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 451–500 of 1978 papers

Title	Date	Tasks	Status	Hype
Self Generated Wargame AI: Double Layer Agent Task Planning Based on Large Language Model	Dec 2, 2023	Decision MakingLanguage Modeling	—Unverified	0
Summarization-based Data Augmentation for Document Classification	Dec 1, 2023	ClassificationData Augmentation	CodeCode Available	0
TaskWeaver: A Code-First Agent Framework	Nov 29, 2023	Natural Language Understanding	CodeCode Available	5
Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability	Nov 26, 2023	Natural Language Understanding	—Unverified	0
Explore the Potential of LLMs in Misinformation Detection: An Empirical Study	Nov 21, 2023	MisinformationNatural Language Understanding	—Unverified	0
MultiLoRA: Democratizing LoRA for Better Multi-Task Learning	Nov 20, 2023	Multi-Task LearningNatural Language Understanding	—Unverified	0
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments	Nov 16, 2023	Natural Language UnderstandingNegation	CodeCode Available	0
Effective Large Language Model Adaptation for Improved Grounding and Citation Generation	Nov 16, 2023	Language ModelingLanguage Modelling	—Unverified	0
SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU	Nov 16, 2023	Intent DetectionNatural Language Understanding	CodeCode Available	0
XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs	Nov 15, 2023	Decision MakingDecoder	CodeCode Available	1
MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation	Nov 15, 2023	AllEvent Argument Extraction	CodeCode Available	1
Fusion-Eval: Integrating Assistant Evaluators with LLMs	Nov 15, 2023	Natural Language Understanding	—Unverified	0
On the Calibration of Multilingual Question Answering LLMs	Nov 15, 2023	Cross-Lingual TransferData Augmentation	—Unverified	0
Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning	Nov 13, 2023	In-Context LearningLanguage Modeling	—Unverified	0
Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding	Nov 12, 2023	Contrastive LearningData Augmentation	—Unverified	0
Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset	Nov 9, 2023	MathNatural Language Understanding	CodeCode Available	1
DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining	Nov 8, 2023	GPUMRPC	—Unverified	0
RankAug: Augmented data ranking for text classification	Nov 8, 2023	ClassificationDiversity	—Unverified	0
Relation Extraction Model Based on Semantic Enhancement Mechanism	Nov 5, 2023	Information Retrievalmodel	—Unverified	0
A Systematic Review of Deep Graph Neural Networks: Challenges, Classification, Architectures, Applications & Potential Utility in Bioinformatics	Nov 3, 2023	Graph Neural NetworkNatural Language Understanding	—Unverified	0
MARRS: Multimodal Reference Resolution System	Nov 3, 2023	Natural Language Understanding	—Unverified	0
Automatic Disfluency Detection from Untranscribed Speech	Nov 1, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions	Nov 1, 2023	Few-Shot NLIInstruction Following	CodeCode Available	1
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models	Nov 1, 2023	Natural Language Understanding	—Unverified	0
Dense Retrieval as Indirect Supervision for Large-space Decision Making	Oct 28, 2023	Decision MakingEntity Typing	CodeCode Available	0
TLM: Token-Level Masking for Transformers	Oct 28, 2023	Data-to-Text GenerationGrammatical Error Correction	CodeCode Available	0
Large-scale Foundation Models and Generative AI for BigData Neuroscience	Oct 27, 2023	Data AugmentationNatural Language Understanding	—Unverified	0
Evaluation of large language models using an Indian language LGBTI+ lexicon	Oct 26, 2023	Machine TranslationMMLU	—Unverified	0
Meaning and understanding in large language models	Oct 26, 2023	Natural Language Understanding	—Unverified	0
Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors	Oct 25, 2023	en-US domain classificationen-US Intent Classification	CodeCode Available	0
tagE: Enabling an Embodied Agent to Understand Human Instructions	Oct 24, 2023	DecoderNatural Language Understanding	CodeCode Available	0
Large Language Models are Temporal and Causal Reasoners for Video Question Answering	Oct 24, 2023	Natural Language UnderstandingQuestion Answering	CodeCode Available	1
Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding	Oct 23, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
CoF-CoT: Enhancing Large Language Models with Coarse-to-Fine Chain-of-Thought Prompting for Multi-domain NLU Tasks	Oct 23, 2023	Abstract Meaning RepresentationNatural Language Understanding	CodeCode Available	0
Primacy Effect of ChatGPT	Oct 20, 2023	Natural Language UnderstandingQuestion Answering	CodeCode Available	0
Explaining Interactions Between Text Spans	Oct 20, 2023	Community DetectionDecision Making	CodeCode Available	0
StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding	Oct 19, 2023	Multiple-choiceNatural Language Understanding	CodeCode Available	0
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding	Oct 18, 2023	Natural Language UnderstandingRelation Extraction	—Unverified	0
Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs	Oct 18, 2023	Decision MakingNatural Language Understanding	—Unverified	0
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning	Oct 18, 2023	Natural Language Understanding	CodeCode Available	2
Core Building Blocks: Next Gen Geo Spatial GPT Application	Oct 17, 2023	Natural Language Understanding	—Unverified	0
Towards Automatic Satellite Images Captions Generation Using Large Language Models	Oct 17, 2023	Image CaptioningManagement	—Unverified	0
Rethinking Relation Classification with Graph Meaning Representations	Oct 15, 2023	ClassificationNatural Language Understanding	—Unverified	0
Empirical study of pretrained multilingual language models for zero-shot cross-lingual knowledge transfer in generation	Oct 15, 2023	Language ModelingLanguage Modelling	—Unverified	0
GLoRE: Evaluating Logical Reasoning of Large Language Models	Oct 13, 2023	Logical ReasoningNatural Language Understanding	CodeCode Available	1
Developing a Natural Language Understanding Model to Characterize Cable News Bias	Oct 13, 2023	named-entity-recognitionNamed Entity Recognition	—Unverified	0
Split-and-Denoise: Protect large language model inference with local differential privacy	Oct 13, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Evaluating The Effectiveness of Capsule Neural Network in Toxic Comment Classification using Pre-trained BERT Embeddings	Oct 12, 2023	Multilingual NLPNatural Language Understanding	CodeCode Available	0
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models	Oct 12, 2023	Natural Language UnderstandingQuantization	CodeCode Available	2
PHALM: Building a Knowledge Graph from Scratch by Prompting Humans and a Language Model	Oct 11, 2023	Language ModelingLanguage Modelling	CodeCode Available	1

Show:10 25 50

← PrevPage 10 of 40Next →

All datasets PDP60 STREUSLE LexGLUE DialoGLUE fewshot DialoGLUE full GLUE

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HNN	Accuracy	90	—	Unverified
2	UDSSM-II (ensemble)	Accuracy	78.3	—	Unverified
3	BERT-large 340M	Accuracy	78.3	—	Unverified
4	UDSSM-I (ensemble)	Accuracy	76.7	—	Unverified
5	DSSM	Accuracy	75	—	Unverified
6	UDSSM-II	Accuracy	75	—	Unverified
7	BERT-base 110M + MAS	Accuracy	68.3	—	Unverified
8	USSM + Supervised Deepnet + 3 Knowledge Bases	Accuracy	66.7	—	Unverified
9	Word-level CNN+LSTM (full scoring)	Accuracy	60	—	Unverified
10	Subword-level Transformer LM	Accuracy	58.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BERT (pred POS/lemmas)	Tags (Full) Acc	82.5	—	Unverified
2	BERT (none)	Tags (Full) Acc	82	—	Unverified
3	BERT (gold POS/lemmas)	Tags (Full) Acc	81	—	Unverified
4	GloVe (gold POS/lemmas)	Tags (Full) Acc	79.3	—	Unverified
5	RoBERTa + Linear	Full F1 (Preps)	78.2	—	Unverified
6	GloVe (none)	Tags (Full) Acc	77.5	—	Unverified
7	GloVe (pred POS/lemmas)	Tags (Full) Acc	77.1	—	Unverified
8	SVM (feature-rich, gold syntax)	Role F1 (Preps)	62.2	—	Unverified
9	BiLSTM + MLP (gold syntax)	Role F1 (Preps)	62.2	—	Unverified
10	SVM (feature-rich, auto syntax)	Role F1 (Preps)	58.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CaseLaw-BERT	CaseHOLD	75.6	—	Unverified
2	Legal-BERT	CaseHOLD	75.1	—	Unverified
3	DeBERTa	CaseHOLD	72.1	—	Unverified
4	Longformer	CaseHOLD	72	—	Unverified
5	RoBERTa	CaseHOLD	71.7	—	Unverified
6	BERT	CaseHOLD	70.7	—	Unverified
7	BigBird	CaseHOLD	70.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT-DG	Average	74.6	—	Unverified
2	ConvBERT-DG + Pre + Multi	Average	73.8	—	Unverified
3	mslm	Average	73.49	—	Unverified
4	ConvBERT + Pre + Multi	Average	68.22	—	Unverified
5	BanLanGen	Average	39.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ConvBERT + Pre + Multi	Average	86.89	—	Unverified
2	mslm	Average	85.83	—	Unverified
3	ConvBERT-DG + Pre + Multi	Average	85.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MT-DNN-SMART	Average	89.9	—	Unverified
2	BERT-LARGE	Average	82.1	—	Unverified