Natural Language Inference

Natural language inference (NLI) is the task of determining whether a "hypothesis" is true (entailment), false (contradiction), or undetermined (neutral) given a "premise".

Example:

| Premise | Label | Hypothesis |
| --- | --- | --- |
| A man inspects the uniform of a figure in some East Asian country. | contradiction | The man is sleeping. |
| An older and younger man smiling. | neutral | Two men are smiling and laughing at the cats playing on the floor. |
| A soccer game with multiple males playing. | entailment | Some men are playing a sport. |
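The example rows above can be held in a small data structure. The following is a minimal sketch, not a reference format for any dataset: the names `Label`, `NLIExample`, `EXAMPLES`, and `count_by_label` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Label(Enum):
    """The three NLI classes named in the task definition."""
    ENTAILMENT = "entailment"        # hypothesis is true given the premise
    CONTRADICTION = "contradiction"  # hypothesis is false given the premise
    NEUTRAL = "neutral"              # premise leaves the hypothesis undetermined


@dataclass(frozen=True)
class NLIExample:
    """One premise/hypothesis pair with its label (hypothetical container)."""
    premise: str
    hypothesis: str
    label: Label


# The three example rows from the table, copied verbatim.
EXAMPLES = [
    NLIExample(
        "A man inspects the uniform of a figure in some East Asian country.",
        "The man is sleeping.",
        Label.CONTRADICTION,
    ),
    NLIExample(
        "An older and younger man smiling.",
        "Two men are smiling and laughing at the cats playing on the floor.",
        Label.NEUTRAL,
    ),
    NLIExample(
        "A soccer game with multiple males playing.",
        "Some men are playing a sport.",
        Label.ENTAILMENT,
    ),
]


def count_by_label(examples):
    """Return a dict mapping each Label to the number of examples carrying it."""
    counts = {label: 0 for label in Label}
    for example in examples:
        counts[example.label] += 1
    return counts
```

Framed this way, NLI is a three-way classification over sentence pairs: a model receives `premise` and `hypothesis` and predicts one `Label`.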

Approaches to NLI range from earlier symbolic and statistical methods to more recent deep learning methods. Benchmark datasets for NLI include SNLI, MultiNLI, and SciTail, among others. You can get hands-on practice on the SNLI task by following the relevant d2l.ai chapter.

Papers

Showing 1451–1500 of 1961 papers

Title	Date	Tasks	Status
LexSemTm: A Semantic Dataset Based on All-words Unsupervised Sense Distribution Learning	Aug 1, 2016	All, Lexical Simplification	Unverified
Lifting the Curse of Multilinguality by Pre-training Modular Transformers	Nov 16, 2021	named-entity-recognition, Named Entity Recognition	Unverified
Lifting the Curse of Multilinguality by Pre-training Modular Transformers	May 12, 2022	named-entity-recognition, Named Entity Recognition	Unverified
Light Textual Inference for Semantic Parsing	Dec 1, 2012	Natural Language Inference, Semantic Parsing	Unverified
LIMSIILES: Basic English Substitution for Student Answer Assessment at SemEval 2013	Jun 1, 2013	Language Modelling, Machine Translation	Unverified
LIPN-CORE: Semantic Text Similarity using n-grams, WordNet, Syntactic Analysis, ESA and Information Retrieval based Features	Jun 1, 2013	Information Retrieval, Natural Language Inference	Unverified
Literal, Metphorical or Both? Detecting Metaphoricity in Isolated Adjective-Noun Phrases	Jun 1, 2018	General Classification, Machine Translation	Unverified
Local and Global Context for Supervised and Unsupervised Metonymy Resolution	Jul 1, 2012	Information Retrieval, Natural Language Inference	Unverified
Locality Preserving Loss: Neighbors that Live together, Align together	Apr 7, 2020	Natural Language Inference, Sentence Embeddings	Unverified
Logical Semantics, Dialogical Argumentation, and Textual Entailment	Aug 17, 2020	Natural Language Inference, Sentence	Unverified
Logic-guided Semantic Representation Learning for Zero-Shot Relation Classification	Oct 30, 2020	Classification, Descriptive	Unverified
Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation	Jul 2, 2024	Natural Language Inference, Representation Learning	Code Available
Extracting and filtering paraphrases by bridging natural language inference and paraphrasing	Nov 13, 2021	Natural Language Inference	Code Available
CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs	Jun 5, 2024	Clustering, Natural Language Inference	Code Available
BERTSel: Answer Selection with Pre-trained Models	May 18, 2019	Answer Selection, Natural Language Inference	Code Available
Language Models Meet Anomaly Detection for Better Interpretability and Generalizability	Apr 11, 2024	Anomaly Detection, Language Modelling	Code Available
Exploring Transitivity in Neural NLI Models through Veridicality	Jan 26, 2021	Natural Language Inference, Natural Language Understanding	Code Available
BERT and PALs: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning	Feb 7, 2019	Multi-Task Learning, Natural Language Inference	Code Available
Transformation of Dense and Sparse Text Representations	Nov 7, 2019	General Classification, Natural Language Inference	Code Available
Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving	May 2, 2024	Automated Theorem Proving, Natural Language Inference	Code Available
A Multilingual Benchmark for Probing Negation-Awareness with Minimal Pairs	Nov 1, 2021	Natural Language Inference, Negation	Code Available
VERITAS-NLI : Validation and Extraction of Reliable Information Through Automated Scraping and Natural Language Inference	Oct 12, 2024	Fake News Detection, Natural Language Inference	Code Available
Multimodal Coherent Explanation Generation of Robot Failures	Oct 1, 2024	Explanation Generation, Natural Language Inference	Code Available
Fake News Detection as Natural Language Inference	Jul 17, 2019	Fake News Detection, Management	Code Available
Annotating omission in statement pairs	Apr 1, 2017	Natural Language Inference	Code Available
Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models	Mar 17, 2024	Computational Efficiency, Hate Speech Detection	Code Available
Zero-shot Factual Consistency Evaluation Across Domains	Aug 7, 2024	Domain Generalization, Natural Language Inference	Code Available
Falsesum: Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization	May 12, 2022	Abstractive Text Summarization, Natural Language Inference	Code Available
Translate and Classify: Improving Sequence Level Classification for English-Hindi Code-Mixed Data	Jun 1, 2021	Machine Translation, Natural Language Inference	Code Available
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments	Jul 13, 2024	Entity Linking, Natural Language Inference	Code Available
Exploring the Limits of Natural Language Inference Based Setup for Few-Shot Intent Detection	Dec 14, 2021	Few-Shot Learning, Generalized Few-Shot Learning	Code Available
Exploring Robustness of Multilingual LLMs on Real-World Noisy Data	Jan 14, 2025	intent-classification, Intent Classification	Code Available
FastTrees: Parallel Latent Tree-Induction for Faster Sequence Encoding	Nov 28, 2021	Language Modeling, Language Modelling	Code Available
Multi-Task Deep Neural Networks for Natural Language Understanding	Jan 31, 2019	Domain Adaptation, Language Modeling	Code Available
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples	May 13, 2023	Data Augmentation, Natural Language Inference	Code Available
Multi-turn Inference Matching Network for Natural Language Inference	Jan 8, 2019	Natural Language Inference	Code Available
Multiway Attention Networks for Modeling Sentence Pairs	Jul 1, 2018	Natural Language Inference, Paraphrase Identification	Code Available
Muppet: Massive Multi-task Representations with Pre-Finetuning	Jan 26, 2021	Abstractive Text Summarization, Common Sense Reasoning	Code Available
Schema-Guided Semantic Accuracy: Faithfulness in Task-Oriented Dialogue Response Generation	Jan 29, 2023	Natural Language Inference, Response Generation	Code Available
Benchmarking Long-tail Generalization with Likelihood Splits	Oct 13, 2022	Benchmarking, Language Modeling	Code Available
Exploring Continual Learning of Compositional Generalization in NLI	Mar 7, 2024	Continual Learning, Natural Language Inference	Code Available
BatchPrompt: Accomplish more with less	Sep 1, 2023	8k, Language Modelling	Code Available
Baselines and test data for cross-lingual inference	Apr 18, 2017	Cross-Lingual Word Embeddings, Machine Translation	Code Available
Exploiting Word Semantics to Enrich Character Representations of Chinese Pre-trained Models	Jul 13, 2022	Machine Reading Comprehension, Natural Language Inference	Code Available
Scoring and Classifying Implicit Positive Interpretations: A Challenge of Class Imbalance	Aug 1, 2018	General Classification, Natural Language Inference	Code Available
Exploiting BERT to improve aspect-based sentiment analysis performance on Persian language	Dec 2, 2020	Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA)	Code Available
CsFEVER and CTKFacts: Acquiring Czech data for fact verification	Jan 26, 2022	Articles, Fact Checking	Code Available
Explanation Selection Using Unlabeled Data for Chain-of-Thought Prompting	Feb 9, 2023	Mathematical Reasoning, Natural Language Inference	Code Available
Natural Language Inference over Interaction Space: ICLR 2018 Reproducibility Report	Feb 9, 2018	Model Selection, Natural Language Inference	Code Available
Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations in a Label-Abundant Setup	Dec 12, 2021	Natural Language Inference, Transfer Learning	Code Available

All datasets SNLI RTE MultiNLI QNLI ANLI test WNLI LiDiRus RCB TERRa CommitmentBank SciTail FarsTail

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	UnitedSynT5 (3B)	% Test Accuracy	94.7	—	Unverified
2	UnitedSynT5 (335M)	% Test Accuracy	93.5	—	Unverified
3	EFL (Entailment as Few-shot Learner) + RoBERTa-large	% Test Accuracy	93.1	—	Unverified
4	Neural Tree Indexers for Text Understanding	% Test Accuracy	93.1	—	Unverified
5	RoBERTa-large + self-explaining layer	% Test Accuracy	92.3	—	Unverified
6	RoBERTa-large+Self-Explaining	% Test Accuracy	92.3	—	Unverified
7	CA-MTL	% Test Accuracy	92.1	—	Unverified
8	SemBERT	% Test Accuracy	91.9	—	Unverified
9	MT-DNN-SMARTLARGEv0	% Test Accuracy	91.7	—	Unverified
10	MT-DNN-SMART_100%ofTrainingData	Dev Accuracy	91.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Vega v2 6B (KD-based prompt transfer)	Accuracy	96	—	Unverified
2	PaLM 540B (fine-tuned)	Accuracy	95.7	—	Unverified
3	Turing NLR v5 XXL 5.4B (fine-tuned)	Accuracy	94.1	—	Unverified
4	ST-MoE-32B 269B (fine-tuned)	Accuracy	93.5	—	Unverified
5	DeBERTa-1.5B	Accuracy	93.2	—	Unverified
6	MUPPET Roberta Large	Accuracy	92.8	—	Unverified
7	DeBERTaV3large	Accuracy	92.7	—	Unverified
8	T5-XXL 11B (fine-tuned)	Accuracy	92.5	—	Unverified
9	T5-XXL 11B	Accuracy	92.5	—	Unverified
10	UL2 20B (fine-tuned)	Accuracy	92.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UnitedSynT5 (3B)	Matched	92.6	—	Unverified
2	Turing NLR v5 XXL 5.4B (fine-tuned)	Matched	92.6	—	Unverified
3	T5-XXL 11B (fine-tuned)	Matched	92	—	Unverified
4	T5	Matched	92	—	Unverified
5	T5-11B	Mismatched	91.7	—	Unverified
6	T5-3B	Matched	91.4	—	Unverified
7	ALBERT	Matched	91.3	—	Unverified
8	Adv-RoBERTa ensemble	Matched	91.1	—	Unverified
9	DeBERTa (large)	Matched	91.1	—	Unverified
10	SMARTRoBERTa	Dev Matched	91.1	—	Unverified
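
The accuracy figures in the tables above are percentages of premise/hypothesis pairs that a model labels correctly. As a minimal sketch of that metric, assuming predictions and gold labels are given as parallel lists of label strings (the function name `accuracy_percent` and the toy data are hypothetical, not taken from any benchmark):

```python
def accuracy_percent(predictions, gold):
    """Return the percentage of positions where predictions match gold labels."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        raise ValueError("cannot compute accuracy on an empty evaluation set")
    correct = sum(p == g for p, g in zip(predictions, gold))
    return 100.0 * correct / len(gold)


# Toy data for illustration only: three of four predictions are correct.
gold = ["entailment", "neutral", "contradiction", "entailment"]
predictions = ["entailment", "neutral", "neutral", "entailment"]
print(accuracy_percent(predictions, gold))  # 75.0
```

Leaderboard rows are only comparable when they report the same metric on the same split; entries such as "Dev Accuracy" or "Dev Matched" are measured on a development set rather than a test set.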