SOTAVerified

Natural Language Inference

Natural language inference (NLI) is the task of determining whether a "hypothesis" is true (entailment), false (contradiction), or undetermined (neutral) given a "premise".

Example:

| Premise | Label | Hypothesis |
| --- | --- | --- |
| A man inspects the uniform of a figure in some East Asian country. | contradiction | The man is sleeping. |
| An older and younger man smiling. | neutral | Two men are smiling and laughing at the cats playing on the floor. |
| A soccer game with multiple males playing. | entailment | Some men are playing a sport. |

Approaches to NLI range from earlier symbolic and statistical methods to more recent deep learning models. Benchmark datasets for NLI include SNLI, MultiNLI, and SciTail, among others. You can get hands-on practice on the SNLI task by following this d2l.ai chapter.
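As a toy illustration of the earlier statistical approaches mentioned above, here is a lexical-overlap heuristic for the three-way NLI labels. This is a deliberately weak, made-up baseline for exposition only (the negation list and the 0.7 threshold are arbitrary assumptions, not from any published system); real NLI systems are trained models.

```python
# Toy lexical-overlap NLI baseline (illustrative only, not a real system).
# Labels follow the standard scheme: entailment / contradiction / neutral.
NEGATION_WORDS = {"not", "no", "never"}  # crude, hand-picked cue list

def overlap_baseline(premise: str, hypothesis: str) -> str:
    p = set(premise.lower().split())
    h = set(hypothesis.lower().split())
    # Fraction of hypothesis words that also appear in the premise.
    coverage = len(p & h) / max(len(h), 1)
    if (h & NEGATION_WORDS) and not (p & NEGATION_WORDS):
        return "contradiction"  # negation in hypothesis but not premise
    if coverage > 0.7:          # arbitrary threshold for illustration
        return "entailment"     # most hypothesis words covered by premise
    return "neutral"

print(overlap_baseline(
    "A soccer game with multiple males playing.",
    "Some men are playing a sport."))
```

Heuristics like this are known to exploit dataset artifacts rather than perform real inference, which is one motivation for the bias-analysis papers listed below.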

Further reading:

Papers

Showing 1201–1250 of 1961 papers

| Title | Status | Hype |
| --- | --- | --- |
| Align, Mask and Select: A Simple Method for Incorporating Commonsense Knowledge into Language Representation Models | | 0 |
| SenseBERT: Driving Some Sense into BERT | | 0 |
| Abductive Commonsense Reasoning | Code | 0 |
| Reasoning-Driven Question-Answering for Natural Language Understanding | | 0 |
| StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding | | 0 |
| Do Neural Language Representations Learn Physical Commonsense? | Code | 0 |
| DELTA: A DEep learning based Language Technology plAtform | Code | 0 |
| Simple and Effective Text Matching with Richer Alignment Features | Code | 0 |
| Saama Research at MEDIQA 2019: Pre-trained BioBERT with Attention Visualisation for Medical Natural Language Inference | | 0 |
| LasigeBioTM at MEDIQA 2019: Biomedical Question Answering using Bidirectional Transformers and Named Entity Recognition | | 0 |
| MSIT\_SRIB at MEDIQA 2019: Knowledge Directed Multi-task Framework for Natural Language Inference in Clinical Domain | | 0 |
| KU\_ai at MEDIQA 2019: Domain-specific Pre-training and Transfer Learning for Medical NLI | | 0 |
| Overview of the MEDIQA 2019 Shared Task on Textual Inference, Question Entailment and Question Answering | Code | 1 |
| UU\_TAILS at MEDIQA 2019: Learning Textual Entailment in the Medical Domain | | 0 |
| WTMED at MEDIQA 2019: A Hybrid Approach to Biomedical Natural Language Inference | Code | 0 |
| ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge | | 0 |
| Sieg at MEDIQA 2019: Multi-task Neural Ensemble for Biomedical Inference and Entailment | | 0 |
| ARS\_NITK at MEDIQA 2019: Analysing Various Methods for Natural Language Inference, Recognising Question Entailment and Medical Question Answering System | | 0 |
| NCUEE at MEDIQA 2019: Medical Text Inference Using Ensemble BERT-BiLSTM-Attention Model | | 0 |
| Explaining Simple Natural Language Inference | Code | 0 |
| Fill the GAP: Exploiting BERT for Pronoun Resolution | Code | 0 |
| Annotating and analyzing the interactions between meaning relations | Code | 0 |
| ERNIE 2.0: A Continual Pre-training Framework for Language Understanding | Code | 3 |
| Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment | Code | 1 |
| A Hybrid Neural Network Model for Commonsense Reasoning | | 0 |
| LINSPECTOR WEB: A Multilingual Probing Suite for Word Representations | Code | 0 |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | Code | 1 |
| SpanBERT: Improving Pre-training by Representing and Predicting Spans | Code | 0 |
| Dr.Quad at MEDIQA 2019: Towards Textual Inference and Question Entailment using contextualized representations | | 0 |
| Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference | Code | 0 |
| A Pragmatics-Centered Evaluation Framework for Natural Language Understanding | Code | 0 |
| Fake News Detection as Natural Language Inference | Code | 0 |
| On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference | Code | 0 |
| Don't Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference | Code | 0 |
| UW-BHI at MEDIQA 2019: An Analysis of Representation Methods for Medical Natural Language Inference | | 0 |
| A Study of the Effect of Resolving Negation and Sentiment Analysis in Recognizing Text Entailment for Arabic | | 0 |
| Answer Extraction for Why Arabic Questions Answering Systems: EWAQ | | 0 |
| Reversing Gradients in Adversarial Domain Adaptation for Question Deduplication and Textual Entailment Tasks | | 0 |
| Ranking Generated Summaries by Correctness: An Interesting but Challenging Application for Natural Language Inference | | 0 |
| Deep Neural Model Inspection and Comparison via Functional Neuron Pathways | | 0 |
| Latent Structure Models for Natural Language Processing | | 0 |
| Learning Latent Trees with Stochastic Perturbations and Differentiable Dynamic Programming | Code | 0 |
| Investigating Biases in Textual Entailment Datasets | | 0 |
| XLNet: Generalized Autoregressive Pretraining for Language Understanding | Code | 1 |
| Surf at MEDIQA 2019: Improving Performance of Natural Language Inference in the Clinical Domain by Adopting Pre-trained Language Model | | 0 |
| Pre-Training with Whole Word Masking for Chinese BERT | Code | 3 |
| Can neural networks understand monotonicity reasoning? | Code | 0 |
| IITP at MEDIQA 2019: Systems Report for Natural Language Inference, Question Entailment and Question Answering | | 0 |
| Augmenting Neural Networks with First-order Logic | Code | 0 |
| Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets | Code | 1 |
Page 25 of 40

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | UnitedSynT5 (3B) | % Test Accuracy | 94.7 | | Unverified |
| 2 | UnitedSynT5 (335M) | % Test Accuracy | 93.5 | | Unverified |
| 3 | EFL (Entailment as Few-shot Learner) + RoBERTa-large | % Test Accuracy | 93.1 | | Unverified |
| 4 | Neural Tree Indexers for Text Understanding | % Test Accuracy | 93.1 | | Unverified |
| 5 | RoBERTa-large+Self-Explaining | % Test Accuracy | 92.3 | | Unverified |
| 6 | RoBERTa-large + self-explaining layer | % Test Accuracy | 92.3 | | Unverified |
| 7 | CA-MTL | % Test Accuracy | 92.1 | | Unverified |
| 8 | SemBERT | % Test Accuracy | 91.9 | | Unverified |
| 9 | MT-DNN-SMARTLARGEv0 | % Test Accuracy | 91.7 | | Unverified |
| 10 | MT-DNN-SMART_100%ofTrainingData | Dev Accuracy | 91.6 | | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Vega v2 6B (KD-based prompt transfer) | Accuracy | 96 | | Unverified |
| 2 | PaLM 540B (fine-tuned) | Accuracy | 95.7 | | Unverified |
| 3 | Turing NLR v5 XXL 5.4B (fine-tuned) | Accuracy | 94.1 | | Unverified |
| 4 | ST-MoE-32B 269B (fine-tuned) | Accuracy | 93.5 | | Unverified |
| 5 | DeBERTa-1.5B | Accuracy | 93.2 | | Unverified |
| 6 | MUPPET Roberta Large | Accuracy | 92.8 | | Unverified |
| 7 | DeBERTaV3large | Accuracy | 92.7 | | Unverified |
| 8 | T5-XXL 11B | Accuracy | 92.5 | | Unverified |
| 9 | T5-XXL 11B (fine-tuned) | Accuracy | 92.5 | | Unverified |
| 10 | ST-MoE-L 4.1B (fine-tuned) | Accuracy | 92.1 | | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | UnitedSynT5 (3B) | Matched | 92.6 | | Unverified |
| 2 | Turing NLR v5 XXL 5.4B (fine-tuned) | Matched | 92.6 | | Unverified |
| 3 | T5-XXL 11B (fine-tuned) | Matched | 92 | | Unverified |
| 4 | T5 | Matched | 92 | | Unverified |
| 5 | T5-11B | Mismatched | 91.7 | | Unverified |
| 6 | T5-3B | Matched | 91.4 | | Unverified |
| 7 | ALBERT | Matched | 91.3 | | Unverified |
| 8 | DeBERTa (large) | Matched | 91.1 | | Unverified |
| 9 | Adv-RoBERTa ensemble | Matched | 91.1 | | Unverified |
| 10 | SMARTRoBERTa | Dev Matched | 91.1 | | Unverified |