Natural Language Inference
Natural language inference (NLI) is the task of determining whether a "hypothesis" is true (entailment), false (contradiction), or undetermined (neutral) given a "premise".
Example:
| Premise | Label | Hypothesis | | --- | ---| --- | | A man inspects the uniform of a figure in some East Asian country. | contradiction | The man is sleeping. | | An older and younger man smiling. | neutral | Two men are smiling and laughing at the cats playing on the floor. | | A soccer game with multiple males playing. | entailment | Some men are playing a sport. |
Approaches used for NLI include earlier symbolic and statistical approaches to more recent deep learning approaches. Benchmark datasets used for NLI include SNLI, MultiNLI, SciTail, among others. You can get hands-on practice on the SNLI task by following this d2l.ai chapter.
Further readings:
Papers
Showing 1–10 of 1961 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | T5-3B (explanation prompting) | A1 | 81.8 | — | Unverified |
| 2 | T0-11B (explanation prompting) | A1 | 75.6 | — | Unverified |
| 3 | InfoBERT (RoBERTa) | A1 | 75 | — | Unverified |
| 4 | PaLM 2-L (one-shot) | A1 | 73.1 | — | Unverified |
| 5 | RoBERTa (Large) | A1 | 72.4 | — | Unverified |
| 6 | ALUM (RoBERTa-LARGE) | A1 | 72.3 | — | Unverified |
| 7 | XLNet (Large) | A1 | 70.3 | — | Unverified |
| 8 | PaLM 540B (Self Improvement, Self Consistency) | A2 | 66.5 | — | Unverified |
| 9 | PaLM 540B (Self Improvement, CoT Prompting) | A2 | 65.3 | — | Unverified |
| 10 | PaLM 540B (Self Improvement, Standard-Prompting) | A2 | 64.8 | — | Unverified |