SOTAVerified

StrategyQA

StrategyQA aims to measure the ability of models to answer questions that require multi-step implicit reasoning.

Source: BIG-bench

Papers

Showing 31–40 of 40 papers

Title | Status | Hype
Self-Evaluation Guided Beam Search for Reasoning | | 0
Visconde: Multi-document QA with GPT-3 and Neural Reranking | Code | 1
Distilling Reasoning Capabilities into Smaller Language Models | Code | 0
Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts | | 0
Better Retrieval May Not Lead to Better Question Answering | | 0
PaLM: Scaling Language Modeling with Pathways | Code | 2
Training Compute-Optimal Large Language Models | Code | 6
Self-Consistency Improves Chain of Thought Reasoning in Language Models | Code | 1
Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies | Code | 1

No leaderboard results yet.