SOTAVerified

StrategyQA

StrategyQA aims to measure the ability of models to answer questions that require multi-step implicit reasoning.

Source: BIG-bench

Papers

Showing 2130 of 40 papers

TitleStatusHype
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions0
The ART of LLM Refinement: Ask, Refine, and Trust0
Tailoring Self-Rationalizers with Multi-Reward DistillationCode0
Improving Planning with Large Language Models: A Modular Agentic ArchitectureCode1
Large Language Models Are Also Good Prototypical Commonsense Reasoners0
Answering Unseen Questions With Smaller Language Models Using Rationale Generation and Dense Retrieval0
Teaching Smaller Language Models To Generalise To Unseen Compositional QuestionsCode0
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive TasksCode1
Deduction under Perturbed Evidence: Probing Student Simulation Capabilities of Large Language Models0
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.