SOTAVerified

Winogrande

Papers

Showing 2126 of 26 papers

TitleStatusHype
Who's Harry Potter? Approximate Unlearning in LLMs0
An Application of Pseudo-Log-Likelihoods to Natural Language Scoring0
On Curriculum Learning for Commonsense ReasoningCode0
metabench -- A Sparse Benchmark to Measure General Ability in Large Language ModelsCode0
Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated ExplanationsCode0
Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations in a Label-Abundant SetupCode0
Show:102550
← PrevPage 3 of 3Next →

No leaderboard results yet.