SOTAVerified

Winogrande

Papers

Showing 1–25 of 26 papers

| Title | Status | Hype |
| --- | --- | --- |
| LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Code | 9 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | Code | 3 |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Code | 3 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2 |
| Bridging the Gap: Enhancing LLM Performance for Low-Resource African Languages with New Benchmarks, Fine-Tuning, and Cultural Adjustments | Code | 1 |
| WinoGrande: An Adversarial Winograd Schema Challenge at Scale | Code | 1 |
| UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark | Code | 1 |
| Generative Data Augmentation for Commonsense Reasoning | Code | 1 |
| PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches | | 0 |
| Promises, Outlooks and Challenges of Diffusion Language Modeling | | 0 |
| WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization | | 0 |
| A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models | | 0 |
| A Warm Start and a Clean Crawled Corpus - A Recipe for Good Language Models | | 0 |
| Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma2 | | 0 |
| Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models | | 0 |
| More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment | | 0 |
| Not-so fine-tuning: Measures of Common Sense for Language Models | | 0 |
| Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models | | 0 |
| TTTTTackling WinoGrande Schemas | | 0 |
| Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction | | 0 |
| Who's Harry Potter? Approximate Unlearning in LLMs | | 0 |
| An Application of Pseudo-Log-Likelihoods to Natural Language Scoring | | 0 |
| On Curriculum Learning for Commonsense Reasoning | Code | 0 |
| metabench -- A Sparse Benchmark to Measure General Ability in Large Language Models | Code | 0 |
| Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated Explanations | Code | 0 |

No leaderboard results yet.