SOTAVerified

Winogrande

Papers

Showing 125 of 26 papers

TitleStatusHype
Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma20
More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment0
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization0
Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models0
Bridging the Gap: Enhancing LLM Performance for Low-Resource African Languages with New Benchmarks, Fine-Tuning, and Cultural AdjustmentsCode1
PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches0
Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models0
metabench -- A Sparse Benchmark to Measure General Ability in Large Language ModelsCode0
Promises, Outlooks and Challenges of Diffusion Language Modeling0
LayerSkip: Enabling Early Exit Inference and Self-Speculative DecodingCode3
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-TuningCode9
Who's Harry Potter? Approximate Unlearning in LLMs0
Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated ExplanationsCode0
On Curriculum Learning for Commonsense ReasoningCode0
A Warm Start and a Clean Crawled Corpus - A Recipe for Good Language Models0
ST-MoE: Designing Stable and Transferable Sparse Expert ModelsCode3
An Application of Pseudo-Log-Likelihoods to Natural Language Scoring0
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models0
Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations in a Label-Abundant SetupCode0
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Not-so fine-tuning: Measures of Common Sense for Language Models0
Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction0
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask BenchmarkCode1
Generative Data Augmentation for Commonsense ReasoningCode1
TTTTTackling WinoGrande Schemas0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.