SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
HellaSwag
HellaSwag
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 31–39 of 39 papers
Title
Date
Tasks
Status
Hype
Score
Who's Harry Potter? Approximate Unlearning in LLMs
Oct 3, 2023
ARC
GPU
—
Unverified
0
0
Towards Multilingual LLM Evaluation for European Languages
Oct 11, 2024
ARC
GSM8K
—
Unverified
0
0
Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma2
May 9, 2025
ARC
Belebele
—
Unverified
0
0
HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning
Feb 17, 2025
HellaSwag
—
Unverified
0
0
GRIN: GRadient-INformed MoE
Sep 18, 2024
HellaSwag
HumanEval
—
Unverified
0
0
More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment
Apr 3, 2025
ARC
HellaSwag
—
Unverified
0
0
Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models
Feb 20, 2025
HellaSwag
Memorization
—
Unverified
0
0
Domain-Adaptive Continued Pre-Training of Small Language Models
Apr 13, 2025
Domain Adaptation
HellaSwag
—
Unverified
0
0
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning
Apr 29, 2020
All
HellaSwag
—
Unverified
0
0
Show:
10
25
50
← Prev
Page 4 of 4
Next →
No leaderboard results yet.