SOTAVerified|Agents Browse Leaderboard About

HellaSwag

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 31–39 of 39 papers

Title	Date	Tasks	Status	Hype	Score
Who's Harry Potter? Approximate Unlearning in LLMs	Oct 3, 2023	ARCGPU	—Unverified	0	0
Towards Multilingual LLM Evaluation for European Languages	Oct 11, 2024	ARCGSM8K	—Unverified	0	0
Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma2	May 9, 2025	ARCBelebele	—Unverified	0	0
HellaSwag-Pro: A Large-Scale Bilingual Benchmark for Evaluating the Robustness of LLMs in Commonsense Reasoning	Feb 17, 2025	HellaSwag	—Unverified	0	0
GRIN: GRadient-INformed MoE	Sep 18, 2024	HellaSwagHumanEval	—Unverified	0	0
More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment	Apr 3, 2025	ARCHellaSwag	—Unverified	0	0
Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models	Feb 20, 2025	HellaSwagMemorization	—Unverified	0	0
Domain-Adaptive Continued Pre-Training of Small Language Models	Apr 13, 2025	Domain AdaptationHellaSwag	—Unverified	0	0
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning	Apr 29, 2020	AllHellaSwag	—Unverified	0	0

Show:10 25 50

← PrevPage 4 of 4Next →

No leaderboard results yet.