SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
2k
2k
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 31–40 of 288 papers
Title
Date
Tasks
Status
Hype
Stackelberg Game Preference Optimization for Data-Efficient Alignment of Language Models
Feb 25, 2025
2k
Models Alignment
—
Unverified
0
Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks
Feb 24, 2025
2k
ARC
—
Unverified
0
Exact Recovery of Sparse Binary Vectors from Generalized Linear Measurements
Feb 21, 2025
2k
Quantization
—
Unverified
0
Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning
Feb 18, 2025
2k
Long-Context Understanding
—
Unverified
0
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Feb 17, 2025
2k
Autonomous Driving
Code
Code Available
3
Improved Regret in Stochastic Decision-Theoretic Online Learning under Differential Privacy
Feb 16, 2025
2k
—
Unverified
0
CascadeV: An Implementation of Wurstchen Architecture for Video Generation
Jan 28, 2025
2k
Video Generation
Code
Code Available
1
Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains
Jan 24, 2025
2k
Legal Reasoning
—
Unverified
0
TimeLogic: A Temporal Logic Benchmark for Video QA
Jan 13, 2025
2k
Action Segmentation
—
Unverified
0
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Jan 9, 2025
2k
8k
—
Unverified
0
Show:
10
25
50
← Prev
Page 4 of 29
Next →
No leaderboard results yet.