SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 801825 of 659983 papers

TitleStatusHype
Weakly Supervised Detection of Hallucinations in LLM ActivationsCode5
Vectorized and performance-portable QuicksortCode5
Less-to-More Generalization: Unlocking More Controllability by In-Context GenerationCode5
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View SynthesisCode5
PaSa: An LLM Agent for Comprehensive Academic Paper SearchCode5
Voyager: An Open-Ended Embodied Agent with Large Language ModelsCode5
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIsCode5
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-ExpertsCode5
On the Computation of the Fisher Information in Continual LearningCode5
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse AttentionCode5
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a SurveyCode5
GRUtopia: Dream General Robots in a City at ScaleCode5
Fractal Generative ModelsCode5
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal RepresentationsCode5
Factuality Enhanced Language Models for Open-Ended Text GenerationCode5
Tool Learning with Foundation ModelsCode5
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic EvaluatorsCode5
Deep Lake: a Lakehouse for Deep LearningCode5
MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkitCode5
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-DictionaryCode5
Efficient Diffusion Model for Image Restoration by Residual ShiftingCode5
τ^2-Bench: Evaluating Conversational Agents in a Dual-Control EnvironmentCode5
DUSt3R: Geometric 3D Vision Made EasyCode5
Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar ModelingCode5
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution EngineCode5
Show:102550
← PrevPage 33 of 26400Next →