SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1151-1175 of 177339 papers

Title | Status | Hype
Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs | Code | 5
Benchmarking the Myopic Trap: Positional Bias in Information Retrieval | Code | 5
Randomized Autoregressive Visual Generation | Code | 5
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Code | 5
FlowTok: Flowing Seamlessly Across Text and Image Tokens | Code | 5
Loki: An Open-Source Tool for Fact Verification | Code | 5
NeuralSVG: An Implicit Representation for Text-to-Vector Generation | Code | 5
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use | Code | 5
Weakly Supervised Detection of Hallucinations in LLM Activations | Code | 5
Vectorized and performance-portable Quicksort | Code | 5
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation | Code | 5
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis | Code | 5
PaSa: An LLM Agent for Comprehensive Academic Paper Search | Code | 5
Voyager: An Open-Ended Embodied Agent with Large Language Models | Code | 5
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs | Code | 5
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts | Code | 5
On the Computation of the Fisher Information in Continual Learning | Code | 5
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention | Code | 5
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Code | 5
GRUtopia: Dream General Robots in a City at Scale | Code | 5
Fractal Generative Models | Code | 5
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations | Code | 5
Factuality Enhanced Language Models for Open-Ended Text Generation | Code | 5
Tool Learning with Foundation Models | Code | 5
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators | Code | 5