SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 726 to 750 of 659,983 papers

Title | Status | Hype
Evaluating Real-World Robot Manipulation Policies in Simulation | Code | 5
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model | Code | 5
Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments | Code | 5
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models | Code | 5
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct | Code | 5
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding | Code | 5
Allegro: Open the Black Box of Commercial-Level Video Generation Model | Code | 5
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation | Code | 5
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild | Code | 5
XFeat: Accelerated Features for Lightweight Image Matching | Code | 5
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent | Code | 5
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Code | 5
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions | Code | 5
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Code | 5
Fast Inference from Transformers via Speculative Decoding | Code | 5
TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty | Code | 5
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine | Code | 5
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Code | 5
OmniRe: Omni Urban Scene Reconstruction | Code | 5
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Code | 5
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models | Code | 5
GenCast: Diffusion-based ensemble forecasting for medium-range weather | Code | 5
Gaussian Opacity Fields: Efficient Adaptive Surface Reconstruction in Unbounded Scenes | Code | 5
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts | Code | 5
How to Design Translation Prompts for ChatGPT: An Empirical Study | Code | 5