SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 376400 of 659983 papers

TitleStatusHype
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code ProcessingCode7
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution RefinementCode7
Gravity-aligned Rotation Averaging with Circular RegressionCode7
DocETL: Agentic Query Rewriting and Evaluation for Complex Document ProcessingCode7
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real VideosCode7
AFlow: Automating Agentic Workflow GenerationCode7
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image AnimationCode7
O1 Replication Journey: A Strategic Progress Report -- Part 1Code7
Pyramidal Flow Matching for Efficient Video Generative ModelingCode7
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?Code7
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference AccelerationCode7
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion ModelsCode7
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AICode7
PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI SystemCode7
OmniGen: Unified Image GenerationCode7
LLaMA-Omni: Seamless Speech Interaction with Large Language ModelsCode7
gsplat: An Open-Source Library for Gaussian SplattingCode7
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge DiscoveryCode7
Mini-Omni: Language Models Can Hear, Talk While Thinking in StreamingCode7
FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual OdometryCode7
Real-Time Video Generation with Pyramid Attention BroadcastCode7
FourierKAN outperforms MLP on Text Classification Head Fine-tuningCode7
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language ModelsCode7
VITA: Towards Open-Source Interactive Omni Multimodal LLMCode7
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph DatabasesCode7
Show:102550
← PrevPage 16 of 26400Next →