SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers · 247,172 code links · 4,818 tasks

Papers

Showing 451-500 of 658,356 papers

Title | Status | Hype
Better than classical? The subtle art of benchmarking quantum machine learning models | Code | 7
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant | Code | 7
GenAD: Generalized Predictive Model for Autonomous Driving | Code | 7
FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Code | 7
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Code | 7
AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving | Code | 7
MAGI-1: Autoregressive Video Generation at Scale | Code | 7
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | Code | 7
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development | Code | 7
Kimi-Audio Technical Report | Code | 7
Bilateral Reference for High-Resolution Dichotomous Image Segmentation | Code | 7
EvoGP: A GPU-accelerated Framework for Tree-based Genetic Programming | Code | 7
AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems | Code | 7
StarCoder 2 and The Stack v2: The Next Generation | Code | 7
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming | Code | 7
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems | Code | 7
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers | Code | 7
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing | Code | 7
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases | Code | 7
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance | Code | 7
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture | Code | 7
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters | Code | 7
MagicQuill: An Intelligent Interactive Image Editing System | Code | 7
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training | Code | 7
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding | Code | 7
Faster Video Diffusion with Trainable Sparse Attention | Code | 7
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? | Code | 7
EasySpider: A No-Code Visual System for Crawling the Web | Code | 7
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Code | 7
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors | Code | 7
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought | Code | 7
Simulating 500 million years of evolution with a language model | Code | 7
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data | Code | 7
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models | Code | 7
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking | Code | 7
Chinese-Vicuna: A Chinese Instruction-following Llama-based Model | Code | 7
Fast Video Generation with Sliding Tile Attention | Code | 7
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos | Code | 7
Learning Multi-dimensional Human Preference for Text-to-Image Generation | Code | 7
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation | Code | 7
Mixture-of-Agents Enhances Large Language Model Capabilities | Code | 7
Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Code | 7
Foundation Models for Time Series Analysis: A Tutorial and Survey | Code | 7
Scaling Vision Pre-Training to 4K Resolution | Code | 7
DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion | Code | 7
Symmetry Considerations for Learning Task Symmetric Robot Policies | Code | 7
PromptWizard: Task-Aware Prompt Optimization Framework | Code | 7
ColPali: Efficient Document Retrieval with Vision Language Models | Code | 7
Large Language Diffusion Models | Code | 7
Chameleon: Mixed-Modal Early-Fusion Foundation Models | Code | 7
Page 10 of 13,168