SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 11511200 of 659983 papers

TitleStatusHype
Wonder3D: Single Image to 3D using Cross-Domain DiffusionCode5
MobileVLM V2: Faster and Stronger Baseline for Vision Language ModelCode5
MV-Adapter: Multi-view Consistent Image Generation Made EasyCode5
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM WorkflowsCode5
DeepPhase: Periodic Autoencoders for Learning Motion Phase ManifoldsCode5
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language ModelingCode5
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech ModelCode5
Understanding R1-Zero-Like Training: A Critical PerspectiveCode5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive AnnotationsCode5
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to VerificationCode5
CogAgent: A Visual Language Model for GUI AgentsCode5
Transformer-Squared: Self-adaptive LLMsCode5
CogVLM: Visual Expert for Pretrained Language ModelsCode5
Aria: An Open Multimodal Native Mixture-of-Experts ModelCode5
Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model LearningCode5
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World DomainsCode5
A Brief Overview of AI Governance for Responsible Machine Learning SystemsCode5
Autoregressive Image Generation without Vector QuantizationCode5
Representing Long Volumetric Video with Temporal Gaussian HierarchyCode5
Scalable Diffusion Models with TransformersCode5
Awesome Multi-modal Object TrackingCode5
Trajectory Prediction Meets Large Language Models: A SurveyCode5
PaperBench: Evaluating AI's Ability to Replicate AI ResearchCode5
4th PVUW MeViS 3rd Place Report: Sa2VACode5
GAM(e) changer or not? An evaluation of interpretable machine learning models based on additive model constraintsCode5
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and ProspectsCode5
EfficientRep:An Efficient Repvgg-style ConvNets with Hardware-aware Neural Network DesignCode5
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant TransformersCode5
Long-context LLMs Struggle with Long In-context LearningCode5
Track Anything: Segment Anything Meets VideosCode5
PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information FunnelingCode5
AppAgent: Multimodal Agents as Smartphone UsersCode5
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale ScenesCode5
High-Fidelity Simultaneous Speech-To-Speech TranslationCode5
ReFT: Representation Finetuning for Language ModelsCode5
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language ModelsCode5
Kimi-VL Technical ReportCode5
WebThinker: Empowering Large Reasoning Models with Deep Research CapabilityCode5
Segment Anything for Videos: A Systematic SurveyCode5
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Point Transformer V3: Simpler Faster StrongerCode5
DUET: Dual Clustering Enhanced Multivariate Time Series ForecastingCode5
TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting MethodsCode5
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder PipelineCode5
Watermark Anything with Localized MessagesCode5
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM CompressionCode5
R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models AccelerationCode5
Differentiable Tree Search NetworkCode5
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?Code5
LeVo: High-Quality Song Generation with Multi-Preference AlignmentCode5
Show:102550
← PrevPage 24 of 13200Next →