SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 9761000 of 177339 papers

TitleStatusHype
EfficientRep:An Efficient Repvgg-style ConvNets with Hardware-aware Neural Network DesignCode5
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant TransformersCode5
Long-context LLMs Struggle with Long In-context LearningCode5
Track Anything: Segment Anything Meets VideosCode5
PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information FunnelingCode5
AppAgent: Multimodal Agents as Smartphone UsersCode5
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale ScenesCode5
High-Fidelity Simultaneous Speech-To-Speech TranslationCode5
ReFT: Representation Finetuning for Language ModelsCode5
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language ModelsCode5
Kimi-VL Technical ReportCode5
WebThinker: Empowering Large Reasoning Models with Deep Research CapabilityCode5
Segment Anything for Videos: A Systematic SurveyCode5
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Point Transformer V3: Simpler Faster StrongerCode5
DUET: Dual Clustering Enhanced Multivariate Time Series ForecastingCode5
TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting MethodsCode5
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder PipelineCode5
Watermark Anything with Localized MessagesCode5
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM CompressionCode5
R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models AccelerationCode5
Differentiable Tree Search NetworkCode5
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?Code5
LeVo: High-Quality Song Generation with Multi-Preference AlignmentCode5
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language ModelsCode5
Show:102550
← PrevPage 40 of 7094Next →