SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1002610050 of 177340 papers

TitleStatusHype
LLM-FP4: 4-Bit Floating-Point Quantized TransformersCode2
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with TransformerCode2
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMsCode2
A Comprehensive Survey on Knowledge DistillationCode2
TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive VisualizationCode2
LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion ModelsCode2
MambaIC: State Space Models for High-Performance Learned Image CompressionCode2
Single Image Iterative Subject-driven Generation and EditingCode2
NuiScene: Exploring Efficient Generation of Unbounded Outdoor ScenesCode2
SaMam: Style-aware State Space Model for Arbitrary Image Style TransferCode2
Splat-LOAM: Gaussian Splatting LiDAR Odometry and MappingCode2
Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly DetectionCode2
Datasets for Depression Modeling in Social Media: An OverviewCode2
AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real WorldCode2
On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile DevicesCode2
Efficient Federated Learning Tiny Language Models for Mobile Network Feature PredictionCode2
An Illusion of Progress? Assessing the Current State of Web AgentsCode2
Re-thinking Temporal Search for Long-Form Video UnderstandingCode2
A Decade of Deep Learning for Remote Sensing Spatiotemporal Fusion: Advances, Challenges, and OpportunitiesCode2
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal PromptingCode2
VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality GenerationCode2
Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language ModelsCode2
LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language ModelsCode2
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large ImagesCode2
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting EditingCode2
Show:102550
← PrevPage 402 of 7094Next →