SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 64016425 of 177340 papers

TitleStatusHype
AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed GradientsCode2
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and TransportationCode2
Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and AnalysisCode2
The Power of Noise: Redefining Retrieval for RAG SystemsCode2
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop DrivingCode2
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D RecognitionCode2
Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter FusionCode2
One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian OptimizationCode2
NeuRAD: Neural Rendering for Autonomous DrivingCode2
ControlVideo: Conditional Control for One-shot Text-driven Video Editing and BeyondCode2
MDETR - Modulated Detection for End-to-End Multi-Modal UnderstandingCode2
A Multi-task Learning Model for Chinese-oriented Aspect Polarity Classification and Aspect Term ExtractionCode2
vSHARP: variable Splitting Half-quadratic Admm algorithm for Reconstruction of inverse-ProblemsCode2
ReGenNet: Towards Human Action-Reaction SynthesisCode2
Scalable 3D Registration via Truncated Entry-wise Absolute ResidualsCode2
CRA5: Extreme Compression of ERA5 for Portable Global Climate and Weather Research via an Efficient Variational TransformerCode2
EffiBench: Benchmarking the Efficiency of Automatically Generated CodeCode2
Towards High-Quality 3D Motion Transfer with Realistic Apparel AnimationCode2
FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language ModelsCode2
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene UnderstandingCode2
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection EditingCode2
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward SystemsCode2
DRLE: Decentralized Reinforcement Learning at the Edge for Traffic Light Control in the IoVCode2
Perception Test: A Diagnostic Benchmark for Multimodal ModelsCode2
Log-based Anomaly Detection with Deep Learning: How Far Are We?Code2
Show:102550
← PrevPage 257 of 7094Next →