SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 80768100 of 474278 papers

TitleStatusHype
BlackboxNLP-2025 MIB Shared Task: Improving Circuit Faithfulness via Better Edge SelectionCode0
Understanding Multi-View TransformersCode0
Towards Real Unsupervised Anomaly Detection Via Confident Meta-LearningCode0
Uniform Discrete Diffusion with Metric Path for Video GenerationCode0
PSScreen V2: Partially Supervised Multiple Retinal Disease ScreeningCode0
Tree Ensemble Explainability through the Hoeffding Functional Decomposition and TreeHFD AlgorithmCode0
Augmenting Biological Fitness Prediction Benchmarks with Landscapes Features from GraphFLACode0
InteractComp: Evaluating Search Agents With Ambiguous QueriesCode0
Training-Free Safe Text Embedding Guidance for Text-to-Image Diffusion ModelsCode0
Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification0
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning0
A Luminance-Aware Multi-Scale Network for Polarization Image Fusion with a Multi-Scene DatasetCode0
Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?0
ZTRS: Zero-Imitation End-to-end Autonomous Driving with Trajectory ScoringCode0
GenTrack: A New Generation of Multi-Object TrackingCode0
Enhancing Pre-trained Representation Classifiability can Boost its InterpretabilityCode0
SCOPE: Saliency-Coverage Oriented Token Pruning for Efficient Multimodel LLMsCode0
RDB2G-Bench: A Comprehensive Benchmark for Automatic Graph Modeling of Relational DatabasesCode0
Radar and Event Camera Fusion for Agile Robot Ego-Motion EstimationCode0
PEARL: Peer-Enhanced Adaptive Radio via On-Device LLMCode0
Kernelized Sparse Fine-Tuning with Bi-level Parameter Competition for Vision ModelsCode0
FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point ArithmeticCode0
Information-Theoretic Discrete DiffusionCode0
Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio GenerationCode0
MAGNET: A Multi-Graph Attentional Network for Code Clone DetectionCode0
Show:102550
← PrevPage 324 of 18972Next →