SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 72017225 of 474278 papers

TitleStatusHype
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to AutomationCode2
Thought2Text: Text Generation from EEG Signal using Large Language Models (LLMs)Code2
Reversible Decoupling Network for Single Image Reflection RemovalCode2
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked TextCode2
PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object DetectionCode2
Interactive4D: Interactive 4D LiDAR SegmentationCode2
Benchmarking Agentic Workflow GenerationCode2
Enhancing Soccer Camera Calibration Through Keypoint ExploitationCode2
Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient AttentionsCode2
Compositional Entailment Learning for Hyperbolic Vision-Language ModelsCode2
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration RateCode2
LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor ExtractionCode2
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win RatesCode2
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific HypothesesCode2
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and TrainingCode2
MatMamba: A Matryoshka State Space ModelCode2
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision TransformersCode2
Towards Natural Image Matting in the Wild via Real-Scenario PriorCode2
An Undetectable Watermark for Generative Image ModelsCode2
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference AccelerationCode2
CursorCore: Assist Programming through Aligning AnythingCode2
Sylber: Syllabic Embedding Representation of Speech from Raw AudioCode2
Spiking GS: Towards High-Accuracy and Low-Cost Surface Reconstruction via Spiking Neuron-based Gaussian SplattingCode2
Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and BeyondCode2
Towards Interpreting Visual Information Processing in Vision-Language ModelsCode2
Show:102550
← PrevPage 289 of 18972Next →