SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 78517875 of 177340 papers

TitleStatusHype
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene UnderstandingCode2
SEGAN: Speech Enhancement Generative Adversarial NetworkCode2
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
Progressive Distillation for Fast Sampling of Diffusion ModelsCode2
Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic SegmentationCode2
SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic ScenesCode2
Think While You Generate: Discrete Diffusion with Planned DenoisingCode2
VDT: General-purpose Video Diffusion Transformers via Mask ModelingCode2
Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMambaCode2
Lost in the Middle: How Language Models Use Long ContextsCode2
Representation Engineering: A Top-Down Approach to AI TransparencyCode2
WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian SplattingCode2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
Aligning Text-to-Image Diffusion Models with Reward BackpropagationCode2
Temporal Graph Benchmark for Machine Learning on Temporal GraphsCode2
A Survey on Data Augmentation in Large Model EraCode2
On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model InferenceCode2
Active-Learning-as-a-Service: An Automatic and Efficient MLOps System for Data-Centric AICode2
XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented GenerationCode2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via CipherCode2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPOCode2
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMsCode2
AnyLoc: Towards Universal Visual Place RecognitionCode2
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor RegressionCode2
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory MatchingCode2
Show:102550
← PrevPage 315 of 7094Next →