SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1726-1750 of 659,983 papers

Title | Status | Hype
Wildfire Spread Scenarios: Increasing Sample Diversity of Segmentation Diffusion Models with Training-Free Methods | Code | 0
CoVR-R: Reason-Aware Composed Video Retrieval | Code | 0
From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering | Code | 0
EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models | Code | 0
CurveStream: Boosting Streaming Video Understanding in MLLMs via Curvature-Aware Hierarchical Visual Memory Management | Code | 0
MagicSeg: Open-World Segmentation Pretraining via Counterfactural Diffusion-Based Auto-Generation | Code | 0
Semantic Audio-Visual Navigation in Continuous Environments | Code | 0
RouterKGQA: Specialized--General Model Routing for Constraint-Aware Knowledge Graph Question Answering | Code | 0
Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs | Code | 0
MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space Models | Code | 0
Agentic Harness for Real-World Compilers | Code | 0
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus | Code | 0
ReLi3D: Relightable Multi-view 3D Reconstruction with Disentangled Illumination | - | 1
Continual Learning for Food Category Classification Dataset: Enhancing Model Adaptability and Performance | - | 0
AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation | - | 0
RAM: Recover Any 3D Human Motion in-the-Wild | - | 0
NEC-Diff: Noise-Robust Event-RAW Complementary Diffusion for Seeing Motion in Extreme Darkness | Code | 0
ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severity Estimation of Lesions in Invasive Coronary Angiography Images | - | 0
Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation | - | 0
UniFluids: Unified Neural Operator Learning with Conditional Flow-matching | - | 0
Ca2+ transient detection and segmentation with the Astronomically motivated algorithm for Background Estimation And Transient Segmentation (Astro-BEATS) | - | 0
The Efficiency Attenuation Phenomenon: A Computational Challenge to the Language of Thought Hypothesis | - | 0
LLM-Enhanced Energy Contrastive Learning for Out-of-Distribution Detection in Text-Attributed Graphs | - | 0
MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery | - | 0
InjectFlow: Weak Guides Strong via Orthogonal Injection for Flow Matching | - | 0
Page 70 of 26,400