SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5826-5850 of 661570 papers

Title | Status | Hype
Semantic Context Matters: Improving Conditioning for Autoregressive Models | Code | 0
Echo-CoPilot: A Multiple-Perspective Agentic Framework for Reliable Echocardiography Interpretation | Code | 0
SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models | Code | 0
Cross-RAG: Zero-Shot Retrieval-Augmented Time Series Forecasting via Cross-Attention | Code | 0
HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System | Code | 0
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation | Code | 0
TextOVSR: Text-Guided Real-World Opera Video Super-Resolution | Code | 0
Dataset Diversity Metrics and Impact on Classification Models | Code | 0
Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models | Code | 0
IRIS: Intersection-aware Ray-based Implicit Editable Scenes | Code | 0
GradCFA: A Hybrid Gradient-Based Counterfactual and Feature Attribution Explanation Algorithm for Local Interpretation of Neural Networks | Code | 0
When Does Sparsity Mitigate the Curse of Depth in LLMs | Code | 0
Unlocking the Value of Text: Event-Driven Reasoning and Multi-Level Alignment for Time Series Forecasting | Code | 0
Seeing Beyond: Extrapolative Domain Adaptive Panoramic Segmentation | Code | 0
Mixture-of-Depths Attention | Code | 0
Hilbert: Recursively Building Formal Proofs with Informal Reasoning | Code | 0
Flow Matching for Tabular Data Synthesis | Code | 0
Learning complete and explainable visual representations from itemized text supervision | Code | 0
CRIMSON: A Clinically-Grounded LLM-Based Metric for Generative Radiology Report Evaluation | Code | 0
The Agentic Researcher: A Practical Guide to AI-Assisted Research in Mathematics and Machine Learning | Code | 0
RealVLG-R1: A Large-Scale Real-World Visual-Language Grounding Benchmark for Robotic Perception and Manipulation | Code | 0
Real-Time Oriented Object Detection Transformer in Remote Sensing Images | Code | 0
CoD: A Diffusion Foundation Model for Image Compression | Code | 0
IgPose: A Generative Data-Augmented Pipeline for Robust Immunoglobulin-Antigen Binding Prediction | Code | 0
Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning | Code | 0
Page 234 of 26463