SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13511400 of 659983 papers

TitleStatusHype
Stop Overthinking: A Survey on Efficient Reasoning for Large Language ModelsCode4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement LearningCode4
UniK3D: Universal Camera Monocular 3D EstimationCode4
Sonata: Self-Supervised Learning of Reliable Point RepresentationsCode4
Cube: A Roblox View of 3D IntelligenceCode4
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid FrameworkCode4
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal ControlCode4
Cosmos-Reason1: From Physical Common Sense To Embodied ReasoningCode4
Multimodal Chain-of-Thought Reasoning: A Comprehensive SurveyCode4
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and BeyondCode4
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal FormalizationCode4
Retrieval-Augmented Generation with Hierarchical KnowledgeCode4
VLog: Video-Language Models by Generative Retrieval of Narration VocabularyCode4
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language ModelsCode4
LocAgent: Graph-Guided LLM Agents for Code LocalizationCode4
PharMolixFM: All-Atom Foundation Models for Molecular Modeling and GenerationCode4
Towards All-in-One Medical Image Re-IdentificationCode4
Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language ModelsCode4
LBM: Latent Bridge Matching for Fast Image-to-Image TranslationCode4
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement LearningCode4
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image GenerationCode4
Ideas in Inference-time Scaling can Benefit Generative Pre-training AlgorithmsCode4
Inductive Moment MatchingCode4
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RLCode4
PointVLA: Injecting the 3D World into Vision-Language-Action ModelsCode4
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive ReinforcementCode4
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context ControlCode4
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement LearningCode4
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT ModelCode4
Unified Reward Model for Multimodal Understanding and GenerationCode4
Factorio Learning EnvironmentCode4
ReasonGraph: Visualisation of Reasoning PathsCode4
DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement LearningCode4
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic KernelsCode4
UniTok: A Unified Tokenizer for Visual Generation and UnderstandingCode4
HVI: A New color space for Low-light Image EnhancementCode4
Distill Any Depth: Distillation Creates a Stronger Monocular Depth EstimatorCode4
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning AgentsCode4
SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model InferenceCode4
R1-Onevision:An Open-Source Multimodal Large Language Model Capable of Deep ReasoningCode4
LettuceDetect: A Hallucination Detection Framework for RAG ApplicationsCode4
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot ControlCode4
Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic EvaluationCode4
REFINE: Inversion-Free Backdoor Defense via Model ReprogrammingCode4
Natural Language GenerationCode4
SurveyX: Academic Survey Automation via Large Language ModelsCode4
LServe: Efficient Long-sequence LLM Serving with Unified Sparse AttentionCode4
Building reliable sim driving agents by scaling self-playCode4
Craw4LLM: Efficient Web Crawling for LLM PretrainingCode4
A deep learning framework for efficient pathology image analysisCode4
Show:102550
← PrevPage 28 of 13200Next →