SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 27012750 of 659983 papers

TitleStatusHype
Improved 3D Point-Line Mapping Regression for Camera RelocalizationCode3
Attention Distillation: A Unified Approach to Visual Characteristics TransferCode3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMsCode3
LongRoPE2: Near-Lossless LLM Context Window ScalingCode3
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object InteractionsCode3
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action DetectionCode3
LangProBe: a Language Programs BenchmarkCode3
Beyond Next-Token: Next-X Prediction for Autoregressive Visual GenerationCode3
The Mighty ToRR: A Benchmark for Table Reasoning and RobustnessCode3
BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life PredictionCode3
Self-rewarding correction for mathematical reasoningCode3
Harnessing Multiple Large Language Models: A Survey on LLM EnsembleCode3
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image GenerationCode3
S-Graphs 2.0 -- A Hierarchical-Semantic Optimization and Loop Closure for SLAMCode3
Chain of Draft: Thinking Faster by Writing LessCode3
Verdict: A Library for Scaling Judge-Time ComputeCode3
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMsCode3
AnyTop: Character Animation Diffusion with Any TopologyCode3
DICEPTION: A Generalist Diffusion Model for Visual Perceptual TasksCode3
Baichuan-Audio: A Unified Framework for End-to-End Speech InteractionCode3
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMsCode3
AlphaAgent: LLM-Driven Alpha Mining with Regularized Exploration to Counteract Alpha DecayCode3
KV-Edit: Training-Free Image Editing for Precise Background PreservationCode3
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and ImprovementCode3
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place RecognitionCode3
Curie: Toward Rigorous and Automated Scientific Experimentation with AI AgentsCode3
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMsCode3
Prompt-to-LeaderboardCode3
Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture GenerationCode3
Accelerating Neural Network Training: An Analysis of the AlgoPerf CompetitionCode3
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise DataCode3
CrossOver: 3D Scene Cross-Modal AlignmentCode3
A Comprehensive Survey on Composed Image RetrievalCode3
Slamming: Training a Speech Language Model on One GPU in a DayCode3
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object ManipulationCode3
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song GenerationCode3
Soundwave: Less is More for Speech-Text Alignment in LLMsCode3
Personalized Image Generation with Deep Generative Models: A Decade SurveyCode3
PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational PathsCode3
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge NetworksCode3
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
TokenSkip: Controllable Chain-of-Thought Compression in LLMsCode3
Intuitive physics understanding emerges from self-supervised pretraining on natural videosCode3
Learning Getting-Up Policies for Real-World Humanoid RobotsCode3
Stonefish: Supporting Machine Learning Research in Marine RoboticsCode3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information RetrievalCode3
LIMR: Less is More for RL ScalingCode3
Text-guided Sparse Voxel Pruning for Efficient 3D Visual GroundingCode3
Strassen Multisystolic Array Hardware ArchitecturesCode3
Automated Hypothesis Validation with Agentic Sequential FalsificationsCode3
Show:102550
← PrevPage 55 of 13200Next →