SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 72267250 of 474278 papers

TitleStatusHype
Enhancing Soccer Camera Calibration Through Keypoint ExploitationCode2
BEVLoc: Cross-View Localization and Matching via Birds-Eye-View SynthesisCode2
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video GenerationCode2
DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic StatesCode2
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to SeeCode2
Motion Forecasting in Continuous DrivingCode2
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse SamplingCode2
LeanAgent: Lifelong Learning for Formal Theorem ProvingCode2
MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal ModelCode2
ReFIR: Grounding Large Restoration Models with Retrieval AugmentationCode2
LLM-based SPARQL Query Generation from Natural Language over Federated Knowledge GraphsCode2
Think While You Generate: Discrete Diffusion with Planned DenoisingCode2
TRACE: Temporal Grounding Video LLM via Causal Event ModelingCode2
Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing ImagesCode2
BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile ManipulationCode2
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation DataCode2
Large Continual Instruction AssistantCode2
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains MoreCode2
FedGraph: A Research Library and Benchmark for Federated Graph LearningCode2
Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-ThoughtCode2
CAR: Controllable Autoregressive Modeling for Visual GenerationCode2
Ensured: Explanations for Decreasing the Epistemic Uncertainty in PredictionsCode2
SecAlign: Defending Against Prompt Injection with Preference OptimizationCode2
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNetCode2
TurtleBench: Evaluating Top Language Models via Real-World Yes/No PuzzlesCode2
Show:102550
← PrevPage 290 of 18972Next →