SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7126–7150 of 474,278 papers

Title | Status | Hype
Multiview Scene Graph | Code | 2
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation | Code | 2
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation | Code | 2
GS^3: Efficient Relighting with Triple Gaussian Splatting | Code | 2
Improving Long-Text Alignment for Text-to-Image Diffusion Models | Code | 2
It Takes Two to Tango: Directly Optimizing for Constrained Synthesizability in Generative Molecular Design | Code | 2
Process Reward Model with Q-Value Rankings | Code | 2
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models | Code | 2
MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding | Code | 2
nvTorchCam: An Open-source Library for Camera-Agnostic Differentiable Geometric Vision | Code | 2
Open World Object Detection: A Survey | Code | 2
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free | Code | 2
VideoAgent: Self-Improving Video Generation | Code | 2
SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity Recognition | Code | 2
A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration | Code | 2
Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads | Code | 2
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Code | 2
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks | Code | 2
GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed Graphs | Code | 2
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | Code | 2
Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements | Code | 2
TRESTLE: A Model of Concept Formation in Structured Domains | Code | 2
Few-shot Novel View Synthesis using Depth Aware 3D Gaussian Splatting | Code | 2
When Attention Sink Emerges in Language Models: An Empirical View | Code | 2
Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models | Code | 2
Page 286 of 18,972