SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 71017125 of 474278 papers

TitleStatusHype
Solving Spatial Supersensing Without Spatial SupersensingCode0
Towards Unified Vision Language Models for Forest Ecological Analysis in Earth ObservationCode0
Membership Inference Attacks Beyond OverfittingCode0
vMFCoOp: Towards Equilibrium on a Unified Hyperspherical Manifold for Prompting Biomedical VLMsCode0
MiMo-Embodied: X-Embodied Foundation Model Technical ReportCode0
Formal Abductive Latent Explanations for Prototype-Based NetworksCode0
TRIM: Scalable 3D Gaussian Diffusion Inference with Temporal and Spatial TrimmingCode0
Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual GenerationCode0
SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose ManipulationCode0
ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data AugmentationCode0
VisPlay: Self-Evolving Vision-Language Models from Images0
TC-Light: Temporally Coherent Generative Rendering for Realistic World Transfer0
CRISP: Persistent Concept Unlearning via Sparse Autoencoders0
Towards Efficient Multimodal Unified Reasoning Model via Model MergingCode0
Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence0
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation0
NaTex: Seamless Texture Generation as Latent Color Diffusion0
CAMS: Towards Compositional Zero-Shot Learning via Gated Cross-Attention and Multi-Space DisentanglementCode0
Enhancing Video Large Language Models with Structured Multi-Video Collaborative ReasoningCode0
Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language ModelsCode0
Simple Lines, Big Ideas: Towards Interpretable Assessment of Human Creativity from DrawingsCode0
Towards Metric-Aware Multi-Person Mesh Recovery by Jointly Optimizing Human Crowd in Camera SpaceCode0
RoMa v2: Harder Better Faster Denser Feature MatchingCode0
InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue ResolutionCode0
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated ReasoningCode0
Show:102550
← PrevPage 285 of 18972Next →