SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 13901–13950 of 474278 papers

Title | Status | Hype
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D | Code | 2
Autonomous clustering by fast find of mass and distance peaks | Code | 2
Reconstructive Visual Instruction Tuning | Code | 2
Risk-Aware Off-Road Navigation via a Learned Speed Distribution Map | Code | 2
Risk-mediated dynamic regulation of effective contacts de-synchronizes outbreaks in metapopulation epidemic models | Code | 2
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Code | 2
FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis | Code | 2
From implicit learning to explicit representations | Code | 2
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter | Code | 2
Video Probabilistic Diffusion Models in Projected Latent Space | Code | 2
PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Code | 2
Safety of Sampled-Data Systems with Control Barrier Functions via Approximate Discrete Time Models | Code | 2
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space | Code | 2
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training | Code | 2
MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare | Code | 2
Quantifying Memorization Across Neural Language Models | Code | 2
A General Language Assistant as a Laboratory for Alignment | Code | 2
Wukong: Towards a Scaling Law for Large-Scale Recommendation | Code | 2
Fine-Grained Face Swapping via Regional GAN Inversion | Code | 2
3DGen: Triplane Latent Diffusion for Textured Mesh Generation | Code | 2
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks | Code | 2
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models | Code | 2
Situational Graphs for Robot Navigation in Structured Indoor Environments | Code | 2
Neural Kernel Surface Reconstruction | Code | 2
MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction | Code | 2
First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track | Code | 2
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark | Code | 2
VidChapters-7M: Video Chapters at Scale | Code | 2
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | Code | 2
FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators | Code | 2
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization | Code | 2
Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation | Code | 2
VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use | Code | 2
MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations | Code | 2
Where am I? Cross-View Geo-localization with Natural Language Descriptions | Code | 2
VideoComposer: Compositional Video Synthesis with Motion Controllability | Code | 2
Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search | Code | 2
Excess Mass Estimates and Tests for Multimodality | Code | 2
Recommender Systems with Generative Retrieval | Code | 2
BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning | Code | 2
CausalVAE: Structured Causal Disentanglement in Variational Autoencoder | Code | 2
Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers | Code | 2
Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss | Code | 2
Depth Field Networks for Generalizable Multi-view Scene Representation | Code | 2
Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior | Code | 2
Diffsound: Discrete Diffusion Model for Text-to-sound Generation | Code | 2
BitNet: Scaling 1-bit Transformers for Large Language Models | Code | 2
Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series | Code | 2
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images | Code | 2
PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection | Code | 2