SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1310113150 of 474278 papers

TitleStatusHype
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous TokensCode2
FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural OperatorsCode2
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function OptimizationCode2
Every Painting Awakened: A Training-free Framework for Painting-to-Animation GenerationCode2
VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool UseCode2
MimicGen: A Data Generation System for Scalable Robot Learning using Human DemonstrationsCode2
Where am I? Cross-View Geo-localization with Natural Language DescriptionsCode2
VideoComposer: Compositional Video Synthesis with Motion ControllabilityCode2
Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree SearchCode2
Excess Mass Estimates and Tests for MultimodalityCode2
Recommender Systems with Generative RetrievalCode2
BatchFormerV2: Exploring Sample Relationships for Dense Representation LearningCode2
CausalVAE: Structured Causal Disentanglement in Variational AutoencoderCode2
Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant TransformersCode2
Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe LossCode2
Depth Field Networks for Generalizable Multi-view Scene RepresentationCode2
Urban Architect: Steerable 3D Urban Scene Generation with Layout PriorCode2
Diffsound: Discrete Diffusion Model for Text-to-sound GenerationCode2
BitNet: Scaling 1-bit Transformers for Large Language ModelsCode2
Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time SeriesCode2
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic ImagesCode2
PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object DetectionCode2
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image GenerationCode2
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion RefinementCode2
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMsCode2
eRST: A Signaled Graph Theory of Discourse Relations and OrganizationCode2
self-prompting analogical reasoning for uav object detectionCode2
SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy DemonstrationsCode2
Explainable AI in Spatial AnalysisCode2
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq ModelCode2
Meta-Design Matters: A Self-Design Multi-Agent SystemCode2
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object TrajectoryCode2
GSPMD: General and Scalable Parallelization for ML Computation GraphsCode2
The More You See in 2D, the More You Perceive in 3DCode2
SpreadsheetLLM: Encoding Spreadsheets for Large Language ModelsCode2
Multi-Grained Angle Representation for Remote Sensing Object DetectionCode2
What Makes a Good Diffusion Planner for Decision Making?Code2
Tightly-Coupled LiDAR-IMU-Leg Odometry with Online Learned Leg Kinematics Incorporating Foot Tactile InformationCode2
4-bit Conformer with Native Quantization Aware Training for Speech RecognitionCode2
MVDream: Multi-view Diffusion for 3D GenerationCode2
Evolving Self-Assembling Neural Networks: From Spontaneous Activity to Experience-Dependent LearningCode2
Scaling Down Text Encoders of Text-to-Image Diffusion ModelsCode2
Fully Geometric Panoramic LocalizationCode2
Find Any Part in 3DCode2
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image PromptingCode2
AMP: Adversarial Motion Priors for Stylized Physics-Based Character ControlCode2
PaLM-E: An Embodied Multimodal Language ModelCode2
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and ActivationsCode2
Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document RestorationCode2
PRAM: Place Recognition Anywhere Model for Efficient Visual LocalizationCode2
Show:102550
← PrevPage 263 of 9486Next →