SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1395114000 of 474278 papers

TitleStatusHype
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image GenerationCode2
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion RefinementCode2
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMsCode2
eRST: A Signaled Graph Theory of Discourse Relations and OrganizationCode2
self-prompting analogical reasoning for uav object detectionCode2
SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy DemonstrationsCode2
Explainable AI in Spatial AnalysisCode2
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq ModelCode2
Meta-Design Matters: A Self-Design Multi-Agent SystemCode2
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object TrajectoryCode2
GSPMD: General and Scalable Parallelization for ML Computation GraphsCode2
The More You See in 2D, the More You Perceive in 3DCode2
SpreadsheetLLM: Encoding Spreadsheets for Large Language ModelsCode2
Multi-Grained Angle Representation for Remote Sensing Object DetectionCode2
What Makes a Good Diffusion Planner for Decision Making?Code2
Tightly-Coupled LiDAR-IMU-Leg Odometry with Online Learned Leg Kinematics Incorporating Foot Tactile InformationCode2
4-bit Conformer with Native Quantization Aware Training for Speech RecognitionCode2
MVDream: Multi-view Diffusion for 3D GenerationCode2
Evolving Self-Assembling Neural Networks: From Spontaneous Activity to Experience-Dependent LearningCode2
Scaling Down Text Encoders of Text-to-Image Diffusion ModelsCode2
Fully Geometric Panoramic LocalizationCode2
Find Any Part in 3DCode2
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image PromptingCode2
AMP: Adversarial Motion Priors for Stylized Physics-Based Character ControlCode2
PaLM-E: An Embodied Multimodal Language ModelCode2
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and ActivationsCode2
Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document RestorationCode2
PRAM: Place Recognition Anywhere Model for Efficient Visual LocalizationCode2
Learning to Predict Without Looking Ahead: World Models Without Forward PredictionCode2
P2Object: Single Point Supervised Object Detection and Instance SegmentationCode2
The Revolution of Multimodal Large Language Models: A SurveyCode2
SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse ViewsCode2
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking FrameworkCode2
CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View GraphsCode2
Imagine while Reasoning in Space: Multimodal Visualization-of-ThoughtCode2
Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for RussianCode2
Uncertainty Quantification in Scientific Machine Learning: Methods, Metrics, and ComparisonsCode2
Learning to Act from Actionless Videos through Dense CorrespondencesCode2
Effective Long-Context Scaling of Foundation ModelsCode2
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional TransformerCode2
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?Code2
Palette: Image-to-Image Diffusion ModelsCode2
EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial NetworksCode2
PaLM: Scaling Language Modeling with PathwaysCode2
RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and TransformerCode2
TIPS: Text-Image Pretraining with Spatial AwarenessCode2
Equivariance and partial observations in Koopman operator theory for partial differential equationsCode2
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion TransferCode2
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement LearningCode2
Fast protein backbone generation with SE(3) flow matchingCode2
Show:102550
← PrevPage 280 of 9486Next →