SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1280112850 of 474278 papers

TitleStatusHype
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object TrajectoryCode2
GSPMD: General and Scalable Parallelization for ML Computation GraphsCode2
The More You See in 2D, the More You Perceive in 3DCode2
SpreadsheetLLM: Encoding Spreadsheets for Large Language ModelsCode2
Multi-Grained Angle Representation for Remote Sensing Object DetectionCode2
What Makes a Good Diffusion Planner for Decision Making?Code2
Tightly-Coupled LiDAR-IMU-Leg Odometry with Online Learned Leg Kinematics Incorporating Foot Tactile InformationCode2
4-bit Conformer with Native Quantization Aware Training for Speech RecognitionCode2
MVDream: Multi-view Diffusion for 3D GenerationCode2
Evolving Self-Assembling Neural Networks: From Spontaneous Activity to Experience-Dependent LearningCode2
Scaling Down Text Encoders of Text-to-Image Diffusion ModelsCode2
Fully Geometric Panoramic LocalizationCode2
Find Any Part in 3DCode2
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image PromptingCode2
AMP: Adversarial Motion Priors for Stylized Physics-Based Character ControlCode2
PaLM-E: An Embodied Multimodal Language ModelCode2
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and ActivationsCode2
Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document RestorationCode2
PRAM: Place Recognition Anywhere Model for Efficient Visual LocalizationCode2
Learning to Predict Without Looking Ahead: World Models Without Forward PredictionCode2
P2Object: Single Point Supervised Object Detection and Instance SegmentationCode2
The Revolution of Multimodal Large Language Models: A SurveyCode2
SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse ViewsCode2
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking FrameworkCode2
CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View GraphsCode2
Imagine while Reasoning in Space: Multimodal Visualization-of-ThoughtCode2
Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for RussianCode2
Uncertainty Quantification in Scientific Machine Learning: Methods, Metrics, and ComparisonsCode2
Learning to Act from Actionless Videos through Dense CorrespondencesCode2
Effective Long-Context Scaling of Foundation ModelsCode2
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional TransformerCode2
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?Code2
Palette: Image-to-Image Diffusion ModelsCode2
EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial NetworksCode2
PaLM: Scaling Language Modeling with PathwaysCode2
RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and TransformerCode2
TIPS: Text-Image Pretraining with Spatial AwarenessCode2
Equivariance and partial observations in Koopman operator theory for partial differential equationsCode2
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion TransferCode2
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement LearningCode2
Fast protein backbone generation with SE(3) flow matchingCode2
DeepMol: An Automated Machine and Deep Learning Framework for Computational ChemistrCode2
Immiscible Diffusion: Accelerating Diffusion Training with Noise AssignmentCode2
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video GenerationCode2
Remasking Discrete Diffusion Models with Inference-Time ScalingCode2
SCoralDet: Efficient real-time underwater soft coral detection with YOLOCode2
Simplifying, Stabilizing and Scaling Continuous-Time Consistency ModelsCode2
GestureDiffuCLIP: Gesture Diffusion Model with CLIP LatentsCode2
JourneyDB: A Benchmark for Generative Image UnderstandingCode2
X-maps: Direct Depth Lookup for Event-based Structured Light SystemsCode2
Show:102550
← PrevPage 257 of 9486Next →