SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 66516700 of 661570 papers

TitleStatusHype
Tactile DreamFusion: Exploiting Tactile Sensing for 3D GenerationCode2
Proactive Agents for Multi-Turn Text-to-Image Generation Under UncertaintyCode2
Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular VideoCode2
M^3-20M: A Large-Scale Multi-Modal Molecule Dataset for AI-driven Drug Design and DiscoveryCode2
TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-ActionCode2
Perceptually Transparent Binaural Auralization of Simulated Sound FieldsCode2
PanoDreamer: Optimization-Based Single Image to 360 3D Scene With DiffusionCode2
LinVT: Empower Your Image-level Large Language Model to Understand VideosCode2
DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object DetectionCode2
Stag-1: Towards Realistic 4D Driving Simulation with Video Generation ModelCode2
Wavelet Diffusion Neural OperatorCode2
DreamColour: Controllable Video Colour Editing without TrainingCode2
C^2LEVA: Toward Comprehensive and Contamination-Free Language Model EvaluationCode2
Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene ReconstructionCode2
SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation LearningCode2
Federated Learning in Mobile Networks: A Comprehensive Case Study on Traffic ForecastingCode2
Monet: Mixture of Monosemantic Experts for TransformersCode2
QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint VideosCode2
Divot: Diffusion Powers Video Tokenizer for Comprehension and GenerationCode2
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual CompressionCode2
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic ModelsCode2
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal ModelCode2
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary SegmentationCode2
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from VideosCode2
HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian SplattingCode2
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene UnderstandingCode2
Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic SegmentationCode2
ZipAR: Accelerating Auto-regressive Image Generation through Spatial LocalityCode2
Beyond Local Sharpness: Communication-Efficient Global Sharpness-aware Minimization for Federated LearningCode2
CleanDIFT: Diffusion Features without NoiseCode2
Volumetrically Consistent 3D Gaussian RasterizationCode2
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and PruningCode2
JPC: Flexible Inference for Predictive Coding Networks in JAXCode2
FLAIR: VLM with Fine-grained Language-informed Image RepresentationsCode2
Good practices for evaluation of machine learning systemsCode2
Distilling Diffusion Models to Efficient 3D LiDAR Scene CompletionCode2
How to Correctly do Semantic Backpropagation on Language-based Agentic SystemsCode2
MmCows: A Multimodal Dataset for Dairy Cattle MonitoringCode2
Video Quality Assessment: A Comprehensive SurveyCode2
HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale DatasetCode2
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented GenerationCode2
Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image SynthesisCode2
Diffusion-based Visual Anagram as Multi-task LearningCode2
ProbPose: A Probabilistic Approach to 2D Human Pose EstimationCode2
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video GenerationCode2
Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate InteractionsCode2
Conformal Symplectic Optimization for Stable Reinforcement LearningCode2
Hacking CTFs with Plain AgentsCode2
Many-MobileNet: Multi-Model Augmentation for Robust Retinal Disease ClassificationCode2
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified FlowsCode2
Show:102550
← PrevPage 134 of 13232Next →