SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1685116900 of 474278 papers

TitleStatusHype
U-REPA: Aligning Diffusion U-Nets to ViTsCode1
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite ImageryCode1
Panorama Generation From NFoV Image Done RightCode1
Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal RepresentationsCode1
xKV: Cross-Layer SVD for KV-Cache CompressionCode1
TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court VideosCode1
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual TrackingCode1
Efficient Self-Supervised Adaptation for Medical Image AnalysisCode1
Global-Local Tree Search in VLMs for 3D Indoor Scene GenerationCode1
LoTUS: Large-Scale Machine Unlearning with a Taste of UncertaintyCode1
SyncVP: Joint Diffusion for Synchronous Multi-Modal Video PredictionCode1
Minimum Volume Conformal Sets for Multivariate RegressionCode1
Context-Enhanced Memory-Refined Transformer for Online Action DetectionCode1
Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion ModelsCode1
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion ModelsCode1
Do Your Best and Get Enough Rest for Continual LearningCode1
AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent CollaborationCode1
WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article GenerationCode1
Sun-Shine: A Large Language Model for Tibetan CultureCode1
Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-OptimizationCode1
AMD-Hummingbird: Towards an Efficient Text-to-Video ModelCode1
CoMP: Continual Multimodal Pre-training for Vision Foundation ModelsCode1
PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language ModelCode1
Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image DerainingCode1
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-TrainingCode1
Bootstrapped Model Predictive ControlCode1
Adaptive Unimodal Regulation for Balanced Multimodal Information AcquisitionCode1
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive LearningCode1
Equivariant Image ModelingCode1
CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AICode1
LookAhead Tuning: Safer Language Models via Partial Answer PreviewsCode1
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text RecognitionCode1
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model AlignmentCode1
MoST: Efficient Monarch Sparse Tuning for 3D Representation LearningCode1
Language Model Uncertainty Quantification with Attention ChainCode1
FRESA:Feedforward Reconstruction of Personalized Skinned Avatars from Few ImagesCode1
Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality RobustnessCode1
TensoFlow: Tensorial Flow-based Sampler for Inverse RenderingCode1
LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual LearningCode1
SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity PredictionCode1
Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational CapabilitiesCode1
MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss AlpsCode1
M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous DrivingCode1
PG-SAM: Prior-Guided SAM with Medical for Multi-organ SegmentationCode1
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
PHT-CAD: Efficient CAD Parametric Primitive Analysis with Progressive Hierarchical TuningCode1
HyperNOs: Automated and Parallel Library for Neural Operators ResearchCode1
End-to-End Implicit Neural Representations for ClassificationCode1
DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided DistillationCode1
Real-World Remote Sensing Image Dehazing: Benchmark and BaselineCode1
Show:102550
← PrevPage 338 of 9486Next →