SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 65516600 of 661570 papers

TitleStatusHype
ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion TransformersCode2
AIR-Bench: Automated Heterogeneous Information Retrieval BenchmarkCode2
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask LearningCode2
DINO-Foresight: Looking into the Future with DINOCode2
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text DetectionCode2
ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual DataCode2
DLF: Disentangled-Language-Focused Multimodal Sentiment AnalysisCode2
Generative Inbetweening through Frame-wise Conditions-Driven Video GenerationCode2
Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning ApproachCode2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation LearningCode2
Predicting the Original Appearance of Damaged Historical DocumentsCode2
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform MotionsCode2
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within GenerationCode2
The dark side of the forces: assessing non-conservative force models for atomistic machine learningCode2
SCoralDet: Efficient real-time underwater soft coral detection with YOLOCode2
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial ReasoningCode2
Gramian Multimodal Representation Learning and AlignmentCode2
Causal Diffusion Transformers for Generative ModelingCode2
LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input ContextsCode2
No More Adam: Learning Rate Scaling at Initialization is All You NeedCode2
FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural NetworksCode2
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image RetrievalCode2
SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion ModelsCode2
Reliable, Reproducible, and Really Fast Leaderboards with EvalicaCode2
Exploring Enhanced Contextual Information for Video-Level Object TrackingCode2
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video RecognitionCode2
AirMorph: Topology-Preserving Deep Learning for Pulmonary Airway AnalysisCode2
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition ControlCode2
Physics-based battery model parametrisation from impedance dataCode2
NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow ModelsCode2
Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-ReflectionCode2
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic PromptCode2
Memory Efficient Matting with Adaptive Token RoutingCode2
Mr. DETR: Instructive Multi-Route Training for Detection TransformersCode2
EvalGIM: A Library for Evaluating Generative Image ModelsCode2
Financial Fine-tuning a Large Time Series ModelCode2
UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging ModalitiesCode2
Simple Guidance Mechanisms for Discrete Diffusion ModelsCode2
Efficient Large-Scale Traffic Forecasting with Transformers: A Spatial Data Management PerspectiveCode2
GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs?Code2
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary ProjectsCode2
GaussianAD: Gaussian-Centric End-to-End Autonomous DrivingCode2
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy PredictionCode2
AutoPatent: A Multi-Agent Framework for Automatic Patent GenerationCode2
RemDet: Rethinking Efficient Model Design for UAV Object DetectionCode2
V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position EncodingCode2
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM ReasoningCode2
Diffusion-Enhanced Test-time Adaptation with Text and Image AugmentationCode2
Show:102550
← PrevPage 132 of 13232Next →