SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1530115350 of 474278 papers

TitleStatusHype
Contrastive Self-Supervised Learning As Neural Manifold Packing0
Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs0
X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability0
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR0
Atomizer: Generalizing to new modalities by breaking satellite images down to a set of scalars0
A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation0
BUT System for the MLC-SLM Challenge0
OTFusion: Bridging Vision-only and Vision-Language Models via Optimal Transport for Transductive Zero-Shot Learning0
DualEdit: Dual Editing for Knowledge Updating in Vision-Language Models0
Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval0
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video UnderstandingCode0
Sharpness-Aware Machine Unlearning0
A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing0
A Memetic Walrus Algorithm with Expert-guided Strategy for Adaptive Curriculum Sequencing0
Evolution of ReID: From Early Methods to LLM Integration0
ESRPCB: an Edge guided Super-Resolution model and Ensemble learning for tiny Printed Circuit Board Defect detection0
Brain Imaging Foundation Models, Are We There Yet? A Systematic Review of Foundation Models for Brain Imaging and Biomedical Research0
Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding0
FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design0
PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue0
CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model0
Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis0
Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems0
Assessing the Limits of In-Context Learning beyond Functions using Partially Ordered Relation0
Understand the Implication: Learning to Think for Pragmatic Understanding0
Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection0
WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild0
Hybrid Polynomial Zonotopes: A Set Representation for Reachability Analysis in Hybrid Nonaffine Systems0
RL-Guided MPC for Autonomous Greenhouse Control0
Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning0
Efficient Medical VIE via Reinforcement Learning0
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention0
xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations0
MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model0
Forecast-Then-Optimize Deep Learning Methods0
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented ContrastCode1
Steering LLM Thinking with Budget GuidanceCode1
Test3R: Learning to Reconstruct 3D at Test TimeCode2
Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMsCode0
Gradient-Normalized Smoothness for Optimization with Approximate HessiansCode0
EvolvTrip: Enhancing Literary Character Understanding with Temporal Theory-of-Mind GraphsCode0
Imaging at the quantum limit with convolutional neural networksCode0
Enhancing Omics Cohort Discovery for Research on Neurodegeneration through Ontology-Augmented Embedding ModelsCode0
Discrete Diffusion in Large Language and Multimodal Models: A SurveyCode3
LTRR: Learning To Rank Retrievers for LLMsCode0
What Happens During the Loss Plateau? Understanding Abrupt Learning in TransformersCode0
Enforcing tail calibration when training probabilistic forecast modelsCode0
Federated ADMM from Bayesian DualityCode0
Align-then-Unlearn: Embedding Alignment for LLM UnlearningCode0
C-TLSAN: Content-Enhanced Time-Aware Long- and Short-Term Attention Network for Personalized RecommendationCode0
Show:102550
← PrevPage 307 of 9486Next →