SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 67516800 of 661570 papers

TitleStatusHype
A Systematic Benchmark of GAN Architectures for MRI-to-CT SynthesisCode0
Instructing Large Language Models for Low-Resource Languages: A Systematic Study for BasqueCode0
Position: Agentic Evolution is the Path to Evolving LLMsCode0
MR-GNF: Multi-Resolution Graph Neural Forecasting on Ellipsoidal Meshes for Efficient Regional Weather PredictionCode0
Visual-ERM: Reward Modeling for Visual Equivalence1
V-Bridge: Bridging Video Generative Priors to Versatile Few-shot Image Restoration1
MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection0
Benchmarking Large Language Models on Reference Extraction and Parsing in the Social Sciences and Humanities0
Developing and evaluating a chatbot to support maternal health care0
Global Sensitivity Analysis for Engineering Design Based on Individual Conditional Expectations0
Synthetic Melanoma Image Generation and Evaluation Using Generative Adversarial Networks0
NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL0
LLM-driven Multimodal Recommendation0
mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR0
ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models0
DeCode: Decoupling Content and Delivery for Medical QA0
Expert Selections In MoE Models Reveal (Almost) As Much As Text0
SPRig: Self-Supervised Pose-Invariant Rigging from Mesh Sequences0
Context Engineering: From Prompts to Corporate Multi-Agent Architecture0
Sobolev--Ricci Curvature0
NeuCo-Bench: A Novel Benchmark Framework for Neural Embeddings in Earth Observation0
Examining Users' Behavioural Intention to Use OpenClaw Through the Cognition--Affect--Conation Framework0
Taming the Long Tail: Efficient Item-wise Sharpness-Aware Minimization for LLM-based Recommender Systems0
DirPA: Addressing Prior Shift in Imbalanced Few-shot Crop-type Classification0
Rethinking VLMs for Image Forgery Detection and LocalizationCode0
Anchored Alignment: Preventing Positional Collapse in Multimodal Recommender SystemsCode0
Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization0
Spatial Reasoning is Not a Free Lunch: A Controlled Study on LLaVA0
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning0
Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation0
Embedded Quantum Machine Learning in Embedded Systems: Feasibility, Hybrid Architectures, and Quantum Co-Processors0
Decoding Matters: Efficient Mamba-Based Decoder with Distribution-Aware Deep Supervision for Medical Image Segmentation0
Asymptotic and Finite-Time Guarantees for Langevin-Based Temperature Annealing in InfoNCE0
Beyond Dense Futures: World Models as Structured Planners for Robotic Manipulation0
Mobile-VTON: High-Fidelity On-Device Virtual Try-On0
Building Effective AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned0
TerraFlow: Multimodal, Multitemporal Representation Learning for Earth Observation0
A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks0
Variational Garrote for Sparse Inverse Problems0
A Spectral Revisit of the Distributional Bellman Operator under the Cramér Metric0
Expert Pyramid Tuning: Efficient Parameter Fine-Tuning for Expertise-Driven Task Allocation0
DINOLight: Robust Ambient Light Normalization with Self-supervised Visual Prior Integration0
Node-RF: Learning Generalized Continuous Space-Time Scene Dynamics with Neural ODE-based NeRFs0
RTD-Guard: A Black-Box Textual Adversarial Detection Framework via Replacement Token Detection0
CA-HFP: Curvature-Aware Heterogeneous Federated Pruning with Model Reconstruction0
Early Pruning for Public Transport Routing0
Maximizing Incremental Information Entropy for Contrastive Learning0
Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization0
Feynman: Knowledge-Infused Diagramming Agent for Scalable Visual Designs0
Enhancing Novel View Synthesis via Geometry Grounded Set Diffusion0
Show:102550
← PrevPage 136 of 13232Next →