SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 36013625 of 661570 papers

TitleStatusHype
CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think0
Regret Bounds for Competitive Resource Allocation with Endogenous Costs0
Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval0
Security awareness in LLM agents: the NDAI zone case0
Generalized Hand-Object Pose Estimation with Occlusion Awareness0
Behavioral Fingerprints for LLM Endpoint Stability and Identity0
SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models0
Man and machine: artificial intelligence and judicial decision making0
How Uncertainty Estimation Scales with Sampling in Reasoning Models0
SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation0
CustomTex: High-fidelity Indoor Scene Texturing via Multi-Reference Customization0
Parallelograms Strike Back: LLMs Generate Better Analogies than People0
CAMO: A Conditional Neural Solver for the Multi-objective Multiple Traveling Salesman Problem0
A Dataset and Resources for Identifying Patient Health Literacy Information from Clinical Notes0
Serendipity by Design: Evaluating the Impact of Cross-domain Mappings on Human and LLM Creativity0
Position: Spectral GNNs Are Neither Spectral Nor Superior for Node Classification0
SHAPCA: Consistent and Interpretable Explanations for Machine Learning Models on Spectroscopy Data0
UGID: Unified Graph Isomorphism for Debiasing Large Language Models0
Enhancing Pretrained Model-based Continual Representation Learning via Guided Random Projection0
D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding0
Fast and Effective Computation of Generalized Symmetric Matrix Factorization0
Optimal Splitting of Language Models from Mixtures to Specialized Domains0
VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models0
ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation0
Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation0
Show:102550
← PrevPage 145 of 26463Next →