SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 86268650 of 474278 papers

TitleStatusHype
LeMat-Traj: A Scalable and Unified Dataset of Materials Trajectories for Atomistic ModelingCode0
Learning More with Less: A Generalizable, Self-Supervised Framework for Privacy-Preserving Capacity Estimation with EV Charging DataCode0
BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion DeblurringCode0
The Face of Persuasion: Analyzing Bias and Generating Culture-Aware AdsCode0
Layer as Puzzle Pieces: Compressing Large Language Models through Layer ConcatenationCode0
The Road Less Traveled: Enhancing Exploration in LLMs via Sequential SamplingCode0
GraphMind: Interactive Novelty Assessment System for Accelerating Scientific DiscoveryCode0
NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image GenerationCode0
ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object DetectionCode0
RewardRank: Optimizing True Learning-to-Rank UtilityCode0
EvolveR: Self-Evolving LLM Agents through an Experience-Driven LifecycleCode0
STABLE: Gated Continual Learning for Large Language ModelsCode0
Narrowing Action Choices with AI Improves Human Sequential DecisionsCode0
AtomBench: A Benchmark for Generative Atomic Structure Models using GPT, Diffusion, and Flow ArchitecturesCode0
Memory-SAM: Human-Prompt-Free Tongue Segmentation via Retrieval-to-PromptCode0
OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case StudiesCode0
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning0
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents0
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar0
Are Large Reasoning Models Interruptible?0
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding0
C4D: 4D Made from 3D through Dual Correspondences0
LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training0
WithAnyone: Towards Controllable and ID Consistent Image Generation0
Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation0
Show:102550
← PrevPage 346 of 18972Next →