SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4276–4300 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| ResNet-50 with Class Reweighting and Anatomy-Guided Temporal Decoding for Gastrointestinal Video Analysis | | 0 |
| Facial Movement Dynamics Reveal Workload During Complex Multitasking | | 0 |
| CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution | | 0 |
| CrowdGaussian: Reconstructing High-Fidelity 3D Gaussians for Human Crowd from a Single Image | | 0 |
| Facts as First Class Objects: Knowledge Objects for Persistent LLM Memory | | 0 |
| EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards | | 0 |
| Dropout Robustness and Cognitive Profiling of Transformer Models via Stochastic Inference | | 0 |
| ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation | | 0 |
| Discovering Decoupled Functional Modules in Large Language Models | | 0 |
| RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy | | 0 |
| Symmetry-Reduced Physics-Informed Learning of Tensegrity Dynamics | | 0 |
| Steering Video Diffusion Transformers with Massive Activations | | 0 |
| TINA: Text-Free Inversion Attack for Unlearned Text-to-Image Diffusion Models | | 0 |
| CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents | | 0 |
| Generative Control as Optimization: Time Unconditional Flow Matching for Adaptive and Robust Robotic Control | | 0 |
| Verification and Validation of Physics-Informed Surrogate Component Models for Dynamic Power-System Simulation | | 0 |
| The Silent Thought: Modeling Internal Cognition in Full-Duplex Spoken Dialogue Models via Latent Reasoning | | 0 |
| How do LLMs Compute Verbal Confidence | | 0 |
| Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs | | 0 |
| Edit Spillover as a Probe: Do Image Editing Models Implicitly Understand World Relations? | | 0 |
| AI-Assisted Goal Setting Improves Goal Progress Through Social Accountability | | 0 |
| Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation | | 0 |
| RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference | | 0 |
| scicode-lint: Detecting Methodology Bugs in Scientific Python Code with LLM-Generated Patterns | | 0 |
| A Creative Agent is Worth a 64-Token Template | | 0 |