SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 63016325 of 474278 papers

TitleStatusHype
VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability DetectionCode0
MoCoRP: Modeling Consistent Relations between Persona and Response for Persona-based DialogueCode0
HalluShift++: Bridging Language and Vision through Internal Representation Shifts for Hierarchical Hallucinations in MLLMsCode0
UltrasODM: A Dual Stream Optical Flow Mamba Network for 3D Freehand Ultrasound ReconstructionCode0
Auditing Games for SandbaggingCode0
Synchrony-Gated Plasticity with Dopamine Modulation for Spiking Neural NetworksCode0
Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral SignaturesCode0
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior RegularizationCode0
A Biophysically-Conditioned Generative Framework for 3D Brain Tumor MRI SynthesisCode0
DCoAR: Deep Concept Injection into Unified Autoregressive Models for Personalized Text-to-Image GenerationCode0
Image2Net: Datasets, Benchmark and Hybrid Framework to Convert Analog Circuit Diagrams into NetlistsCode0
Coefficients-Preserving Sampling for Reinforcement Learning with Flow MatchingCode0
An Adaptive Resonance Theory-based Topological Clustering Algorithm with a Self-Adjusting Vigilance ParameterCode0
DZ-TDPO: Non-Destructive Temporal Alignment for Mutable State Tracking in Long-Context DialogueCode0
Understanding Diffusion Models via Code ExecutionCode0
Less is More: Non-uniform Road Segments are Efficient for Bus Arrival PredictionCode0
Training Language Models to Use Prolog as a ToolCode0
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMsCode0
A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for ClassificationCode0
Decomposition Sampling for Efficient Region Annotations in Active LearningCode0
PVeRA: Probabilistic Vector-Based Random Matrix AdaptationCode0
DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous DrivingCode0
Distribution Matching Variational AutoEncoderCode0
SSplain: Sparse and Smooth Explainer for Retinopathy of Prematurity ClassificationCode0
GlimmerNet: A Lightweight Grouped Dilated Depthwise Convolutions for UAV-Based Emergency MonitoringCode0
Show:102550
← PrevPage 253 of 18972Next →