SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 19511975 of 661570 papers

TitleStatusHype
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning0
Failure of contextual invariance in gender inference with large language models0
TETO: Tracking Events with Teacher Observation for Motion Estimation and Frame Interpolation0
One View Is Enough! Monocular Training for In-the-Wild Novel View Generation0
AgentRVOS: Reasoning over Object Tracks for Zero-Shot Referring Video Object Segmentation0
Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation0
LLM Inference at the Edge: Mobile, NPU, and GPU Performance Efficiency Trade-offs Under Sustained Load0
Bio-Inspired Event-Based Visual Servoing for Ground Robots0
AdvSplat: Adversarial Attacks on Feed-Forward Gaussian Splatting Models0
CoRe: Joint Optimization with Contrastive Learning for Medical Image Registration0
The Diminishing Returns of Early-Exit Decoding in Modern LLMs0
An In-Depth Study of Filter-Agnostic Vector Search on a PostgreSQL Database System: [Experiments and Analysis]0
Mind the Hitch: Dynamic Calibration and Articulated Perception for Autonomous Trucks0
LLMs Do Not Grade Essays Like Humans0
CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records0
Semantic Iterative Reconstruction: One-Shot Universal Anomaly Detection0
AI-driven Intent-Based Networking Approach for Self-configuration of Next Generation Networks0
Human-in-the-Loop Pareto Optimization: Trade-off Characterization for Assist-as-Needed Training and Performance Evaluation0
Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters0
Latent Algorithmic Structure Precedes Grokking: A Mechanistic Study of ReLU MLPs on Modular Arithmetic0
Retinal Disease Classification from Fundus Images using CNN Transfer Learning0
Digital Twin-Assisted Measurement Design and Channel Statistics Prediction0
Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track0
The Cognitive Firewall:Securing Browser Based AI Agents Against Indirect Prompt Injection Via Hybrid Edge Cloud Defense0
PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning0
Show:102550
← PrevPage 79 of 26463Next →