SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 59516000 of 661570 papers

TitleStatusHype
GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding0
EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment0
EgoVITA: Learning to Plan and Verify for Egocentric Video Reasoning0
Towards High-Fidelity Gaussian Splatting with Queried-Convolution Neural Networks0
CHIPS: Efficient CLIP Adaptation via Curvature-aware Hybrid Influence-based Data Selection0
Kaleidoscopic Scintillation Event Imaging0
Nudging Hidden States: Training-Free Model Steering for Chain-of-Thought Reasoning in Large Audio-Language Models0
Chorus: Harmonizing Context and Sensing Signals for Data-Free Model Customization in IoT0
CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal Reasoning0
Task Arithmetic with Support Languages for Low-Resource ASR0
Parametrized Sharing for Multi-Agent Hybrid DRL for Multiple Multi-Functional RISs-Aided Downlink NOMA Networks0
MultiSessionCollab: Learning User Preferences with Memory to Improve Long-Term Collaboration0
Stable Differentiable Modal Synthesis for Learning Nonlinear Dynamics0
CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning0
Multimodal Rumor Detection Enhanced by External Evidence and Forgery Features0
Whisper-RIR-Mega: A Paired Clean-Reverberant Speech Benchmark for ASR Robustness to Room Acoustics0
Continual GUI Agents0
Denoising the Deep Sky: Physics-Based CCD Noise Formation for Astronomical Imaging0
HoRD: Robust Humanoid Control via History-Conditioned Reinforcement Learning and Online Distillation0
SDFed: Bridging Local Global Discrepancy via Subspace Refinement and Divergence Control in Federated Prompt Learning0
Estimating condition number with Graph Neural Networks0
Narrow Fine-Tuning Erodes Safety Alignment in Vision-Language Agents0
LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration0
Send Less, Perceive More: Masked Quantized Point Cloud Communication for Loss-Tolerant Collaborative Perception0
SemanticDialect: Semantic-Aware Mixed-Format Quantization for Video Diffusion Transformers0
FocusTrack: One-Stage Focus-and-Suppress Framework for 3D Point Cloud Object Tracking0
Multi-Condition Digital Twin Calibration for Axial Piston Pumps : Compound Fault Simulation0
A Comprehensive Evaluation of LLM Unlearning Robustness under Multi-Turn Interaction0
RMBench: Memory-Dependent Robotic Manipulation Benchmark with Insights into Policy Design0
On Google's SynthID-Text LLM Watermarking System: Theoretical Analysis and Empirical Validation0
A Hypertoroidal Covering for Perfect Color Equivariance0
ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation0
Frequency-Separable Hamiltonian Neural Network for Multi-Timescale Dynamics0
TrajPred: Trajectory-Conditioned Joint Embedding Prediction for Surgical Instrument-Tissue Interaction Recognition in Vision-Language Models0
Single Image Super-Resolution via Bivariate `A Trous Wavelet Diffusion0
Agora: Teaching the Skill of Consensus-Finding with AI Personas Grounded in Human Voice0
Proxy-Guided Measurement Calibration0
OrthoFormer: Instrumental Variable Estimation in Transformer Hidden States via Neural Control Functions0
Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness0
STRIDE: Structured Lagrangian and Stochastic Residual Dynamics via Flow Matching0
Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers0
LuxBorrow: From Pompier to Pompjee, Tracing Borrowing in Luxembourgish0
Kernel Tests of Equivalence0
Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI0
Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution0
KEPo: Knowledge Evolution Poison on Graph-based Retrieval-Augmented Generation0
MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?0
Data-Driven Physics Embedded Dynamics with Predictive Control and Reinforcement Learning for Quadrupeds0
Beyond Means: Topological Causal Effects under Persistent-Homology Ignorability0
Balancing Multimodal Domain Generalization via Gradient Modulation and Projection0
Show:102550
← PrevPage 120 of 13232Next →