SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5101-5150 of 661570 papers

| Title | Status | Hype |
| --- | --- | --- |
| Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition | | 0 |
| Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation | | 0 |
| Rethinking Reward Signals in Video GRPO: When Scores Become Targets | | 0 |
| Learning Topology-Driven Multi-Subspace Fusion for Grassmannian Deep Network | | 0 |
| FedSDWC: Federated Synergistic Dual-Representation Weak Causal Learning for OOD | | 0 |
| Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks | | 0 |
| Fusion Complexity Inversion: Why Simpler Cross View Modules Outperform SSMs and Cross View Attention Transformers for Pasture Biomass Regression | | 0 |
| Amnesia: Adversarial Semantic Layer Specific Activation Steering in Large Language Models | | 0 |
| DUCTILE: Agentic LLM Orchestration of Engineering Analysis in Product Development Practice | | 0 |
| Enhanced Atrial Fibrillation Prediction in ESUS Patients with Hypergraph-based Pre-training | | 0 |
| Lipschitz-Based Robustness Certification Under Floating-Point Execution | | 0 |
| WorldVLM: Combining World Model Forecasting and Vision-Language Reasoning | | 0 |
| Residual Stream Duality in Modern Transformer Architectures | | 0 |
| Shuffling the Stochastic Mirror Descent via Dual Lipschitz Continuity and Kernel Conditioning | | 0 |
| A Depth-Aware Comparative Study of Euclidean and Hyperbolic Graph Neural Networks on Bitcoin Transaction Systems | | 0 |
| Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective | | 0 |
| Human/AI Collective Intelligence for Deliberative Democracy: A Human-Centred Design Approach | | 0 |
| STARK: Spatio-Temporal Attention for Representation of Keypoints for Continuous Sign Language Recognition | | 0 |
| Language Models Don't Know What You Want: Evaluating Personalization in Deep Research Needs Real Users | | 0 |
| Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning | | 0 |
| EFF-Grasp: Energy-Field Flow Matching for Physics-Aware Dexterous Grasp Generation | | 0 |
| HIPO: Instruction Hierarchy via Constrained Reinforcement Learning | | 0 |
| Homogeneous and Heterogeneous Consistency progressive Re-ranking for Visible-Infrared Person Re-identification | | 0 |
| Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR | | 0 |
| Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation | | 0 |
| MOSAIC: Composable Safety Alignment with Modular Control Tokens | | 0 |
| How to Utilize Complementary Vision-Text Information for 2D Structure Understanding | | 0 |
| Physics-integrated neural differentiable modeling for immersed boundary systems | | 0 |
| FG-SGL: Fine-Grained Semantic Guidance Learning via Motion Process Decomposition for Micro-Gesture Recognition | | 0 |
| Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits | | 0 |
| Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification | | 0 |
| On-Policy Self-Distillation for Reasoning Compression | Code | 0 |
| Clinical Priors Guided Lung Disease Detection in 3D CT Scans | | 0 |
| Controllable Graph Generation with Diffusion Models via Inference-Time Tree Search Guidance | | 0 |
| Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes | | 0 |
| Muon Converges under Heavy-Tailed Noise: Nonconvex Hölder-Smooth Empirical Risk Minimization | | 0 |
| Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models | | 0 |
| Foundation-Model Surrogates Enable Data-Efficient Active Learning for Materials Discovery | | 0 |
| Alternating Gradient Flow Utility: A Unified Metric for Structural Pruning and Dynamic Routing in Deep Networks | | 0 |
| Content-Aware Mamba for Learned Image Compression | Code | 0 |
| SARMAE: Masked Autoencoder for SAR Representation Learning | Code | 0 |
| Urban Socio-Semantic Segmentation with Vision-Language Reasoning | Code | 0 |
| Power Analysis for Prediction-Powered Inference | Code | 0 |
| SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era | Code | 0 |
| PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space | Code | 0 |
| Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target Detection | Code | 0 |
| AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection | Code | 0 |
| MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation | Code | 0 |
| 3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion | Code | 0 |
| KEEP: A KV-Cache-Centric Memory Management System for Efficient Embodied Planning | Code | 0 |
Page 103 of 13232