SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 35513600 of 659983 papers

TitleStatusHype
Language as a Wave Phenomenon: Semantic Phase Locking and Interference in Neural Networks0
Fusion Complexity Inversion: Why Simpler Cross View Modules Outperform SSMs and Cross View Attention Transformers for Pasture Biomass Regression0
Amnesia: Adversarial Semantic Layer Specific Activation Steering in Large Language Models0
DUCTILE: Agentic LLM Orchestration of Engineering Analysis in Product Development Practice0
Enhanced Atrial Fibrillation Prediction in ESUS Patients with Hypergraph-based Pre-training0
Lipschitz-Based Robustness Certification Under Floating-Point Execution0
WorldVLM: Combining World Model Forecasting and Vision-Language Reasoning0
Residual Stream Duality in Modern Transformer Architectures0
Shuffling the Stochastic Mirror Descent via Dual Lipschitz Continuity and Kernel Conditioning0
A Depth-Aware Comparative Study of Euclidean and Hyperbolic Graph Neural Networks on Bitcoin Transaction Systems0
Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective0
Human/AI Collective Intelligence for Deliberative Democracy: A Human-Centred Design Approach0
STARK: Spatio-Temporal Attention for Representation of Keypoints for Continuous Sign Language Recognition0
Language Models Don't Know What You Want: Evaluating Personalization in Deep Research Needs Real Users0
Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning0
EFF-Grasp: Energy-Field Flow Matching for Physics-Aware Dexterous Grasp Generation0
HIPO: Instruction Hierarchy via Constrained Reinforcement Learning0
Homogeneous and Heterogeneous Consistency progressive Re-ranking for Visible-Infrared Person Re-identification0
Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR0
Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation0
MOSAIC: Composable Safety Alignment with Modular Control Tokens0
How to Utilize Complementary Vision-Text Information for 2D Structure Understanding0
Physics-integrated neural differentiable modeling for immersed boundary systems0
FG-SGL: Fine-Grained Semantic Guidance Learning via Motion Process Decomposition for Micro-Gesture Recognition0
Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits0
Overview of the CXR-LT 2026 Challenge: Multi-Center Long-Tailed and Zero Shot Chest X-ray Classification0
On-Policy Self-Distillation for Reasoning CompressionCode0
Clinical Priors Guided Lung Disease Detection in 3D CT Scans0
Controllable Graph Generation with Diffusion Models via Inference-Time Tree Search Guidance0
Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes0
Muon Converges under Heavy-Tailed Noise: Nonconvex Hölder-Smooth Empirical Risk Minimization0
Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models0
Foundation-Model Surrogates Enable Data-Efficient Active Learning for Materials Discovery0
Alternating Gradient Flow Utility: A Unified Metric for Structural Pruning and Dynamic Routing in Deep Networks0
Content-Aware Mamba for Learned Image CompressionCode0
SARMAE: Masked Autoencoder for SAR Representation LearningCode0
Urban Socio-Semantic Segmentation with Vision-Language ReasoningCode0
Power Analysis for Prediction-Powered InferenceCode0
SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM EraCode0
PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding SpaceCode0
Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target DetectionCode0
AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object DetectionCode0
MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied NavigationCode0
3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal DiffusionCode0
KEEP: A KV-Cache-Centric Memory Management System for Efficient Embodied PlanningCode0
ReFORM: Review-aggregated Profile Generation via LLM with Multi-Factor Attention for Restaurant RecommendationCode0
ERGO: Efficient High-Resolution Visual Understanding for Vision-Language ModelsCode0
AGRAG: Advanced Graph-based Retrieval-Augmented Generation for LLMsCode0
MemPO: Self-Memory Policy Optimization for Long-Horizon AgentsCode0
Integrating Weather Foundation Model and Satellite to Enable Fine-Grained Solar Irradiance ForecastingCode0
Show:102550
← PrevPage 72 of 13200Next →