SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 18511900 of 659983 papers

TitleStatusHype
Holter-to-Sleep: AI-Enabled Repurposing of Single-Lead ECG for Sleep Phenotyping0
Learning Consistent Temporal Grounding between Related Tasks in Sports Coaching0
Look Before You Fuse: 2D-Guided Cross-Modal Alignment for Robust 3D Detection0
AgroCoT: A Chain-of-Thought Benchmark for Evaluating Reasoning in Vision-Language Models for Agriculture0
Self-Tuning Sparse Attention: Multi-Fidelity Hyperparameter Optimization for Transformer Acceleration0
SJD-PAC: Accelerating Speculative Jacobi Decoding via Proactive Drafting and Adaptive Continuation0
Fast and Interpretable Autoregressive Estimation with Neural Network Backpropagation0
SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation0
SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits0
Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation0
Uniform a priori bounds and error analysis for the Adam stochastic gradient descent optimization method0
SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detection0
Verifiable Semantics for Agent-to-Agent Communication0
Prompt Architecture Determines Reasoning Quality: A Variable Isolation Study on the Car Wash Problem0
Hallucination or Creativity: How to Evaluate AI-Generated Scientific Stories?0
Improving Spatial Allocation for Energy System Coupling with Graph Neural Networks0
Modality Equilibrium Matters: Minor-Modality-Aware Adaptive Alternating for Cross-Modal Memory Enhancement0
Auto-Annotation with Expert-Crafted Guidelines: A Study through 3D LiDAR Detection Benchmark0
Quantifying Student Success with Generative AI: A Monte Carlo Simulation Informed by Systematic Review0
Agentic Vehicles for Human-Centered Mobility: Definition, Prospects, and System Implications0
Unsupervised Learning for Inverse Problems in Computed Tomography0
Physics-informed neural network for predicting fatigue life of unirradiated and irradiated austenitic and ferritic/martensitic steels under reactor-relevant conditions0
CausalARC: Abstract Reasoning with Causal World Models0
GenCompositor: Generative Video Compositing with Diffusion Transformer2
Pi-transformer: A prior-informed dual-attention model for multivariate time-series anomaly detection0
Soft-Di[M]O: Improving One-Step Discrete Image Generation with Soft Embeddings0
Blind to Position, Biased in Language: Probing Mid-Layer Representational Bias in Vision-Language Encoders for Zero-Shot Language-Grounded Spatial Understanding0
Activation Quantization of Vision Encoders Needs Prefixing Registers0
Milco: Learned Sparse Retrieval Across Languages via a Multilingual Connector0
An Order-Sensitive Conflict Measure for Random Permutation Sets0
VeriEquivBench: An Equivalence Score for Ground-Truth-Free Evaluation of Formally Verifiable Code0
Unlocking 3D Affordance Segmentation with 2D Semantic Knowledge0
Towards more holistic interpretability: A lightweight disentangled Concept Bottleneck Model0
Membership Inference Attack against Large Language Model-based Recommendation Systems: A New Distillation-based Paradigm0
The Geometry of Dialogue: Graphing Language Models to Reveal Synergistic Teams for Multi-Agent Collaboration0
Do Language Models Associate Sound with Meaning? A Multimodal Study of Sound Symbolism0
Otter: Mitigating Background Distractions of Wide-Angle Few-Shot Action Recognition with Enhanced RWKV0
iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification0
FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching0
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout0
Learning to Self-Evolve0
Steering Awareness: Detecting Activation Steering from Within0
Cell-cell Communication Inference and Analysis: Biological Mechanisms, Computational Approaches, and Future Opportunities0
CMV-Fuse: Cross Modal-View Fusion of AMR, Syntax, and Knowledge Representations for Aspect Based Sentiment Analysis0
Heads collapse, features stay: Why Replay needs big buffers0
ClinicalTrialsHub: Bridging Registries and Literature for Comprehensive Clinical Trial Access0
GTAvatar: Bridging Gaussian Splatting and Texture Mapping for Relightable and Editable Gaussian Avatars0
Capturing reduced-order quantum many-body dynamics out of equilibrium via neural ordinary differential equations0
K-means with learned metrics0
Clipped Gradient Methods for Nonsmooth Convex Optimization under Heavy-Tailed Noise: A Refined Analysis0
Show:102550
← PrevPage 38 of 13200Next →