SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers247,172 code links4,818 tasks

Papers

Showing 351400 of 658356 papers

TitleStatusHype
Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation0
Uniform a priori bounds and error analysis for the Adam stochastic gradient descent optimization method0
SynBullying: A Multi LLM Synthetic Conversational Dataset for Cyberbullying Detection0
Verifiable Semantics for Agent-to-Agent Communication0
Prompt Architecture Determines Reasoning Quality: A Variable Isolation Study on the Car Wash Problem0
Hallucination or Creativity: How to Evaluate AI-Generated Scientific Stories?0
Improving Spatial Allocation for Energy System Coupling with Graph Neural Networks0
Modality Equilibrium Matters: Minor-Modality-Aware Adaptive Alternating for Cross-Modal Memory Enhancement0
Auto-Annotation with Expert-Crafted Guidelines: A Study through 3D LiDAR Detection Benchmark0
Quantifying Student Success with Generative AI: A Monte Carlo Simulation Informed by Systematic Review0
Agentic Vehicles for Human-Centered Mobility: Definition, Prospects, and System Implications0
Unsupervised Learning for Inverse Problems in Computed Tomography0
Physics-informed neural network for predicting fatigue life of unirradiated and irradiated austenitic and ferritic/martensitic steels under reactor-relevant conditions0
CausalARC: Abstract Reasoning with Causal World Models0
GenCompositor: Generative Video Compositing with Diffusion Transformer2
Pi-transformer: A prior-informed dual-attention model for multivariate time-series anomaly detection0
Soft-Di[M]O: Improving One-Step Discrete Image Generation with Soft Embeddings0
Blind to Position, Biased in Language: Probing Mid-Layer Representational Bias in Vision-Language Encoders for Zero-Shot Language-Grounded Spatial Understanding0
Activation Quantization of Vision Encoders Needs Prefixing Registers0
Milco: Learned Sparse Retrieval Across Languages via a Multilingual Connector0
An Order-Sensitive Conflict Measure for Random Permutation Sets0
VeriEquivBench: An Equivalence Score for Ground-Truth-Free Evaluation of Formally Verifiable Code0
Unlocking 3D Affordance Segmentation with 2D Semantic Knowledge0
Towards more holistic interpretability: A lightweight disentangled Concept Bottleneck Model0
Membership Inference Attack against Large Language Model-based Recommendation Systems: A New Distillation-based Paradigm0
The Geometry of Dialogue: Graphing Language Models to Reveal Synergistic Teams for Multi-Agent Collaboration0
Do Language Models Associate Sound with Meaning? A Multimodal Study of Sound Symbolism0
Otter: Mitigating Background Distractions of Wide-Angle Few-Shot Action Recognition with Enhanced RWKV0
iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification0
FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching0
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout0
Learning to Self-Evolve0
Steering Awareness: Detecting Activation Steering from Within0
Cell-cell Communication Inference and Analysis: Biological Mechanisms, Computational Approaches, and Future Opportunities0
CMV-Fuse: Cross Modal-View Fusion of AMR, Syntax, and Knowledge Representations for Aspect Based Sentiment Analysis0
Heads collapse, features stay: Why Replay needs big buffers0
ClinicalTrialsHub: Bridging Registries and Literature for Comprehensive Clinical Trial Access0
GTAvatar: Bridging Gaussian Splatting and Texture Mapping for Relightable and Editable Gaussian Avatars0
Capturing reduced-order quantum many-body dynamics out of equilibrium via neural ordinary differential equations0
K-means with learned metrics0
Clipped Gradient Methods for Nonsmooth Convex Optimization under Heavy-Tailed Noise: A Refined Analysis0
Efficient Reasoning with Balanced Thinking2
Weights to Code: Extracting Interpretable Algorithms from the Discrete Transformer0
DeeperBrain: A Neuro-Grounded EEG Foundation Model Towards Universal BCI0
Studying the Role of Synthetic Data for Machine Learning-based Wireless Networks Traffic Forecasting0
Forest-Chat: Adapting Vision-Language Agents for Interactive Forest Change Analysis0
Mixed-Precision Training and Compilation for RRAM-based Computing-in-Memory Accelerators0
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models2
Koopman Autoencoders with Continuous-Time Latent Dynamics for Fluid Dynamics Forecasting0
STELLAR: Structure-guided LLM Assertion Retrieval and Generation for Formal Verification0
Show:102550
← PrevPage 8 of 13168Next →