SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7351-7400 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| On the Role of Reversible Instance Normalization | | 0 |
| Derain-Agent: A Plug-and-Play Agent Framework for Rainy Image Restoration | | 0 |
| An Evolutionary Algorithm with Probabilistic Annealing for Large-scale Sparse Multi-objective Optimization | | 0 |
| Understanding LLM Behavior When Encountering User-Supplied Harmful Content in Harmless Tasks | | 0 |
| Chem4DLLM: 4D Multimodal LLMs for Chemical Dynamics Understanding | | 0 |
| Prototype-Based Knowledge Guidance for Fine-Grained Structured Radiology Reporting | | 0 |
| SNAP-V: A RISC-V SoC with Configurable Neuromorphic Acceleration for Small-Scale Spiking Neural Networks | | 0 |
| Exhaustive Circuit Mapping of a Single-Cell Foundation Model Reveals Massive Redundancy, Heavy-Tailed Hub Architecture, and Layer-Dependent Differentiation Control | | 0 |
| Causal Matrix Completion under Multiple Treatments via Mixed Synthetic Nearest Neighbors | | 0 |
| Effective Resistance Rewiring: A Simple Topological Correction for Over-Squashing | | 0 |
| Learning Transferable Sensor Models via Language-Informed Pretraining | | 0 |
| Uncovering Locally Low-dimensional Structure in Networks by Locally Optimal Spectral Embedding | | 0 |
| Statistical and structural identifiability in representation learning | | 0 |
| Ada3Drift: Adaptive Training-Time Drifting for One-Step 3D Visuomotor Robotic Manipulation | | 0 |
| On-Average Stability of Multipass Preconditioned SGD and Effective Dimension | | 0 |
| Decentralized Orchestration Architecture for Fluid Computing: A Secure Distributed AI Use Case | | 0 |
| CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation | | 0 |
| Can RL Improve Generalization of LLM Agents? An Empirical Study | | 0 |
| Deep Learning-Based Metamodeling of Nonlinear Stochastic Dynamic Systems under Parametric and Predictive Uncertainty | | 0 |
| Flowcean - Model Learning for Cyber-Physical Systems | | 0 |
| Nyxus: A Next Generation Image Feature Extraction Library for the Big Data and AI Era | | 0 |
| Sim-to-reality adaptation for Deep Reinforcement Learning applied to an underwater docking application | | 0 |
| Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems | | 0 |
| Single Pixel Image Classification using an Ultrafast Digital Light Projector | | 0 |
| To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times | | 0 |
| EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation | | 0 |
| Frequentist Consistency of Prior-Data Fitted Networks for Causal Inference | | 0 |
| Slow-Fast Inference: Training-Free Inference Acceleration via Within-Sentence Support Stability | | 0 |
| Translationese as a Rational Response to Translation Task Difficulty | | 0 |
| A Robust and Efficient Multi-Agent Reinforcement Learning Framework for Traffic Signal Control | | 0 |
| Continual Learning with Vision-Language Models via Semantic-Geometry Preservation | | 0 |
| Coarse-Guided Visual Generation via Weighted h-Transform Sampling | | 1 |
| Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics | | 0 |
| Human-Centred LLM Privacy Audits: Findings and Frictions | | 0 |
| Wasserstein Gradient Flows for Batch Bayesian Optimal Experimental Design | | 0 |
| On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents | | 0 |
| Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives | | 0 |
| Increasing intelligence in AI agents can worsen collective outcomes | | 0 |
| Interpreting Contrastive Embeddings in Specific Domains with Fuzzy Rules | | 0 |
| HATS: Hardness-Aware Trajectory Synthesis for GUI Agents | | 0 |
| Automatic Generation of High-Performance RL Environments | | 0 |
| Linking Perception, Confidence and Accuracy in MLLMs | | 0 |
| IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL | | 0 |
| A Quantitative Characterization of Forgetting in Post-Training | | 0 |
| LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning | | 0 |
| BehaviorVLM: Unified Finetuning-Free Behavioral Understanding with Vision-Language Reasoning | | 0 |
| Long-Context Encoder Models for Polish Language Understanding | | 0 |
| SaPaVe: Towards Active Perception and Manipulation in Vision-Language-Action Models for Robotics | | 0 |
| Security Considerations for Artificial Intelligence Agents | | 0 |
| HiAP: A Multi-Granular Stochastic Auto-Pruning Framework for Vision Transformers | | 0 |

Page 148 of 13,232