SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1060110650 of 661570 papers

TitleStatusHype
Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement0
Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models0
MiTA Attention: Efficient Fast-Weight Scaling via a Mixture of Top-k Activations0
DDP-WM: Disentangled Dynamics Prediction for Efficient World Models0
Towards Exploratory and Focused Manipulation with Bimanual Active Perception: A New Problem, Benchmark and Strategy0
Supervised Metric Regularization Through Alternating Optimization for Multi-Regime Physics-Informed Neural Networks0
Learn from Your Mistakes: Self-Correcting Masked Diffusion Models0
Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search0
Bidirectional Temporal Dynamics Modeling for EEG-based Driving Fatigue Recognition0
Neural Network-Based Parameter Estimation of a Labour Market Agent-Based Model0
Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections0
Optimal training-conditional regret for online conformal prediction0
VoxKnesset: A Large-Scale Longitudinal Hebrew Speech Dataset for Aging Speaker Modeling0
SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework0
CityGuard: Graph-Aware Private Descriptors for Bias-Resilient Identity Search Across Urban Cameras0
Give Users the Wheel: Towards Promptable Recommendation Paradigm0
Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming0
cc-Shapley: Measuring Multivariate Feature Importance Needs Causal Context0
Diffusion Probe: Generated Image Result Prediction Using CNN Probes0
Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking0
DiffusionHarmonizer: Bridging Neural Reconstruction and Photorealistic Simulation with Online Diffusion Enhancer0
DMD-augmented Unpaired Neural Schrödinger Bridge for Ultra-Low Field MRI Enhancement0
AlignVAR: Towards Globally Consistent Visual Autoregression for Image Super-Resolution0
Dr.Occ: Depth- and Region-Guided 3D Occupancy from Surround-View Cameras for Autonomous Driving0
AutoSkill: Experience-Driven Lifelong Learning via Skill Self-Evolution0
Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics0
Learn Hard Problems During RL with Reference Guided Fine-tuning0
FreeAct: Freeing Activations for LLM Quantization0
Real Money, Fake Models: Deceptive Model Claims in Shadow APIs0
MultiShadow: Multi-Object Shadow Generation for Image Compositing via Diffusion Model0
Incremental Graph Construction Enables Robust Spectral Clustering of Texts0
A Dynamical Theory of Sequential Retrieval in Input-Driven Hopfield Networks0
stratum: A System Infrastructure for Massive Agent-Centric ML Workloads0
Why Are Linear RNNs More Parallelizable?0
Zero-Knowledge Proof (ZKP) Authentication for Offline CBDC Payment System Using IoT Devices0
Bielik-Q2-Sharp: A Comparative Study of Extreme 2-bit Quantization Methods for a Polish 11B Language Model0
A Unified Framework for Joint Detection of Lacunes and Enlarged Perivascular Spaces0
Gaussian Wardrobe: Compositional 3D Gaussian Avatars for Free-Form Virtual Try-On0
Interpretable Pre-Release Baseball Pitch Type Anticipation from Broadcast 3D Kinematics0
Revisiting an Old Perspective Projection for Monocular 3D Morphable Models Regression0
Why the Brain Consolidates: Predictive Forgetting for Optimal Generalisation0
When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper0
Generalizing Fair Top-k Selection: An Integrative Approach0
Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement0
Implicit Bias and Loss of Plasticity in Matrix Completion: Depth Promotes Low-Rankness0
Detection of Illicit Content on Online Marketplaces using Large Language Models0
SLO-Aware Compute Resource Allocation for Prefill-Decode Disaggregated LLM Inference0
AI-Assisted Moot Courts: Simulating Justice-Specific Questioning in Oral Arguments0
Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild0
Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery0
Show:102550
← PrevPage 213 of 13232Next →