SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 64516500 of 661570 papers

TitleStatusHype
Malicious Agent Skills in the Wild: A Large-Scale Security Empirical Study0
SERFN: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows0
GraphSeek: Next-Generation Graph Analytics with LLMs0
On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs0
Dense Dynamic Scene Reconstruction and Camera Pose Estimation from Multi-View Videos0
RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting0
Aumann-SHAP: The Geometry of Counterfactual Interaction Explanations in Machine Learning0
Few Batches or Little Memory, But Not Both: Simultaneous Space and Adaptivity Constraints in Stochastic Bandits0
Multimodal Emotion Regression with Multi-Objective Optimization and VAD-Aware Audio Modeling for the 10th ABAW EMI Track0
Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion0
Your Vision-Language-Action Model Already Has Attention Heads For Path Deviation Detection0
Computation and Communication Efficient Federated Unlearning via On-server Gradient Conflict Mitigation and Expression0
PMIScore: An Unsupervised Approach to Quantify Dialogue Engagement0
Prototypical Exemplar Condensation for Memory-efficient Online Continual Learning0
Efficient Semi-Automated Material Microstructure Analysis Using Deep Learning: A Case Study in Additive Manufacturing0
Exploring the Dimensions of a Variational Neuron0
TransDex: Pre-training Visuo-Tactile Policy with Point Cloud Reconstruction for Dexterous Manipulation of Transparent Objects0
SCoCCA: Multi-modal Sparse Concept Decomposition via Canonical Correlation Analysis0
GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent1
Discriminative Flow Matching Via Local Generative Predictors0
Chunk-Guided Q-Learning0
Bidirectional Cross-Attention Fusion of High-Res RGB and Low-Res HSI for Multimodal Automated Waste Sorting0
FLUX: Data Worth Training On0
Exploiting temporal parallelism for LSTM Autoencoder acceleration on FPGA0
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models0
U-Face: An Efficient and Generalizable Framework for Unsupervised Facial Attribute Editing via Subspace Learning0
EI-Part: Explode for Completion and Implode for Refinement0
Benchmarking Open-Source PPG Foundation Models for Biological Age Prediction0
Gated Graph Attention Networks for Predicting Duration of Large Scale Power Outages Induced by Natural Disasters0
MotionCFG: Boosting Motion Dynamics via Stochastic Concept Perturbation0
Enhancing Eye Feature Estimation from Event Data Streams through Adaptive Inference State Space Modeling0
Low-Field Magnetic Resonance Image Quality Enhancement using Undersampled k-Space and Out-of-Distribution Generalisation0
The Institutional Scaling Law: Non-Monotonic Fitness, Capability-Trust Divergence, and Symbiogenetic Scaling in Generative AI0
Seeing Through the PRISM: Compound & Controllable Restoration of Scientific Images0
Point of Order: Action-Aware LLM Persona Modeling for Realistic Civic Simulation0
A Grammar of Machine Learning Workflows0
Generate Then Correct: Single Shot Global Correction for Aspect Sentiment Quad Prediction0
Post-hoc Stochastic Concept Bottleneck Models0
Can We Trust LLMs on Memristors? Diving into Reasoning Ability under Non-Ideality0
Conditioning on a Volatility Proxy Compresses the Apparent Timescale of Collective Market Correlation0
Self-Supervised Uncertainty Estimation For Super-Resolution of Satellite Images0
Enhancing Mental Health Classification with Layer-Attentive Residuals and Contrastive Feature Learning0
Machine Learning Detection of Lithium Plating in Lithium-ion Cells: A Gaussian Process Approach0
The Law-Following AI Framework: Legal Foundations and Technical Constraints. Legal Analogues for AI Actorship and technical feasibility of Law Alignment0
FMS^2: Unified Flow Matching for Segmentation and Synthesis of Thin Structures0
MaP: A Unified Framework for Reliable Evaluation of Pre-training Dynamics0
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation0
Scorio.jl: A Julia package for ranking stochastic responses0
UniOD: A Universal Model for Outlier Detection across Diverse DomainsCode0
CARE: Contrastive Alignment for ADL Recognition from Event-Triggered Sensor StreamsCode0
Show:102550
← PrevPage 130 of 13232Next →