SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers257,923 code links4,818 tasks

Papers

Showing 51100 of 658356 papers

TitleStatusHype
FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement0
LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment0
ParallelVLM: Lossless Video-LLM Acceleration with Visual Alignment Aware Parallel Speculative Decoding0
Demonstrations, CoT, and Prompting: A Theoretical Analysis of ICL0
OrbitNVS: Harnessing Video Diffusion Priors for Novel View Synthesis0
CAF-Score: Calibrating CLAP with LALMs for Reference-free Audio Captioning EvaluationCode0
UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair0
On Performance Guarantees for Federated Learning with Personalized Constraints0
DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management0
Disentangle-then-Align: Non-Iterative Hybrid Multimodal Image Registration via Cross-Scale Feature Disentanglement0
IUP-Pose: Decoupled Iterative Uncertainty Propagation for Real-time Relative Pose Regression via Implicit Dense Alignment v10
On the role of memorization in learned priors for geophysical inverse problems0
Alternating Diffusion for Proximal Sampling with Zeroth Order Queries0
MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking0
BEAVER: A Training-Free Hierarchical Prompt Compression Method via Structure-Aware Page Selection0
RiboSphere: Learning Unified and Efficient Representations of RNA Structures0
UniBioTransfer: A Unified Framework for Multiple Biometrics Transfer0
HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning0
OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework0
Heavy-Tailed and Long-Range Dependent Noise in Stochastic Approximation: A Finite-Time Analysis0
PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization0
Ensembles-based Feature Guided Analysis0
GravCal: Single-Image Calibration of IMU Gravity Priors with Per-Sample Confidence0
Model Selection and Parameter Estimation of Multi-dimensional Gaussian Mixture Model0
CS-MUNet: A Channel-Spatial Dual-Stream Mamba Network for Multi-Organ Segmentation0
The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer InferenceCode0
Toward High-Fidelity Visual Reconstruction: From EEG-Based Conditioned Generation to Joint-Modal Guided Rebuilding0
Structured Prompting for Arabic Essay Proficiency: A Trait-Centric Evaluation Approach0
Scale-Dependent Radial Geometry and Metric Mismatch in Wasserstein Propagation for Reverse Diffusion0
Making Video Models Adhere to User Intent with Minor Adjustments0
ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models0
GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems0
WorldAgents: Can Foundation Image Models be Agents for 3D World Models?0
3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction0
Ontology-Based Knowledge Modeling and Uncertainty-Aware Outdoor Air Quality Assessment Using Weighted Interval Type-2 Fuzzy Logic0
TSegAgent: Zero-Shot Tooth Segmentation via Geometry-Aware Vision-Language Agents0
Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits0
DataProphet: Demystifying Supervision Data Generalization in Multimodal LLMs0
A Unified Phase-native Computational Principle Governs Hippocampal Spike Timing and Neural Coding0
Minimax and Adaptive Covariance Matrix Estimation under Differential Privacy0
EvoTaxo: Building and Evolving Taxonomy from Social Media Streams0
Learning from Similarity/Dissimilarity and Pairwise Comparison0
LoopRPT: Reinforcement Pre-Training for Looped Language Models0
Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification0
FedRG: Unleashing the Representation Geometry for Federated Learning with Noisy Clients0
PerformRecast: Expression and Head Pose Disentanglement for Portrait Video Editing0
PoC: Performance-oriented Context Compression for Large Language Models via Performance Prediction0
A two-step sequential approach for hyperparameter selection in finite context models0
MOSS-TTSD: Text to Spoken Dialogue Generation0
FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment0
Show:102550
← PrevPage 2 of 13168Next →