SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1145111500 of 661570 papers

TitleStatusHype
Fine-grained Image Aesthetic Assessment: Learning Discriminative Scores from Relative Ranks0
GIPO: Gaussian Importance Sampling Policy Optimization0
Selecting Offline Reinforcement Learning Algorithms for Stochastic Network Control0
Inference-Time Toxicity Mitigation in Protein Language Models0
Slice-wise quality assessment of high b-value breast DWI via deep learning-based artifact detection0
Spatial Causal Prediction in Video0
Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models0
RVN-Bench: A Benchmark for Reactive Visual Navigation0
Structural Action Transformer for 3D Dexterous Manipulation0
ProFound: A moderate-sized vision foundation model for multi-task prostate imaging0
TFWaveFormer: Temporal-Frequency Collaborative Multi-level Wavelet Transformer for Dynamic Link Prediction0
Scaling Dense Event-Stream Pretraining from Visual Foundation Models0
Generative AI in Managerial Decision-Making: Redefining Boundaries through Ambiguity Resolution and Sycophancy Analysis0
Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI0
Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction0
Phi-4-reasoning-vision-15B Technical Report2
Right in Time: Reactive Reasoning in Regulated Traffic Spaces0
Degradation-based augmented training for robust individual animal re-identification0
When Visual Evidence is Ambiguous: Pareidolia as a Diagnostic Probe for Vision Models0
Specialization of softmax attention heads: insights from the high-dimensional single-location model0
Spectral Surgery: Training-Free Refinement of LoRA via Gradient-Guided Singular Value Reweighting0
Training-Free Rate-Distortion-Perception Traversal With Diffusion0
Fixed-Budget Constrained Best Arm Identification in Grouped Bandits0
Continuous Modal Logical Neural Networks: Modal Reasoning via Stochastic Accessibility0
Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus for Ambiguous Medical Image Segmentation0
Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback0
Multi-Stage Music Source Restoration with BandSplit-RoFormer Separation and HiFi++ GAN0
The Empty Quadrant: AI Teammates for Embodied Field Learning0
DQE-CIR: Distinctive Query Embeddings through Learnable Attribute Weights and Target Relative Negative Sampling in Composed Image Retrieval0
Long-Term Visual Localization in Dynamic Benthic Environments: A Dataset, Footprint-Based Ground Truth, and Visual Place Recognition Benchmark0
Sim2Sea: Sim-to-Real Policy Transfer for Maritime Vessel Navigation in Congested Waters0
Fermi-Dirac thermal measurements: A framework for quantum hypothesis testing and semidefinite optimization0
Bayesian Adversarial Privacy0
FedCova: Robust Federated Covariance Learning Against Noisy Labels0
Tuning Just Enough: Lightweight Backdoor Attacks on Multi-Encoder Diffusion Models0
Monitoring Emergent Reward Hacking During Generation via Internal Activations0
SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling0
Revisiting the Role of Foundation Models in Cell-Level Histopathological Image Analysis under Small-Patch Constraints -- Effects of Training Data Scale and Blur Perturbations on CNNs and Vision Transformers0
Hindsight Quality Prediction Experiments in Multi-Candidate Human-Post-Edited Machine Translation0
End-to-end event reconstruction for precision physics at future colliders0
EgoPoseFormer v2: Accurate Egocentric Human Motion Estimation for AR/VR0
Lyapunov Stability of Stochastic Vector Optimization: Theory and Numerical Implementation0
Understanding Sources of Demographic Predictability in Brain MRI via Disentangling Anatomy and Contrast0
TextBoost: Boosting Scene Text Fidelity in Ultra-low Bitrate Image Compression0
Any2Any: Unified Arbitrary Modality Translation for Remote SensingCode0
FINEST: Improving LLM Responses to Sensitive Topics Through Fine-Grained Evaluation0
BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning0
A Baseline Study and Benchmark for Few-Shot Open-Set Action Recognition with Feature Residual Discrimination0
Data-Aware Random Feature Kernel for Transformers0
Mask-Guided Attention Regulation for Anatomically Consistent Counterfactual CXR Synthesis0
Show:102550
← PrevPage 230 of 13232Next →