SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1535115400 of 474278 papers

TitleStatusHype
Hybrid Meta-learners for Estimating Heterogeneous Treatment EffectsCode0
Automatic Multi-View X-Ray/CT Registration Using Bone Substructure ContoursCode0
Variational Inference with Mixtures of Isotropic GaussiansCode0
Curriculum Learning for Biological Sequence Prediction: The Case of De Novo Peptide SequencingCode1
We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent SystemsCode0
Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts0
OneRec Technical Report0
CALM: Consensus-Aware Localized Merging for Multi-Task LearningCode0
Value-Free Policy Optimization via Reward PartitioningCode0
EUNIS Habitat Maps: Enhancing Thematic and Spatial Resolution for Europe through Machine LearningCode0
Meta-learning how to Share Credit among Macro-ActionsCode0
TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement LearningCode2
Simple is what you need for efficient and accurate medical image segmentationCode0
PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images0
RelTopo: Enhancing Relational Modeling for Driving Scene Topology Reasoning0
Characterizing Linguistic Shifts in Croatian News via Diachronic Word EmbeddingsCode0
COME: Adding Scene-Centric Forecasting Control to Occupancy World ModelCode1
IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized RecommendationCode0
MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering0
Flexible-length Text Infilling for Discrete Diffusion Models0
Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry0
UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions0
Audio-Visual Driven Compression for Low-Bitrate Talking Head Videos0
Understanding Learning Invariance in Deep Linear Networks0
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning0
Machine Learning-Driven Compensation for Non-Ideal Channels in AWG-Based FBG Interrogator0
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects0
Computational lower bounds in latent models: clustering, sparse-clustering, biclustering0
FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception0
Limited-Angle CBCT Reconstruction via Geometry-Integrated Cycle-domain Denoising Diffusion Probabilistic Models0
Instruction Following by Boosting Attention of Large Language Models0
Active Multimodal Distillation for Few-shot Action Recognition0
Intelligent Metasurface-Enabled Integrated Sensing and Communication: Unified Framework and Key Technologies0
ViT-NeBLa: A Hybrid Vision Transformer and Neural Beer-Lambert Framework for Single-View 3D Reconstruction of Oral Anatomy from Panoramic Radiographs0
Micro-macro Gaussian Splatting with Enhanced Scalability for Unconstrained Scene ReconstructionCode0
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning AttentionCode7
Adversarial Disentanglement by Backpropagation with Physics-Informed Variational AutoencoderCode0
The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning0
Honesty in Causal Forests: When It Helps and When It Hurts0
SAGDA: Open-Source Synthetic Agriculture Data for AfricaCode0
Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error MaximizationCode0
Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models0
Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMsCode0
SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style TransferCode1
Multipole Attention for Efficient Long Context ReasoningCode0
Few-Shot Learning for Industrial Time Series: A Comparative Analysis Using the Example of Screw-Fastening Process Monitoring0
Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations0
Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography0
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token SequencesCode3
Lost in the Mix: Evaluating LLM Understanding of Code-Switched TextCode0
Show:102550
← PrevPage 308 of 9486Next →