SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 78017850 of 661570 papers

TitleStatusHype
MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion0
Predicting kernel regression learning curves from only raw data statistics0
UltraGen: Efficient Ultra-High-Resolution Image Generation with Hierarchical Local Attention0
CEFR-Annotated WordNet: LLM-Based Proficiency-Guided Semantic Database for Language Learning0
KV Cache Transform Coding for Compact Storage in LLM Inference0
DeepEyesV2: Toward Agentic Multimodal Model5
D-GAP: Improving Out-of-Domain Robustness via Dataset-Agnostic and Gradient-Guided Augmentation in Frequency and Pixel Spaces0
MediRound: Multi-Round Entity-Level Reasoning Segmentation in Medical ImagesCode0
Hierarchical Dual-Strategy Unlearning for Biomedical and Healthcare Intelligence Using Imperfect and Privacy-Sensitive Medical Data0
TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models0
Partially Equivariant Reinforcement Learning in Symmetry-Breaking Environments0
PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations0
Pretrained battery transformer (PBT): A foundation model for universal battery life prediction0
Enhancing Tree Species Classification: Insights from YOLOv8 and Explainable AI Applied to TLS Point Cloud Projections0
Data relativistic uncertainty framework for low-illumination anime scenery image enhancement0
The Bayesian Geometry of Transformer Attention0
Cosmos-H-Surgical: Learning Surgical Robot Policies from Videos via World Modeling0
Geometric Scaling of Bayesian Inference in LLMs0
Inferring Clinically Relevant Molecular Subtypes of Pancreatic Cancer from Routine Histopathology Using Deep Learning0
Over-Searching in Search-Augmented Large Language Models0
Burn-After-Use for Preventing Data Leakage through a Secure Multi-Tenant Architecture in Enterprise LLM0
Get away with less: Need of source side data curation to build parallel corpus for low resource Machine Translation0
Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents0
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning0
PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction0
ResearchEnvBench: Benchmarking Agents on Environment Synthesis for Research Code Execution0
Emergence of Distortions in High-Dimensional Guided Diffusion Models0
Position: Beyond Model-Centric Prediction -- Agentic Time Series Forecasting0
Moving On, Even When You're Broken: Fail-Active Trajectory Generation via Diffusion Policies Conditioned on Embodiment and Task0
WebAccessVL: Violation-Aware VLM for Web Accessibility0
KVSmooth: Mitigating Hallucination in Multi-modal Large Language Models through Key-Value Smoothing0
Universality of General Spiked Tensor Models0
BLITZRANK: Principled Zero-shot Ranking Agents with Tournament Graphs0
UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^128 for Unified Multimodal Large Language Model0
TikArt: Stabilizing Aperture-Guided Fine-Grained Visual Reasoning with Reinforcement Learning0
LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy0
Structured Bitmap-to-Mesh Triangulation for Geometry-Aware Discretization of Image-Derived Domains0
PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for Low-dose CT imaging0
SPIRAL: A Closed-Loop Framework for Self-Improving Action World Models via Reflective Planning Agents0
Active Value Querying to Minimize Additive Error in Subadditive Set Function Learning0
SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval0
Defensive Refusal Bias: How Safety Alignment Fails Cyber Defenders0
ToolRLA: Multiplicative Reward Decomposition for Tool-Integrated Agents0
SEED-SET: Scalable Evolving Experimental Design for System-level Ethical Testing0
AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth0
Mind the Way You Select Negative Texts: Pursuing the Distance Consistency in OOD Detection with VLMs0
BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation0
LaTeX Compilation: Challenges in the Era of LLMs0
BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning0
RACAS: Controlling Diverse Robots With a Single Agentic System0
Show:102550
← PrevPage 157 of 13232Next →