SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1000110050 of 661570 papers

TitleStatusHype
Localizing and Correcting Errors for LLM-based Planners0
Uncertainty-Aware Subset Selection for Robust Visual Explainability under Distribution Shifts0
Photo3D: Advancing Photorealistic 3D Generation through Structure-Aligned Detail Enhancement0
LLMTM: Benchmarking and Optimizing LLMs for Temporal Motif Analysis in Dynamic Graphs0
Spatial4D-Bench: A Versatile 4D Spatial Intelligence Benchmark0
Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation0
Beyond Mapping : Domain-Invariant Representations via Spectral Embedding of Optimal Transport Plans0
SRA 2: Variational Autoencoder Self-Representation Alignment for Efficient Diffusion Training0
Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional ReasoningCode0
Stochastic Parroting in Temporal Attention -- Regulating the Diagonal Sink0
EDIS: Diagnosing LLM Reasoning via Entropy Dynamics0
FARTrack: Fast Autoregressive Visual Tracking with High Performance0
SWE-MiniSandbox: Container-Free Reinforcement Learning for Building Software Engineering Agents0
An Adaptive Model Selection Framework for Demand Forecasting under Horizon-Induced Degradation to Support Business Strategy and Operations0
GaiaFlow: Semantic-Guided Diffusion Tuning for Carbon-Frugal Search0
IntelliAsk: Learning to Ask High-Quality Research Questions via RLVR0
Robust Self-Supervised Cross-Modal Super-Resolution against Real-World Misaligned Observations0
StoryTailor:A Zero-Shot Pipeline for Action-Rich Multi-Subject Visual Narratives0
UniVBench: Towards Unified Evaluation for Video Foundation Models1
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization0
Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos0
How Well Does Agent Development Reflect Real-World Work?0
CoME: Empowering Channel-of-Mobile-Experts with Informative Hybrid-Capabilities Reasoning0
Weight Updates as Activation Shifts: A Principled Framework for Steering0
Adaptive Dynamic Dehazing via Instruction-Driven and Task-Feedback Closed-Loop Optimization for Diverse Downstream Task Adaptation0
Multimodal Mixture-of-Experts with Retrieval Augmentation for Protein Active Site Identification0
"When to Hand Off, When to Work Together": Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction0
Rigidity-Aware Geometric Pretraining for Protein Design and Conformational Ensembles0
VSearcher: Long-Horizon Multimodal Search Agent via Reinforcement Learning0
Good-Enough LLM Obfuscation (GELO)0
A Persistent-State Dataflow Accelerator for Memory-Bound Linear Attention Decode on FPGA0
MOSIV: Multi-Object System Identification from Videos0
Implicit Style Conditioning: A Structured Style-Rewrite Framework for Low-Resource Character Modeling0
XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable Insights0
Unify the Views: View-Consistent Prototype Learning for Few-Shot SegmentationCode0
Who We Are, Where We Are: Mental Health at the Intersection of Person, Situation, and Large Language Models0
Domain-Adaptive Model Merging across Disconnected Modes0
An Interactive Multi-Agent System for Evaluation of New Product Concepts0
Skeleton-to-Image Encoding: Enabling Skeleton Representation Learning via Vision-Pretrained Models0
PROBE: Probabilistic Occupancy BEV Encoding with Analytical Translation Robustness for 3D Place Recognition0
Agnostic learning in (almost) optimal time via Gaussian surface area0
Breaking Smooth-Motion Assumptions: A UAV Benchmark for Multi-Object Tracking in Complex and Adverse Conditions0
Technical Report: Automated Optical Inspection of Surgical Instruments0
Diffusion Language Models Are Natively Length-Aware0
Stem: Rethinking Causal Information Flow in Sparse Attention0
MM-ISTS: Cooperating Irregularly Sampled Time Series Forecasting with Multimodal Vision-Text LLMs0
Sensitivity-Aware Retrieval-Augmented Intent Clarification0
RePer-360: Releasing Perspective Priors for 360^ Depth Estimation via Self-Modulation0
Restoring Linguistic Grounding in VLA Models via Train-Free Attention Recalibration0
Demystifying KAN for Vision Tasks: The RepKAN Approach0
Show:102550
← PrevPage 201 of 13232Next →