SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 2651–2700 of 659,983 papers

| Title | Status | Hype |
| --- | --- | --- |
| Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress |  | 0 |
| FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions |  | 0 |
| A Progressive Visual-Logic-Aligned Framework for Ride-Hailing Adjudication |  | 0 |
| Grid Spatial Understanding: A Dataset for Textual Spatial Reasoning over Grids, Embodied Settings, and Coordinate Structures |  | 0 |
| Learning Permutation Distributions via Reflected Diffusion on Ranks |  | 0 |
| Argument Reconstruction as Supervision for Critical Thinking in LLMs |  | 0 |
| A 3D Reconstruction Benchmark for Asset Inspection |  | 0 |
| MCoT-MVS: Multi-level Vision Selection by Multi-modal Chain-of-Thought Reasoning for Composed Image Retrieval |  | 0 |
| Variational Kernel Design for Internal Noise: Gaussian Chaos Noise, Representation Compatibility, and Reliable Deep Learning |  | 0 |
| Material Magic Wand: Material-Aware Grouping of 3D Parts in Untextured Meshes |  | 0 |
| Understanding and Defending VLM Jailbreaks via Jailbreak-Related Representation Shift |  | 0 |
| SafeTutors: Benchmarking Pedagogical Safety in AI Tutoring Systems |  | 0 |
| Shot-Aware Frame Sampling for Video Understanding |  | 0 |
| Cohomological Obstructions to Global Counterfactuals: A Sheaf-Theoretic Foundation for Generative Causal Models |  | 0 |
| CRE-T1 Preview Technical Report: Beyond Contrastive Learning for Reasoning-Intensive Retrieval |  | 0 |
| Toward Phonology-Guided Sign Language Motion Generation: A Diffusion Baseline and Conditioning Analysis |  | 0 |
| Harnessing the Power of Foundation Models for Accurate Material Classification |  | 0 |
| Rapid Neural Network Prediction of Linear Block Copolymer Free Energies |  | 0 |
| Large-Scale 3D Ground-Motion Synthesis with Physics-Inspired Latent Operator Flow Matching |  | 0 |
| Structured SIR: Efficient and Expressive Importance-Weighted Inference for High-Dimensional Image Registration |  | 0 |
| Joint Degradation-Aware Arbitrary-Scale Super-Resolution for Variable-Rate Extreme Image Compression |  | 0 |
| Mutually Causal Semantic Distillation Network for Zero-Shot Learning |  | 0 |
| Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare |  | 0 |
| From Digital Twins to World Models: Opportunities, Challenges, and Applications for Mobile Edge General Intelligence |  | 0 |
| Data-driven model order reduction for structures with piecewise linear nonlinearity using dynamic mode decomposition |  | 0 |
| ECHO: Towards Emotionally Appropriate and Contextually Aware Interactive Head Generation |  | 0 |
| ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression |  | 0 |
| Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates |  | 0 |
| AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement |  | 0 |
| AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization |  | 0 |
| Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control |  | 0 |
| Revisiting Cross-Attention Mechanisms: Leveraging Beneficial Noise for Domain-Adaptive Learning |  | 0 |
| Humans and transformer LMs: Abstraction drives language learning |  | 0 |
| Auto-Unrolled Proximal Gradient Descent: An AutoML Approach to Interpretable Waveform Optimization |  | 0 |
| Learning When to Attend: Conditional Memory Access for Long-Context LLMs |  | 0 |
| Inducing Epistemological Humility in Large Language Models: A Targeted SFT Approach to Reducing Hallucination |  | 0 |
| Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality |  | 0 |
| Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions |  | 0 |
| Translation Invariance of Neural Operators for the FitzHugh-Nagumo Model |  | 0 |
| Mirror Descent on Riemannian Manifolds |  | 0 |
| MM-OVSeg: Multimodal Optical-SAR Fusion for Open-Vocabulary Segmentation in Remote Sensing |  | 0 |
| AdapTS: Lightweight Teacher-Student Approach for Multi-Class and Continual Visual Anomaly Detection |  | 0 |
| Rel-Zero: Harnessing Patch-Pair Invariance for Robust Zero-Watermarking Against AI Editing |  | 0 |
| Informative Semi-Factuals for XAI: The Elaborated Explanations that People Prefer |  | 0 |
| Temporal Gains, Spatial Costs: Revisiting Video Fine-Tuning in Multimodal Large Language Models |  | 0 |
| ProGVC: Progressive-based Generative Video Compression via Auto-Regressive Context Modeling |  | 0 |
| Face anonymization preserving facial expressions and photometric realism |  | 0 |
| Gaussian Process Limit Reveals Structural Benefits of Graph Transformers |  | 0 |
| PanoVGGT: Feed-Forward 3D Reconstruction from Panoramic Imagery |  | 0 |
| HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness |  | 0 |
Page 54 of 13,200