SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 2501-2550 of 659,983 papers

Title | Status | Hype
Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training | | 0
Developing a Discrete-Event Simulator of School Shooter Behavior from VR Data | | 0
Optimal rates for density and mode estimation with expand-and-sparsify representations | | 0
Equivariant symmetry-aware head pose estimation for fetal MRI | Code | 0
Efficient and Scalable Monocular Human-Object Interaction Motion Reconstruction | Code | 0
Multimodal Machine Learning for Soft High-k Elastomers under Data Scarcity | Code | 0
EPOFusion: Exposure aware Progressive Optimization Method for Infrared and Visible Image Fusion | Code | 0
SSP-SAM: SAM with Semantic-Spatial Prompt for Referring Expression Segmentation | Code | 0
Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization | Code | 0
Approximate Subgraph Matching with Neural Graph Representations and Reinforcement Learning | Code | 0
ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning | Code | 0
Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation | Code | 0
Theory of Code Space: Do Code Agents Understand Software Architecture? | Code | 0
GRAFITE: Generative Regression Analysis Framework for Issue Tracking and Evaluation | Code | 0
AgentFactory: A Self-Evolving Framework Through Executable Subagent Accumulation and Reuse | Code | 0
DREAM: A Benchmark Study for Deepfake photoREalism AssessMent | Code | 0
MLLM-based Textual Explanations for Face Comparison | Code | 0
Training-Only Heterogeneous Image-Patch-Text Graph Supervision for Advancing Few-Shot Learning Adapters | Code | 0
R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation | Code | 0
Open-o3-Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence | | 2
MOSS-TTS Technical Report | | 4
LoST: Level of Semantics Tokenization for 3D Shapes | | 2
OPUS-VFL: Incentivizing Optimal Privacy-Utility Tradeoffs in Vertical Federated Learning | | 0
Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models | | 0
Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech | | 0
M2P: Improving Visual Foundation Models with Mask-to-Point Weakly-Supervised Learning for Dense Point Tracking | | 0
Anchoring and Rescaling Attention for Semantically Coherent Inbetweening | | 0
CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language Models | | 0
Computing Pure-Strategy Nash Equilibria in a Two-Party Policy Competition: Existence and Algorithmic Approaches | | 0
SHIFT: Motion Alignment in Video Diffusion Models with Adversarial Hybrid Fine-Tuning | | 0
Beyond bouba/kiki: Multidimensional semantic signals are deeply woven into the fabric of natural language | | 0
rSDNet: Unified Robust Neural Learning against Label Noise and Adversarial Attacks | | 0
Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures | | 0
VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection | | 0
DeepCORO-CLIP: A Multi-View Foundation Model for Comprehensive Coronary Angiography Video-Text Analysis and External Validation | | 0
AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture-of-Transformers for End-to-End Autonomous Driving | | 0
Bootstrapping Coding Agents: The Specification Is the Program | | 0
Anisotropic Permeability Tensor Prediction from Porous Media Microstructure via Physics-Informed Progressive Transfer Learning with Hybrid CNN-Transformer | | 0
MATA: Mindful Assessment of the Telugu Abilities of Large Language Models | | 0
Graph-Native Cognitive Memory for AI Agents: Formal Belief Revision Semantics for Versioned Memory Architectures | | 0
3D MRI-Based Alzheimer's Disease Classification Using Multi-Modal 3D CNN with Leakage-Aware Subject-Level Evaluation | | 0
AURORA Model of Formant-to-Tongue Inversion for Didactic and Clinical Applications | | 0
Video Understanding: From Geometry and Semantics to Unified Models | | 0
Bringing Emerging Architectures to Sequence Labeling in NLP | | 0
In Trust We Survive: Emergent Trust Learning | | 0
Fast weight programming and linear transformers: from machine learning to neurobiology | | 0
Robust estimation of heterogeneous treatment effects in randomized trials leveraging external data | | 0
MLlm-DR: Towards Explainable Depression Recognition with MultiModal Large Language Models | | 0
Towards Inclusive Communication: A Unified Framework for Generating Spoken Language from Sign, Lip, and Audio | | 0
AVIATOR: Towards AI-Agentic Vulnerability Injection Workflow for High-Fidelity, Large-Scale Code Security Dataset | | 0
Page 51 of 13,200