SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 89519000 of 661570 papers

TitleStatusHype
Toward Unified Multimodal Representation Learning for Autonomous Driving0
What Do AI Agents Talk About? Emergent Communication Structure in the First AI-Only Social Network0
Local Constrained Bayesian Optimization0
CCR-Bench: A Comprehensive Benchmark for Evaluating LLMs on Complex Constraints, Control Flows, and Real-World Cases0
MINT: Molecularly Informed Training with Spatial Transcriptomics Supervision for Pathology Foundation Models0
SMGI: A Structural Theory of General Artificial Intelligence0
LeJOT-AutoML: LLM-Driven Feature Engineering for Job Execution Time Prediction in Databricks Cost Optimization0
EveryQuery: Zero-Shot Clinical Prediction via Task-Conditioned Pretraining over Electronic Health Records0
Long-Short Term Agents for Pure-Vision Bronchoscopy Robotic Autonomy0
Ares: Adaptive Reasoning Effort Selection for Efficient LLM Agents0
Rel-MOSS: Towards Imbalanced Relational Deep Learning on Relational Databases0
RLPR: Radar-to-LiDAR Place Recognition via Two-Stage Asymmetric Cross-Modal Alignment for Autonomous Driving0
Robust Transfer Learning with Side Information0
SWE-Fuse: Empowering Software Agents via Issue-free Trajectory Learning and Entropy-aware RLVR Training0
Text to Automata Diagrams: Comparing TikZ Code Generation with Direct Image Synthesis0
Advancing Automated Algorithm Design via Evolutionary Stagewise Design with LLMs0
AutoTraces: Autoregressive Trajectory Forecasting via Multimodal Large Language Models0
Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning0
VORL-EXPLORE: A Hybrid Learning Planning Approach to Multi-Robot Exploration in Dynamic Environments0
OSExpert: Computer-Use Agents Learning Professional Skills via Exploration0
Emergence is Overrated: AGI as an Archipelago of Experts0
Extend Your Horizon: A Device-Agnostic Surgical Tool Tracking Framework with Multi-View Optimization for Augmented Reality0
On the Feasibility and Opportunity of Autoregressive 3D Object Detection0
TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size1
MJ1: Multimodal Judgment via Grounded Verification0
CMMR-VLN: Vision-and-Language Navigation via Continual Multimodal Memory Retrieval0
Amortizing Maximum Inner Product Search with Learned Support Functions0
It's Time to Get It Right: Improving Analog Clock Reading and Clock-Hand Spatial Reasoning in Vision-Language Models0
PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents0
VSDiffusion: Taming Ill-Posed Shadow Generation via Visibility-Constrained Diffusion0
AffordGrasp: Cross-Modal Diffusion for Affordance-Aware Grasp Synthesis0
Not Like Transformers: Drop the Beat Representation for Dance Generation with Mamba-Based Diffusion Model0
Learning Hierarchical Knowledge in Text-Rich Networks with Taxonomy-Informed Representation Learning0
Controllable Complex Human Motion Video Generation via Text-to-Skeleton Cascades0
QualiTeacher: Quality-Conditioned Pseudo-Labeling for Real-World Image Restoration0
GCGNet: Graph-Consistent Generative Network for Time Series Forecasting with Exogenous Variables0
Solution to the 10th ABAW Expression Recognition Challenge: A Robust Multimodal Framework with Safe Cross-Attention and Modality Dropout0
CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling0
S2S-FDD: Bridging Industrial Time Series and Natural Language for Explainable Zero-shot Fault Diagnosis0
Stabilized Fine-Tuning with LoRA in Federated Learning: Mitigating the Side Effect of Client Size and Rank via the Scaling Factor0
ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning0
Adversarial Domain Adaptation Enables Knowledge Transfer Across Heterogeneous RNA-Seq Datasets0
Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational Modeling0
Synthetic Defect Image Generation for Power Line Insulator Inspection Using Multimodal Large Language Models0
Hybrid Quantum Neural Network for Multivariate Clinical Time Series Forecasting0
Wiener Chaos Expansion based Neural Operator for Singular Stochastic Partial Differential Equations0
Tiny Autoregressive Recursive Models0
From Reactive to Map-Based AI: Tuned Local LLMs for Semantic Zone Inference in Object-Goal Navigation0
EAGLE-Pangu: Accelerator-Safe Tree Speculative Decoding on Ascend NPUs0
DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation0
Show:102550
← PrevPage 180 of 13232Next →