SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 10701-10750 of 661570 papers

Title | Status | Hype
AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLM | | 0
Beyond the Patch: Exploring Vulnerabilities of Visuomotor Policies via Viewpoint-Consistent 3D Adversarial Object | | 0
Semantic Communication-Enhanced Split Federated Learning for Vehicular Networks: Architecture, Challenges, and Case Study | | 0
Federated Heterogeneous Language Model Optimization for Hybrid Automatic Speech Recognition | | 0
LocalSUG: Geography-Aware LLM for Query Suggestion in Local-Life Services | | 0
Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis | | 0
Adaptive Prototype-based Interpretable Grading of Prostate Cancer | | 0
Location-Aware Pretraining for Medical Difference Visual Question Answering | | 0
Retrieval-Augmented Generation with Covariate Time Series | | 0
Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation | | 0
Functionality-Oriented LLM Merging on the Fisher--Rao Manifold | | 0
VRM: Teaching Reward Models to Understand Authentic Human Preferences | | 0
3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding | | 0
Rethinking Representativeness and Diversity in Dynamic Data Selection | | 0
Debiasing Sequential Recommendation with Time-aware Inverse Propensity Scoring | | 0
MultiGO++: Monocular 3D Clothed Human Reconstruction via Geometry-Texture Collaboration | | 0
HiFlow: Hierarchical Feedback-Driven Optimization for Constrained Long-Form Text Generation | | 0
Lightweight and Scalable Transfer Learning Framework for Load Disaggregation | | 0
Physics-consistent deep learning for blind aberration recovery in mobile optics | | 0
Competitive Multi-Operator Reinforcement Learning for Joint Pricing and Fleet Rebalancing in AMoD Systems | | 0
Non-Euclidean Gradient Descent Operates at the Edge of Stability | | 0
Poisoning the Inner Prediction Logic of Graph Neural Networks for Clean-Label Backdoor Attacks | | 0
How far have we gone in Generative Image Restoration? A study on its capability, limitations and evaluation practices | | 0
BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry | | 0
AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems | | 0
The Trilingual Triad Framework: Integrating Design, AI, and Domain Knowledge in No-code AI Smart City Course | | 0
Generalizable Multiscale Segmentation of Heterogeneous Map Collections | | 0
Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination | | 0
Exploiting Intermediate Reconstructions in Optical Coherence Tomography for Test-Time Adaption of Medical Image Segmentation | | 0
CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection | | 0
WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents | | 0
NeuronMoE: Neuron-Guided Mixture-of-Experts for Efficient Multilingual LLM Extension | | 0
MCEL: Margin-Based Cross-Entropy Loss for Error-Tolerant Quantized Neural Networks | | 0
CLIP-driven Zero-shot Learning with Ambiguous Labels | | 0
MUTEX: Leveraging Multilingual Transformers and Conditional Random Fields for Enhanced Urdu Toxic Span Detection | | 0
A 360-degree Multi-camera System for Blue Emergency Light Detection Using Color Attention RT-DETR and the ABLDataset | | 0
Recurrent Graph Neural Networks and Arithmetic Circuits | | 0
UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark | | 0
GEM-TFL: Bridging Weak and Full Supervision for Forgery Localization through EM-Guided Decomposition and Temporal Refinement | | 0
ARC-TGI: Human-Validated Task Generators with Reasoning Chain Templates for ARC-AGI | | 0
Axiomatic On-Manifold Shapley via Optimal Generative Flows | Code | 0
SPIRIT: Perceptive Shared Autonomy for Robust Robotic Manipulation under Deep Learning Uncertainty | | 0
Decoupling Task and Behavior: A Two-Stage Reward Curriculum in Reinforcement Learning for Robotics | | 0
InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context | | 0
Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning | | 0
Measuring the Redundancy of Decoder Layers in SpeechLLMs | | 0
MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus | | 0
LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting | | 0
SRasP: Self-Reorientation Adversarial Style Perturbation for Cross-Domain Few-Shot Learning | | 0
Representation Fidelity: Auditing Algorithmic Decisions About Humans Using Self-Descriptions | | 0
Page 215 of 13232