SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 14101-14150 of 474278 papers

Title | Status | Hype
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations | Code | 0
High-Resolution Live Fuel Moisture Content (LFMC) Maps for Wildfire Risk from Multimodal Earth Observation Data | Code | 1
MMSearch-R1: Incentivizing LMMs to Search | Code | 3
Loss-Aware Automatic Selection of Structured Pruning Criteria for Deep Neural Network Acceleration | Code | 1
Disentangled representations of microscopy images | Code | 0
AUTOMATIC PRONUNCIATION MISTAKE DETECTOR PROJECT REPORT | | 0
AN INTERNSHIP REPORT ON E-HELPING HOUSING SOCIETY PROJECT REPORT | | 0
Causal-Paced Deep Reinforcement Learning | Code | 0
ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes | Code | 0
Ark: An Open-source Python-based Framework for Robot Learning | | 0
GBGC: Efficient and Adaptive Graph Coarsening via Granular-ball Computing | Code | 0
Ancient Script Image Recognition and Processing: A Review | | 0
Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models | Code | 1
NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling | | 0
Progressive Size-Adaptive Federated Learning: A Comprehensive Framework for Heterogeneous Multi-Modal Data Systems | | 0
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions | | 0
KunLunBaizeRAG: Reinforcement Learning Driven Inference Performance Leap for Large Language Models | | 0
Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Code | 0
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Code | 1
Augmenting Multi-Agent Communication with State Delta Trajectory | Code | 1
Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of Progress | Code | 0
Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment | Code | 0
From Memories to Maps: Mechanisms of In-Context Reinforcement Learning in Transformers | | 0
Behavioral Anomaly Detection in Distributed Systems via Federated Contrastive Learning | | 0
Tailored Conversations beyond LLMs: A RL-Based Dialogue Manager | | 0
FAF: A Feature-Adaptive Framework for Few-Shot Time Series Forecasting | | 0
CoCo4D: Comprehensive and Complex 4D Scene Generation | | 0
SAM2-SGP: Enhancing SAM2 for Medical Image Segmentation via Support-Set Guided Prompting | Code | 0
Overtuning in Hyperparameter Optimization | Code | 0
Scaling Speculative Decoding with Lookahead Reasoning | Code | 0
From Data Acquisition to Lag Modeling: Quantitative Exploration of A-Share Market with Low-Coupling System Design | | 0
Posterior Cramér-Rao Bounds on Localization and Mapping Errors in Distributed MIMO SLAM | | 0
Revisiting R: Statistical Envelope Analysis for Lightweight RF Modulation Classification | | 0
A Wireless Self-Calibrating Ultrasound Microphone Array with Sub-Microsecond Synchronization | | 0
Reconfigurable Intelligent Surfaces for 6G and Beyond: A Comprehensive Survey from Theory to Deployment | | 0
From High-SNR Radar Signal to ECG: A Transfer Learning Model with Cardio-Focusing Algorithm for Scenarios with Limited Data | | 0
A standard transformer and attention with linear biases for molecular conformer generation | | 0
The time course of visuo-semantic representations in the human brain is captured by combining vision and language models | | 0
[Beat-to-beat AV nodal assessment] ECG-based beat-to-beat assessment of AV node conduction properties during AF | | 0
Generate the Forest before the Trees -- A Hierarchical Diffusion model for Climate Downscaling | Code | 0
Exact Matrix Seriation through Mathematical Optimization: Stress and Effectiveness-Based Models | Code | 0
When Can We Reuse a Calibration Set for Multiple Conformal Predictions? | | 0
ADDQ: Adaptive Distributional Double Q-Learning | Code | 0
Toward Decision-Oriented Prognostics: An Integrated Estimate-Optimize Framework for Predictive Maintenance | | 0
The Shape of Consumer Behavior: A Symbolic and Topological Analysis of Time Series | | 0
ProCaliper: functional and structural analysis, visualization, and annotation of proteins | Code | 0
Training Flexible Models of Genetic Variant Effects from Functional Annotations using Accelerated Linear Algebra | Code | 0
Toward the Explainability of Protein Language Models for Sequence Design | | 0
Neural Collapse based Deep Supervised Federated Learning for Signal Detection in OFDM Systems | | 0
Cross-regularization: Adaptive Model Complexity through Validation Gradients | | 0