SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1245112500 of 474278 papers

TitleStatusHype
LiLM-RDB-SFC: Lightweight Language Model with Relational Database-Guided DRL for Optimized SFC Provisioning0
Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation ModelsCode0
A Risk-Aware Adaptive Robust MPC with Learned Uncertainty Quantification0
Mind the Gap: Bridging Occlusion in Gait Recognition via Residual Gap Correction0
A Learning Framework For Cooperative Collision Avoidance of UAV Swarms Leveraging Domain Knowledge0
GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View LocalizationCode0
DCR: Quantifying Data Contamination in LLMs EvaluationCode0
High-Throughput Distributed Reinforcement Learning via Adaptive Policy SynchronizationCode0
LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation0
Multi-Trigger Poisoning Amplifies Backdoor Vulnerabilities in LLMs0
Sparse Regression Codes exploit Multi-User Diversity without CSI0
HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing0
Stochastic Entanglement Configuration for Constructive Entanglement Topologies in Quantum Machine Learning with Application to Cardiac MRI0
Local Pairwise Distance Matching for Backpropagation-Free Reinforcement Learning0
Fairness-Aware Secure Integrated Sensing and Communications with Fractional Programming0
Personalized Exercise Recommendation with Semantically-Grounded Knowledge TracingCode0
Robust-Multi-Task Gradient BoostingCode0
Try Harder: Hard Sample Generation and Learning for Clothes-Changing Person Re-IDCode0
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation0
Sensing Accuracy Optimization for Multi-UAV SAR Interferometry with Data Offloading0
Recursive Bound-Constrained AdaGrad with Applications to Multilevel and Domain Decomposition Minimization0
KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?0
SpaRTAN: Spatial Reinforcement Token-based Aggregation Network for Visual RecognitionCode0
LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification0
AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air0
COLIBRI Fuzzy Model: Color Linguistic-Based Representation and Interpretation0
A Parallelizable Approach for Characterizing NE in Zero-Sum Games After a Linear Number of Iterations of Gradient DescentCode0
PGT-I: Scaling Spatiotemporal GNNs with Memory-Efficient Distributed TrainingCode0
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMsCode2
Seq vs Seq: An Open Suite of Paired Encoders and DecodersCode2
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil EngineeringCode2
Fairness-Aware Grouping for Continuous Sensitive Variables: Application for Debiasing Face Analysis with respect to Skin ToneCode1
CharaConsist: Fine-Grained Consistent Character GenerationCode2
MonoMVSNet: Monocular Priors Guided Multi-View Stereo NetworkCode1
Latent Space Consistency for Sparse-View CT Reconstruction0
Data Augmentation in Time Series Forecasting through Inverted Framework0
Addressing Data Imbalance in Transformer-Based Multi-Label Emotion Detection with Weighted LossCode0
Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMsCode0
Interpretable Bayesian Tensor Network Kernel Machines with Automatic Rank and Feature SelectionCode0
Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based AdaptationCode1
SystolicAttention: Fusing FlashAttention within a Single Systolic ArrayCode2
A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex EnvironmentsCode0
RDMA: Cost Effective Agent-Driven Rare Disease Discovery within Electronic Health Record SystemsCode0
Open-Source LLMs Collaboration Beats Closed-Source LLMs: A Scalable Multi-Agent SystemCode0
Democratizing High-Fidelity Co-Speech Gesture Video Generation0
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation0
Deep Recurrence for Dynamical Segmentation ModelsCode0
DeepResearch^Eco: A Recursive Agentic Workflow for Complex Scientific Question Answering in EcologyCode0
Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers0
WASABI: A Metric for Evaluating Morphometric Plausibility of Synthetic Brain MRIsCode0
Show:102550
← PrevPage 250 of 9486Next →