SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8526 to 8550 of 474278 papers

Title | Status | Hype
Select-Then-Decompose: From Empirical Analysis to Adaptive Selection Strategy for Task Decomposition in Large Language Models | Code | 0
SimBA: Simplifying Benchmark Analysis Using Performance Matrices Alone | Code | 0
Language Confusion Gate: Language-Aware Decoding Through Model Self-Distillation | Code | 0
VisiPruner: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs | Code | 0
λ-Orthogonality Regularization for Compatible Representation Learning | Code | 0
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents | Code | 0
FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection | Code | 0
TimeEmb: A Lightweight Static-Dynamic Disentanglement Framework for Time Series Forecasting | Code | 0
Synthetic Series-Symbol Data Generation for Time Series Foundation Models | Code | 0
Shape-aware Inertial Poser: Motion Tracking for Humans with Diverse Shapes Using Sparse Inertial Sensors | Code | 0
Benchmarking Out-of-Distribution Detection for Plankton Recognition: A Systematic Evaluation of Advanced Methods in Marine Ecological Monitoring | Code | 0
Rethinking Nighttime Image Deraining via Learnable Color Space Transformation | Code | 0
An Empirical Study of Lagrangian Methods in Safe Reinforcement Learning | Code | 0
CEPerFed: Communication-Efficient Personalized Federated Learning for Multi-Pulse MRI Classification | Code | 0
Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning | Code | 0
AcademicEval: Live Long-Context LLM Benchmark | Code | 0
DETree: DEtecting Human-AI Collaborative Texts via Tree-Structured Hierarchical Representation Learning | Code | 0
Mismatch reconstruction theory for unknown measurement matrix in imaging through multimode fiber bending | Code | 0
Class-N-Diff: Classification-Induced Diffusion Model Can Make Fair Skin Cancer Diagnosis | Code | 0
RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization | | 0
Q♯: Provably Optimal Distributional RL for LLM Post-Training | Code | 0
Agentic Design of Compositional Machines | | 0
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science | | 0
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations | | 0
A Controllable Examination for Long-Context Language Models | | 0
Page 342 of 18972