SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 70767100 of 474278 papers

TitleStatusHype
LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image RestorationCode2
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent EvaluationCode2
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One StepCode2
DM-Codec: Distilling Multimodal Representations for Speech TokenizationCode2
A Multimodal Vision Foundation Model for Clinical DermatologyCode2
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement LearningCode2
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State FusionCode2
SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuningCode2
REEF: Representation Encoding Fingerprints for Large Language ModelsCode2
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation CapabilitiesCode2
How to Evaluate Reward Models for RLHFCode2
Combining Hough Transform and Deep Learning Approaches to Reconstruct ECG Signals From PrintoutsCode2
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial ScenariosCode2
Comparing Differentiable and Dynamic Ray Tracing: Introducing the Multipath Lifetime MapCode2
A Systematic Study of Cross-Layer KV Sharing for Efficient LLM InferenceCode2
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image GenerationCode2
Montessori-Instruct: Generate Influential Training Data Tailored for Student LearningCode2
CybORG++: An Enhanced Gym for the Development of Autonomous Cyber AgentsCode2
Dynamic Factor Allocation Leveraging Regime-Switching SignalsCode2
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and PlanningCode2
On the Role of Attention Heads in Large Language Model SafetyCode2
SimLayerKV: A Simple Framework for Layer-Level KV Cache ReductionCode2
PUMA: Empowering Unified MLLM with Multi-granular Visual GenerationCode2
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language ModelsCode2
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous TokensCode2
Show:102550
← PrevPage 284 of 18972Next →