SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 3,651-3,675 of 661,570 papers

Title | Status | Hype
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs | Code | 3
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | Code | 3
Rho-1: Not All Tokens Are What You Need | Code | 3
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs | Code | 3
Addressing the Abstraction and Reasoning Corpus via Procedural Example Generation | Code | 3
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection | Code | 3
ZeST: Zero-Shot Material Transfer from a Single Image | Code | 3
RoadBEV: Road Surface Reconstruction in Bird's Eye View | Code | 3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Code | 3
HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention | Code | 3
pfl-research: simulation framework for accelerating research in Private Federated Learning | Code | 3
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Code | 3
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection | Code | 3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Code | 3
AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications | Code | 3
Allo: A Programming Model for Composable Accelerator Design | Code | 3
Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making | Code | 3
Lossless and Near-Lossless Compression for Foundation Models | Code | 3
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Code | 3
3D Facial Expressions through Analysis-by-Neural-Synthesis | Code | 3
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions | Code | 3
LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Code | 3
RS-Mamba for Large Remote Sensing Image Dense Prediction | Code | 3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Code | 3
Faster Diffusion via Temporal Attention Decomposition | Code | 3