SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 36513700 of 659983 papers

TitleStatusHype
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMsCode3
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous DrivingCode3
Rho-1: Not All Tokens Are What You NeedCode3
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on GraphsCode3
Addressing the Abstraction and Reasoning Corpus via Procedural Example GenerationCode3
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly DetectionCode3
ZeST: Zero-Shot Material Transfer from a Single ImageCode3
RoadBEV: Road Surface Reconstruction in Bird's Eye ViewCode3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in PythonCode3
HPNet: Dynamic Trajectory Forecasting with Historical Prediction AttentionCode3
pfl-research: simulation framework for accelerating research in Private Federated LearningCode3
MoMA: Multimodal LLM Adapter for Fast Personalized Image GenerationCode3
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly DetectionCode3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video UnderstandingCode3
AI2Apps: A Visual IDE for Building LLM-based AI Agent ApplicationsCode3
Allo: A Programming Model for Composable Accelerator DesignCode3
Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision MakingCode3
Lossless and Near-Lossless Compression for Foundation ModelsCode3
Sigma: Siamese Mamba Network for Multi-Modal Semantic SegmentationCode3
3D Facial Expressions through Analysis-by-Neural-SynthesisCode3
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future DirectionsCode3
LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR SynthesisCode3
RS-Mamba for Large Remote Sensing Image Dense PredictionCode3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language ModelsCode3
Faster Diffusion via Temporal Attention DecompositionCode3
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language ModelsCode3
Bidirectional Multi-Scale Implicit Neural Representations for Image DerainingCode3
Tensorized NeuroEvolution of Augmenting Topologies for GPU AccelerationCode3
Advancing LLM Reasoning Generalists with Preference TreesCode3
SPMamba: State-space model is all you need in speech separationCode3
GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo ViewsCode3
ViTamin: Designing Scalable Vision Models in the Vision-Language EraCode3
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive AttacksCode3
Evalverse: Unified and Accessible Library for Large Language Model EvaluationCode3
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEACode3
HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based ApproachCode3
An RML-FNML module for Python user-defined functions in Morph-KGCCode3
Evaluating Text-to-Visual Generation with Image-to-Text GenerationCode3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
Towards Realistic Scene Generation with LiDAR Diffusion ModelsCode3
DRCT: Saving Image Super-resolution away from Information BottleneckCode3
94% on CIFAR-10 in 3.29 Seconds on a Single GPUCode3
Rewrite the StarsCode3
UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion SegmentationCode3
Are We on the Right Way for Evaluating Large Vision-Language Models?Code3
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage ScenariosCode3
RSMamba: Remote Sensing Image Classification with State Space ModelCode3
Navigating Eukaryotic Genome Annotation Pipelines: A Route Map to BRAKER, Galba, and TSEBRACode3
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language ModelsCode3
MagicLens: Self-Supervised Image Retrieval with Open-Ended InstructionsCode3
Show:102550
← PrevPage 74 of 13200Next →