SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8151-8175 of 474278 papers

Title | Status | Hype
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance | Code | 2
DiffusionPDE: Generative PDE-Solving Under Partial Observation | Code | 2
Dual-Space Knowledge Distillation for Large Language Models | Code | 2
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection | Code | 2
Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient Flows | Code | 2
European Space Agency Benchmark for Anomaly Detection in Satellite Telemetry | Code | 2
LumberChunker: Long-Form Narrative Document Segmentation | Code | 2
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Code | 2
Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning | Code | 2
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models | Code | 2
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning | Code | 2
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Code | 2
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers | Code | 2
The Balanced-Pairwise-Affinities Feature Transform | Code | 2
Revitalizing Convolutional Network for Image Restoration | Code | 2
SUM: Saliency Unification through Mamba for Visual Attention Modeling | Code | 2
FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model | Code | 2
Disentangled Motion Modeling for Video Frame Interpolation | Code | 2
Alpha^2: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning | Code | 2
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models | Code | 2
GC4NC: A Benchmark Framework for Graph Condensation on Node Classification with New Insights | Code | 2
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far? | Code | 2
FaceScore: Benchmarking and Enhancing Face Quality in Human Generation | Code | 2
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization | Code | 2
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation | Code | 2
Page 327 of 18972