SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 93019325 of 474278 papers

TitleStatusHype
Code4MeV2: a Research-oriented Code-completion Platform0
SAR-TEXT: A Large-Scale SAR Image-Text Dataset Built with SAR-Narrator and A Progressive Learning Strategy for Downstream TasksCode0
Self-Correction Bench: Uncovering and Addressing the Self-Correction Blind Spot in Large Language Models0
Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMsCode0
ReMoMask: Retrieval-Augmented Masked Motion GenerationCode0
ArcMemo: Abstract Reasoning Composition with Lifelong LLM MemoryCode0
Adaptively Sampling-Reusing-Mixing Decomposed Gradients to Speed Up Sharpness Aware MinimizationCode0
LLM-Guided Evolutionary Program Synthesis for Quasi-Monte Carlo DesignCode0
Neural Low-Discrepancy SequencesCode0
Cross-Lingual Multi-Granularity Framework for Interpretable Parkinson's Disease Diagnosis from SpeechCode0
Destination-to-Chutes Task Mapping Optimization for Multi-Robot Coordination in Robotic Sorting SystemsCode0
EGSTalker: Real-Time Audio-Driven Talking Head Generation with Efficient Gaussian Deformation0
Knowledge Graph-Guided Multi-Agent Distillation for Reliable Industrial Question Answering with DatasetsCode0
Towards Size-invariant Salient Object Detection: A Generic Evaluation and Optimization ApproachCode0
Self-Reflective Generation at Test Time0
Dual-Stage Reweighted MoE for Long-Tailed Egocentric Mistake DetectionCode0
ZeroShotOpt: Towards Zero-Shot Pretrained Models for Efficient Black-Box OptimizationCode0
Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models0
LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers0
Hyperparameter Loss Surfaces Are Simple Near their OptimaCode0
Towards Scalable and Consistent 3D Editing0
LHGEL: Large Heterogeneous Graph Ensemble Learning using Batch View AggregationCode0
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models0
Leave No TRACE: Black-box Detection of Copyrighted Dataset Usage in Large Language Models via WatermarkingCode0
Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile DevicesCode0
Show:102550
← PrevPage 373 of 18972Next →