SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 81268150 of 177340 papers

TitleStatusHype
VRL3: A Data-Driven Framework for Visual Deep Reinforcement LearningCode2
A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and BenchmarkCode2
Neural interval-censored survival regression with feature selectionCode2
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented GenerationCode2
DiffusionBERT: Improving Generative Masked Language Models with Diffusion ModelsCode2
Executing your Commands via Motion Diffusion in Latent SpaceCode2
NMS Strikes BackCode2
DiffFace: Diffusion-based Face Swapping with Facial GuidanceCode2
Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model CapabilityCode2
Efficient Speech Enhancement via Embeddings from Pre-trained Generative AudioencodersCode2
Watermarking Autoregressive Image GenerationCode2
Investigating Affective Use and Emotional Well-being on ChatGPTCode2
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model ParallelismCode2
Autonomous Improvement of Instruction Following Skills via Foundation ModelsCode2
MemoryBank: Enhancing Large Language Models with Long-Term MemoryCode2
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language ModelsCode2
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image EditingCode2
GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D ReconstructionCode2
Unified Continuous Generative ModelsCode2
Text-based Animatable 3D Avatars with Morphable Model AlignmentCode2
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data RestorationCode2
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language ModelsCode2
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model CapabilitiesCode2
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly DetectionCode2
A Tutorial on Structural Identifiability of Epidemic Models Using StructuralIdentifiability.jlCode2
Show:102550
← PrevPage 326 of 7094Next →