SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 426450 of 659983 papers

TitleStatusHype
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference FeedbackCode7
TextGrad: Automatic "Differentiation" via TextCode7
Mixture-of-Agents Enhances Large Language Model CapabilitiesCode7
M&M VTO: Multi-Garment Virtual Try-On and EditingCode7
The Prompt Report: A Systematic Survey of Prompting TechniquesCode7
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiTCode7
Seed-TTS: A Family of High-Quality Versatile Speech Generation ModelsCode7
Scalable MatMul-free Language ModelingCode7
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text EmbeddingCode7
Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single ImageCode7
TotalSegmentator MRI: Robust Sequence-independent Segmentation of Multiple Anatomic Structures in MRICode7
Adaptive In-conversation Team Building for Language Model AgentsCode7
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer ArchitectureCode7
PromptWizard: Task-Aware Prompt Optimization FrameworkCode7
Vista: A Generalizable Driving World Model with High Fidelity and Versatile ControllabilityCode7
Efficient multi-prompt evaluation of LLMsCode7
The Road Less ScheduledCode7
Learning Multi-dimensional Human Preference for Text-to-Image GenerationCode7
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language ModelsCode7
Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM TrainingCode7
Dynamic data sampler for cross-language transfer learning in large language modelsCode7
Chameleon: Mixed-Modal Early-Fusion Foundation ModelsCode7
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language ModelsCode7
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object DetectionCode7
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese UnderstandingCode7
Show:102550
← PrevPage 18 of 26400Next →