SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 776800 of 659983 papers

TitleStatusHype
StarCoder: may the source be with you!Code5
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model ParametersCode5
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation ModelsCode5
Jamba-1.5: Hybrid Transformer-Mamba Models at ScaleCode5
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language ModelsCode5
SpinQuant: LLM quantization with learned rotationsCode5
Image Vectorization: a ReviewCode5
Zephyr: Direct Distillation of LM AlignmentCode5
ReLU Strikes Back: Exploiting Activation Sparsity in Large Language ModelsCode5
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven GenerationCode5
ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and EditingCode5
3D Reconstruction with Spatial MemoryCode5
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query ParallelismCode5
Transformers without NormalizationCode5
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional TokensCode5
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction FollowingCode5
Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMsCode5
Benchmarking the Myopic Trap: Positional Bias in Information RetrievalCode5
Randomized Autoregressive Visual GenerationCode5
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal DecompositionCode5
FlowTok: Flowing Seamlessly Across Text and Image TokensCode5
Loki: An Open-Source Tool for Fact VerificationCode5
NeuralSVG: An Implicit Representation for Text-to-Vector GenerationCode5
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer UseCode5
Show:102550
← PrevPage 32 of 26400Next →