SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 30763100 of 177340 papers

TitleStatusHype
Scaling Diffusion Language Models via Adaptation from Autoregressive ModelsCode3
ZipNN: Lossless Compression for AI ModelsCode3
TEXGen: a Generative Diffusion Model for Mesh TexturesCode3
BIP3D: Bridging 2D Images and 3D Perception for Embodied IntelligenceCode3
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language ModelsCode3
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and GenerationCode3
TryOffAnyone: Tiled Cloth Generation from a Dressed PersonCode3
InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse AutoencodersCode3
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive SurveyCode3
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLMCode3
Lifelong Learning of Large Language Model based Agents: A RoadmapCode3
Learning Getting-Up Policies for Real-World Humanoid RobotsCode3
TokenSkip: Controllable Chain-of-Thought Compression in LLMsCode3
Attention Distillation: A Unified Approach to Visual Characteristics TransferCode3
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset GenerationCode3
SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey WritingCode3
Director3D: Real-world Camera Trajectory and 3D Scene Generation from TextCode3
Unleashing Vecset Diffusion Model for Fast Shape GenerationCode3
HyperGraphRAG: Retrieval-Augmented Generation with Hypergraph-Structured Knowledge RepresentationCode3
End-to-End Driving with Online Trajectory Evaluation via BEV World ModelCode3
Motion Representations for Articulated AnimationCode3
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language ModelsCode3
Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future DirectionsCode3
Parallel Scaling Law for Language ModelsCode3
Visual Planning: Let's Think Only with ImagesCode3
Show:102550
← PrevPage 124 of 7094Next →