SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2426-2450 of 661,570 papers

Title | Status | Hype
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models | Code | 3
Pythia v0.1: the Winning Entry to the VQA Challenge 2018 | Code | 3
SQLFlow: A Bridge between SQL and Machine Learning | Code | 3
Mesh R-CNN | Code | 3
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Code | 3
MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio | Code | 3
Efficient and Robust Automated Machine Learning | Code | 3
MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem | Code | 3
SynSin: End-to-end View Synthesis from a Single Image | Code | 3
An Extensible Framework for Open Heterogeneous Collaborative Perception | Code | 3
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs | Code | 3
Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence | Code | 3
MMLSpark: Unifying Machine Learning Ecosystems at Massive Scales | Code | 3
Simulating the Real World: A Unified Survey of Multimodal Generative Models | Code | 3
AlphaEvolve: A Learning Framework to Discover Novel Alphas in Quantitative Investment | Code | 3
VideoRoPE: What Makes for Good Video Rotary Position Embedding? | Code | 3
Green AI | Code | 3
Bag of Freebies for Training Object Detection Neural Networks | Code | 3
Characterizing signal propagation to close the performance gap in unnormalized ResNets | Code | 3
SnapKV: LLM Knows What You are Looking for Before Generation | Code | 3
Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond | Code | 3
Distributional Generalization: A New Kind of Generalization | Code | 3
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space | Code | 3
ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning | Code | 3
Bilinear Attention Networks | Code | 3