SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2676-2700 of 661,570 papers

Title | Status | Hype
Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs | Code | 3
Learning and discovering multiple solutions using physics-informed neural networks with random initialization and deep ensemble | Code | 3
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images | Code | 3
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving | Code | 3
MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio | Code | 3
Simulating the Real World: A Unified Survey of Multimodal Generative Models | Code | 3
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning | Code | 3
SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing | Code | 3
EgoLife: Towards Egocentric Life Assistant | Code | 3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Code | 3
All-atom Diffusion Transformers: Unified generative modelling of molecules and materials | Code | 3
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation | Code | 3
A Phylogenetic Approach to Genomic Language Modeling | Code | 3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding | Code | 3
OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale | Code | 3
Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly Detection | Code | 3
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models | Code | 3
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures | Code | 3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training | Code | 3
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs | Code | 3
MUSt3R: Multi-view Network for Stereo 3D Reconstruction | Code | 3
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface | Code | 3
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation | Code | 3
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization | Code | 3
Proteina: Scaling Flow-based Protein Structure Generative Models | Code | 3
Page 108 of 26,463