SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 22512275 of 177340 papers

TitleStatusHype
VideoFusion: Decomposed Diffusion Models for High-Quality Video GenerationCode4
LLM Inference Unveiled: Survey and Roofline Model InsightsCode4
Multimodal Whole Slide Foundation Model for PathologyCode4
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorchCode4
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with NothingCode4
MonSter: Marry Monodepth to Stereo Unleashes PowerCode4
Large Models for Time Series and Spatio-Temporal Data: A Survey and OutlookCode4
Cost-Effective Hyperparameter Optimization for Large Language Model Generation InferenceCode4
Efficient Post-training Quantization with FP8 FormatsCode4
Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournamentsCode4
Transformers in Time Series: A SurveyCode4
RaTEScore: A Metric for Radiology Report GenerationCode4
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow MatchingCode4
Atom of Thoughts for Markov LLM Test-Time ScalingCode4
Mixtral of ExpertsCode4
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency PolicyCode3
KwaiAgents: Generalized Information-seeking Agent System with Large Language ModelsCode3
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented GenerationCode3
How Far Are We From AGI: Are LLMs All We Need?Code3
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation FrameworkCode3
Controllable Text-to-3D Generation via Surface-Aligned Gaussian SplattingCode3
TKAN: Temporal Kolmogorov-Arnold NetworksCode3
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data CompositionCode3
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?Code3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
Show:102550
← PrevPage 91 of 7094Next →