SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8601-8625 of 474278 papers

Title | Status | Hype
Why are Visually-Grounded Language Models Bad at Image Classification? | Code | 2
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation | Code | 2
FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model | Code | 2
FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction | Code | 2
FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | Code | 2
Yuan 2.0-M32: Mixture of Experts with Attention Router | Code | 2
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention | Code | 2
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention | Code | 2
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation | Code | 2
DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models | Code | 2
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments | Code | 2
Are Self-Attentions Effective for Time Series Forecasting? | Code | 2
Content-Style Decoupling for Unsupervised Makeup Transfer without Generating Pseudo Ground Truth | Code | 2
Multi-Behavior Generative Recommendation | Code | 2
Autoformalizing Euclidean Geometry | Code | 2
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters | Code | 2
Spectral-Refiner: Accurate Fine-Tuning of Spatiotemporal Fourier Neural Operator for Turbulent Flows | Code | 2
Memorize What Matters: Emergent Scene Decomposition from Multitraverse | Code | 2
Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs | Code | 2
Saturn: Sample-efficient Generative Molecular Design using Memory Manipulation | Code | 2
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | Code | 2
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning | Code | 2
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | Code | 2
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models | Code | 2
AutoPSV: Automated Process-Supervised Verifier | Code | 2
Page 345 of 18972