SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 926-950 of 177,339 papers

Title | Status | Hype
Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer | Code | 5
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X | Code | 5
Deep Confident Steps to New Pockets: Strategies for Docking Generalization | Code | 5
Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRI | Code | 5
skfolio: Portfolio Optimization in Python | Code | 5
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG | Code | 5
Instruction-Following Evaluation for Large Language Models | Code | 5
ShowUI: One Vision-Language-Action Model for GUI Visual Agent | Code | 5
NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results | Code | 5
SpatialTracker: Tracking Any 2D Pixels in 3D Space | Code | 5
Autoformalization in the Era of Large Language Models: A Survey | Code | 5
BM25S: Orders of magnitude faster lexical search via eager sparse scoring | Code | 5
DEIM: DETR with Improved Matching for Fast Convergence | Code | 5
UQLM: A Python Package for Uncertainty Quantification in Large Language Models | Code | 5
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Code | 5
ControlNeXt: Powerful and Efficient Control for Image and Video Generation | Code | 5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Code | 5
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation | Code | 5
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Code | 5
WizardCoder: Empowering Code Large Language Models with Evol-Instruct | Code | 5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | Code | 5
Long-term Forecasting with TiDE: Time-series Dense Encoder | Code | 5
From System 1 to System 2: A Survey of Reasoning Large Language Models | Code | 5
Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions | Code | 5
Wonder3D: Single Image to 3D using Cross-Domain Diffusion | Code | 5
Page 38 of 7094