SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 901950 of 177339 papers

TitleStatusHype
InstantSplat: Sparse-view SfM-free Gaussian Splatting in SecondsCode5
AugLy: Data Augmentations for RobustnessCode5
The Rise and Potential of Large Language Model Based Agents: A SurveyCode5
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMsCode5
Feature Refinement to Improve High Resolution Image InpaintingCode5
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image DetectionCode5
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive DiffusionCode5
Tree of Thoughts: Deliberate Problem Solving with Large Language ModelsCode5
SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape EstimationCode5
Monolith: Real Time Recommendation System With Collisionless Embedding TableCode5
Consistency ModelsCode5
Process Reinforcement through Implicit RewardsCode5
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPUCode5
RealFusion: 360° Reconstruction of Any Object from a Single ImageCode5
YOLOv6 v3.0: A Full-Scale ReloadingCode5
Text-to-Image Rectified Flow as Plug-and-Play PriorsCode5
Agents: An Open-source Framework for Autonomous Language AgentsCode5
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse AttentionCode5
ESC-Eval: Evaluating Emotion Support Conversations in Large Language ModelsCode5
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement LearningCode5
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language ModelsCode5
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language ModelCode5
Prompting Depth Anything for 4K Resolution Accurate Metric Depth EstimationCode5
OPT: Open Pre-trained Transformer Language ModelsCode5
Low Bitrate High-Quality RVQGAN-based Discrete Speech TokenizerCode5
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-XCode5
Deep Confident Steps to New Pockets: Strategies for Docking GeneralizationCode5
Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRICode5
skfolio: Portfolio Optimization in PythonCode5
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAGCode5
Instruction-Following Evaluation for Large Language ModelsCode5
ShowUI: One Vision-Language-Action Model for GUI Visual AgentCode5
NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and ResultsCode5
SpatialTracker: Tracking Any 2D Pixels in 3D SpaceCode5
Autoformalization in the Era of Large Language Models: A SurveyCode5
BM25S: Orders of magnitude faster lexical search via eager sparse scoringCode5
DEIM: DETR with Improved Matching for Fast ConvergenceCode5
UQLM: A Python Package for Uncertainty Quantification in Large Language ModelsCode5
Chinese CLIP: Contrastive Vision-Language Pretraining in ChineseCode5
ControlNeXt: Powerful and Efficient Control for Image and Video GenerationCode5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUsCode5
MiniRAG: Towards Extremely Simple Retrieval-Augmented GenerationCode5
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and MoreCode5
WizardCoder: Empowering Code Large Language Models with Evol-InstructCode5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue AbilitiesCode5
Long-term Forecasting with TiDE: Time-series Dense EncoderCode5
From System 1 to System 2: A Survey of Reasoning Large Language ModelsCode5
Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked DiffusionsCode5
Wonder3D: Single Image to 3D using Cross-Domain DiffusionCode5
MobileVLM V2: Faster and Stronger Baseline for Vision Language ModelCode5
Show:102550
← PrevPage 19 of 3547Next →