SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers, 258,216 code links, 4,818 tasks

Papers

Showing 351–400 of 658,356 papers

Title | Status | Hype
Flow Matching Guide and Code | Code | 7
NVILA: Efficient Frontier Visual Language Models | Code | 7
GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot | Code | 7
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning | Code | 7
Efficient Track Anything | Code | 7
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Code | 7
Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Code | 7
X-MeshGraphNet: Scalable Multi-Scale Graph Neural Networks for Physics Simulation | Code | 7
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? | Code | 7
Tulu 3: Pushing Frontiers in Open Language Model Post-Training | Code | 7
RedPajama: an Open Dataset for Training Large Language Models | Code | 7
OASIS: Open Agent Social Interaction Simulations with One Million Agents | Code | 7
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization | Code | 7
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step | Code | 7
Zero-shot Voice Conversion with Diffusion Transformers | Code | 7
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation | Code | 7
MagicQuill: An Intelligent Interactive Image Editing System | Code | 7
Measuring short-form factuality in large language models | Code | 7
xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism | Code | 7
CALE: Continuous Arcade Learning Environment | Code | 7
In-Context LoRA for Diffusion Transformers | Code | 7
AutoRAG: Automated Framework for optimization of Retrieval Augmented Generation Pipeline | Code | 7
ThunderKittens: Simple, Fast, and Adorable AI Kernels | Code | 7
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data | Code | 7
AutoTrain: No-code training for state-of-the-art models | Code | 7
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant | Code | 7
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement | Code | 7
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Code | 7
Gravity-aligned Rotation Averaging with Circular Regression | Code | 7
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing | Code | 7
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos | Code | 7
AFlow: Automating Agentic Workflow Generation | Code | 7
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | Code | 7
O1 Replication Journey: A Strategic Progress Report -- Part 1 | Code | 7
Pyramidal Flow Matching for Efficient Video Generative Modeling | Code | 7
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains? | Code | 7
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration | Code | 7
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models | Code | 7
PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System | Code | 7
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI | Code | 7
OmniGen: Unified Image Generation | Code | 7
LLaMA-Omni: Seamless Speech Interaction with Large Language Models | Code | 7
gsplat: An Open-Source Library for Gaussian Splatting | Code | 7
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery | Code | 7
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming | Code | 7
FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Code | 7
Real-Time Video Generation with Pyramid Attention Broadcast | Code | 7
FourierKAN outperforms MLP on Text Classification Head Fine-tuning | Code | 7
VITA: Towards Open-Source Interactive Omni Multimodal LLM | Code | 7
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models | Code | 7
Page 8 of 13,168