SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers247,661 code links4,818 tasks

Papers

Showing 501550 of 177339 papers

TitleStatusHype
Goku: Flow Based Video Generative Foundation ModelsCode7
NVILA: Efficient Frontier Visual Language ModelsCode7
OpenVoice: Versatile Instant Voice CloningCode7
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention TileCode7
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and OrchestrationCode7
Byte Latent Transformer: Patches Scale Better Than TokensCode7
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human AnimationCode7
OmniGen2: Exploration to Advanced Multimodal GenerationCode7
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content CreationCode7
Champ: Controllable and Consistent Human Image Animation with 3D Parametric GuidanceCode7
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement LearningCode7
Gravity-aligned Rotation Averaging with Circular RegressionCode7
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!Code7
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement LearningCode7
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion TransformerCode7
LLM Post-Training: A Deep Dive into Reasoning Large Language ModelsCode7
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image AnimationCode7
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation DatasetCode7
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt SynergyCode7
HuixiangDou2: A Robustly Optimized GraphRAG ApproachCode7
MaskSketch: Unpaired Structure-guided Masked Image GenerationCode7
MoE-LLaVA: Mixture of Experts for Large Vision-Language ModelsCode7
Step-Audio: Unified Understanding and Generation in Intelligent Speech InteractionCode7
Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM TrainingCode7
Step1X-Edit: A Practical Framework for General Image EditingCode7
LLaVA-CoT: Let Vision Language Models Reason Step-by-StepCode7
Zero-shot Voice Conversion with Diffusion TransformersCode7
xLSTM: Extended Long Short-Term MemoryCode7
Full Scaling Automation for Sustainable Development of Green Data CentersCode7
LLaMA: Open and Efficient Foundation Language ModelsCode7
Direct Preference Optimization: Your Language Model is Secretly a Reward ModelCode6
Vision Transformers Need RegistersCode6
iTransformer: Inverted Transformers Are Effective for Time Series ForecastingCode6
L-Eval: Instituting Standardized Evaluation for Long Context Language ModelsCode6
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from BackboneCode6
RWKV: Reinventing RNNs for the Transformer EraCode6
A Watermark for Large Language ModelsCode6
Instant Neural Graphics Primitives with a Multiresolution Hash EncodingCode6
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language ModelsCode6
Mistral 7BCode6
Visual Instruction TuningCode6
A decoder-only foundation model for time-series forecastingCode6
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human FeedbackCode6
CVNets: High Performance Library for Computer VisionCode6
Better speech synthesis through scalingCode6
YaRN: Efficient Context Window Extension of Large Language ModelsCode6
H2O Open Ecosystem for State-of-the-art Large Language ModelsCode6
Towards Robust Blind Face Restoration with Codebook Lookup TransformerCode6
Pythia: A Suite for Analyzing Large Language Models Across Training and ScalingCode6
FlashAttention-2: Faster Attention with Better Parallelism and Work PartitioningCode6
Show:102550
← PrevPage 11 of 3547Next →