SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 201250 of 474278 papers

TitleStatusHype
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image SynthesisCode9
Agent Laboratory: Using LLM Agents as Research AssistantsCode9
Divide and Conquer: High-Resolution Industrial Anomaly Detection via Memory Efficient Tiled EnsembleCode9
OpenVLA: An Open-Source Vision-Language-Action ModelCode9
Transformer Explainer: Interactive Learning of Text-Generative ModelsCode9
SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compileCode9
Emerging Properties in Unified Multimodal PretrainingCode9
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated ParametersCode9
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion TransformersCode9
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language ModelsCode9
AgentRxiv: Towards Collaborative Autonomous ResearchCode9
Natural language guidance of high-fidelity text-to-speech with synthetic annotationsCode9
Soft Condorcet Optimization for Ranking of General AgentsCode9
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end ModelCode9
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image AnimationCode9
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion TransformersCode9
Infinigen Indoors: Photorealistic Indoor Scenes using Procedural GenerationCode9
PowerInfer-2: Fast Large Language Model Inference on a SmartphoneCode9
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PCCode9
GPT4All: An Ecosystem of Open Source Compressed Language ModelsCode8
DocLayNet: A Large Human-Annotated Dataset for Document-Layout AnalysisCode8
Llama 2: Open Foundation and Fine-Tuned Chat ModelsCode8
DETRs Beat YOLOs on Real-time Object DetectionCode8
Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech RecognitionCode8
Perception Encoder: The best visual embeddings are not at the output of the networkCode8
Fine-mixing: Mitigating Backdoors in Fine-tuned Language ModelsCode8
Robust Speech Recognition via Large-Scale Weak SupervisionCode8
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning7
dLLM: Simple Diffusion Language Modeling7
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning7
SAM 3D Body: Robust Full-Body Human Mesh Recovery7
Advancing Open-source World Models7
Attention Residuals7
Pretraining Large Language Models with NVFP47
Qwen3-ASR Technical Report7
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem7
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement LearningCode7
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human AnimationCode7
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion TransformerCode7
LLM Post-Training: A Deep Dive into Reasoning Large Language ModelsCode7
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!Code7
HuixiangDou2: A Robustly Optimized GraphRAG ApproachCode7
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image AnimationCode7
Champ: Controllable and Consistent Human Image Animation with 3D Parametric GuidanceCode7
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation DatasetCode7
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement LearningCode7
MaskSketch: Unpaired Structure-guided Masked Image GenerationCode7
MoE-LLaVA: Mixture of Experts for Large Vision-Language ModelsCode7
Byte Latent Transformer: Patches Scale Better Than TokensCode7
Gravity-aligned Rotation Averaging with Circular RegressionCode7
Show:102550
← PrevPage 5 of 9486Next →