SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers · 247,172 code links · 4,818 tasks

Papers

Showing 501-550 of 658,356 papers

Title | Status | Hype
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases | Code | 7
Robust Inverse Graphics via Probabilistic Inference | Code | 7
Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Code | 7
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models | Code | 7
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty | Code | 7
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers | Code | 7
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads | Code | 7
VMamba: Visual State Space Model | Code | 7
Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering | Code | 7
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance | Code | 7
Exploring Compressed Image Representation as a Perceptual Proxy: A Study | Code | 7
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models | Code | 7
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference | Code | 7
Bilateral Reference for High-Resolution Dichotomous Image Segmentation | Code | 7
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations | Code | 7
OpenVoice: Versatile Instant Voice Cloning | Code | 7
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | Code | 7
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models | Code | 7
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines | Code | 7
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset | Code | 7
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena | Code | 7
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold | Code | 7
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models | Code | 7
Full Scaling Automation for Sustainable Development of Green Data Centers | Code | 7
EasySpider: A No-Code Visual System for Crawling the Web | Code | 7
Measuring Massive Multitask Chinese Understanding | Code | 7
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models | Code | 7
Low-code LLM: Graphical User Interface over Large Language Models | Code | 7
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models | Code | 7
LLaMA: Open and Efficient Foundation Language Models | Code | 7
Adding Conditional Control to Text-to-Image Diffusion Models | Code | 7
MaskSketch: Unpaired Structure-guided Masked Image Generation | Code | 7
Colossal-Auto: Unified Automation of Parallelization and Activation Checkpoint for Large-scale Models | Code | 7
Domain Expansion of Image Generators | Code | 7
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers | Code | 7
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP | Code | 7
Elixir: Train a Large Language Model on a Small GPU Cluster | Code | 7
Easy Begun is Half Done: Spatial-Temporal Graph Modeling with ST-Curriculum Dropout | Code | 7
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers | Code | 7
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance | Code | 7
AudioLM: a Language Modeling Approach to Audio Generation | Code | 7
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors | Code | 7
Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion | Code | 7
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation | Code | 6
Distributed Inference and Fine-tuning of Large Language Models Over The Internet | Code | 6
SGLang: Efficient Execution of Structured Language Model Programs | Code | 6
Seamless: Multilingual Expressive and Streaming Speech Translation | Code | 6
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding | Code | 6
Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback | Code | 6