SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 10511100 of 659983 papers

TitleStatusHype
WebVoyager: Building an End-to-End Web Agent with Large Multimodal ModelsCode5
SpeechGPT-Gen: Scaling Chain-of-Information Speech GenerationCode5
Differentiable Tree Search NetworkCode5
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMsCode5
Large Language Model based Multi-Agents: A Survey of Progress and ChallengesCode5
OMG-Seg: Is One Model Good Enough For All Segmentation?Code5
Scalable Pre-training of Large Autoregressive Image ModelsCode5
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant TransformersCode5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait SynthesisCode5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative DecodingCode5
Secrets of RLHF in Large Language Models Part II: Reward ModelingCode5
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language ModelsCode5
Extreme Compression of Large Language Models via Additive QuantizationCode5
Personal LLM Agents: Insights and Survey about the Capability, Efficiency and SecurityCode5
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and ProspectsCode5
Segment Anything Model for Medical Image Segmentation: Current Applications and Future DirectionsCode5
Latte: Latent Diffusion Transformer for Video GenerationCode5
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes InteractivelyCode5
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian SplattingCode5
A Comprehensive Study of Knowledge Editing for Large Language ModelsCode5
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language ModelsCode5
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image RecognitionCode5
Astraios: Parameter-Efficient Instruction Tuning Code Large Language ModelsCode5
Point Transformer V3: Simpler Faster StrongerCode5
VGGSfM: Visual Geometry Grounded Deep Structure From MotionCode5
Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar ModelingCode5
GenCast: Diffusion-based ensemble forecasting for medium-range weatherCode5
DUSt3R: Geometric 3D Vision Made EasyCode5
AppAgent: Multimodal Agents as Smartphone UsersCode5
StarVector: Generating Scalable Vector Graphics Code from Images and TextCode5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPUCode5
MobileSAMv2: Faster Segment Anything to EverythingCode5
CogAgent: A Visual Language Model for GUI AgentsCode5
Weakly Supervised Detection of Hallucinations in LLM ActivationsCode5
TaskWeaver: A Code-First Agent FrameworkCode5
Human Gaussian Splatting: Real-time Rendering of Animatable AvatarsCode5
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in MedicineCode5
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction FollowingCode5
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGICode5
Structure-Aware Sparse-View X-ray 3D ReconstructionCode5
Instruction-Following Evaluation for Large Language ModelsCode5
LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language ModelsCode5
CogVLM: Visual Expert for Pretrained Language ModelsCode5
VideoCrafter1: Open Diffusion Models for High-Quality Video GenerationCode5
Zephyr: Direct Distillation of LM AlignmentCode5
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft ReasoningCode5
Wonder3D: Single Image to 3D using Cross-Domain DiffusionCode5
NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable RailsCode5
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model ServingCode5
Ferret: Refer and Ground Anything Anywhere at Any GranularityCode5
Show:102550
← PrevPage 22 of 13200Next →