SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 10511075 of 659983 papers

TitleStatusHype
Off-Policy Primal-Dual Safe Reinforcement LearningCode5
WebVoyager: Building an End-to-End Web Agent with Large Multimodal ModelsCode5
SpeechGPT-Gen: Scaling Chain-of-Information Speech GenerationCode5
Differentiable Tree Search NetworkCode5
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMsCode5
Large Language Model based Multi-Agents: A Survey of Progress and ChallengesCode5
OMG-Seg: Is One Model Good Enough For All Segmentation?Code5
Scalable Pre-training of Large Autoregressive Image ModelsCode5
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant TransformersCode5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait SynthesisCode5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative DecodingCode5
Secrets of RLHF in Large Language Models Part II: Reward ModelingCode5
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language ModelsCode5
Extreme Compression of Large Language Models via Additive QuantizationCode5
Personal LLM Agents: Insights and Survey about the Capability, Efficiency and SecurityCode5
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and ProspectsCode5
Segment Anything Model for Medical Image Segmentation: Current Applications and Future DirectionsCode5
Latte: Latent Diffusion Transformer for Video GenerationCode5
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes InteractivelyCode5
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian SplattingCode5
A Comprehensive Study of Knowledge Editing for Large Language ModelsCode5
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language ModelsCode5
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image RecognitionCode5
Astraios: Parameter-Efficient Instruction Tuning Code Large Language ModelsCode5
Point Transformer V3: Simpler Faster StrongerCode5
Show:102550
← PrevPage 43 of 26400Next →