SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 19762000 of 661570 papers

TitleStatusHype
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2Code4
Video-LLaVA: Learning United Visual Representation by Alignment Before ProjectionCode4
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for CodeCode4
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language ModelsCode4
LCM-LoRA: A Universal Stable-Diffusion Acceleration ModuleCode4
Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINOCode4
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality CollaborationCode4
Rephrase and Respond: Let Large Language Models Ask Better Questions for ThemselvesCode4
OtterHD: A High-Resolution Multi-modality ModelCode4
AnyText: Multilingual Visual Text Generation And EditingCode4
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo LabellingCode4
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and EditingCode4
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorchCode4
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion PriorCode4
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking CompetitionCode4
Zero123++: a Single Image to Consistent Multi-view Diffusion Base ModelCode4
Open-Set Image Tagging with Multi-Grained Text SupervisionCode4
Habitat 3.0: A Co-Habitat for Humans, Avatars and RobotsCode4
Eureka: Human-Level Reward Design via Coding Large Language ModelsCode4
DynamiCrafter: Animating Open-domain Images with Video Diffusion PriorsCode4
A General Theoretical Paradigm to Understand Learning from Human PreferencesCode4
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-ReflectionCode4
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4VCode4
OpenAgents: An Open Platform for Language Agents in the WildCode4
A Survey on Video Diffusion ModelsCode4
Show:102550
← PrevPage 80 of 26463Next →