SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 29763000 of 177340 papers

TitleStatusHype
An LLM Compiler for Parallel Function CallingCode3
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAMCode3
General Object Foundation Model for Images and Videos at ScaleCode3
DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic ModelsCode3
Generative Multimodal Models are In-Context LearnersCode3
Attention is not not ExplanationCode3
Evaluating Language Model Agency through NegotiationsCode3
DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge DetectionCode3
Pheme: Efficient and Conversational Speech GenerationCode3
Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual ModelsCode3
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web TasksCode3
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-DesignCode3
SliceGPT: Compress Large Language Models by Deleting Rows and ColumnsCode3
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text SegmentationCode3
LongAlign: A Recipe for Long Context Alignment of Large Language ModelsCode3
Noise Contrastive Alignment of Language Models with Explicit RewardsCode3
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian SplattingCode3
Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series ForecastingCode3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language ModelsCode3
Magic-Me: Identity-Specific Video Customized DiffusionCode3
BitDelta: Your Fine-Tune May Only Be Worth One BitCode3
QuRating: Selecting High-Quality Data for Training Language ModelsCode3
LLMDFA: Analyzing Dataflow in Code with Large Language ModelsCode3
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-PositiveCode3
Codec-SUPERB: An In-Depth Analysis of Sound Codec ModelsCode3
Show:102550
← PrevPage 120 of 7094Next →