SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,322 code links4,818 tasks

Papers

Showing 29513000 of 177339 papers

TitleStatusHype
FateZero: Fusing Attentions for Zero-shot Text-based Video EditingCode3
A Comprehensive Survey on Test-Time Adaptation under Distribution ShiftsCode3
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free VideosCode3
Prompting with Pseudo-Code InstructionsCode3
Hierarchical Prompting Assists Large Language Model on Web NavigationCode3
Taming 3DGS: High-Quality Radiance Fields with Limited ResourcesCode3
Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputsCode3
GlyphNet: Homoglyph domains dataset and detection using attention-based Convolutional Neural NetworksCode3
Segment Anything Meets Point TrackingCode3
EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG SignalsCode3
WebArena: A Realistic Web Environment for Building Autonomous AgentsCode3
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized StylizationCode3
nanoT5: A PyTorch Framework for Pre-training and Fine-tuning T5-style Models with Limited ResourcesCode3
LSNet: See Large, Focus SmallCode3
Sparse Autoencoders Find Highly Interpretable Features in Language ModelsCode3
FreeU: Free Lunch in Diffusion U-NetCode3
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker RecognitionCode3
AutoAgents: A Framework for Automatic Agent GenerationCode3
Lag-Llama: Towards Foundation Models for Probabilistic Time Series ForecastingCode3
Putting the Object Back into Video Object SegmentationCode3
Skywork: A More Open Bilingual Foundation ModelCode3
PixelFlow: Pixel-Space Generative Models with FlowCode3
Class Symbolic Regression: Gotta Fit 'Em AllCode3
An LLM Compiler for Parallel Function CallingCode3
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAMCode3
General Object Foundation Model for Images and Videos at ScaleCode3
DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic ModelsCode3
Generative Multimodal Models are In-Context LearnersCode3
Attention is not not ExplanationCode3
Evaluating Language Model Agency through NegotiationsCode3
DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge DetectionCode3
Pheme: Efficient and Conversational Speech GenerationCode3
Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual ModelsCode3
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web TasksCode3
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-DesignCode3
SliceGPT: Compress Large Language Models by Deleting Rows and ColumnsCode3
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text SegmentationCode3
LongAlign: A Recipe for Long Context Alignment of Large Language ModelsCode3
Noise Contrastive Alignment of Language Models with Explicit RewardsCode3
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian SplattingCode3
Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series ForecastingCode3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language ModelsCode3
Magic-Me: Identity-Specific Video Customized DiffusionCode3
BitDelta: Your Fine-Tune May Only Be Worth One BitCode3
QuRating: Selecting High-Quality Data for Training Language ModelsCode3
LLMDFA: Analyzing Dataflow in Code with Large Language ModelsCode3
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-PositiveCode3
Codec-SUPERB: An In-Depth Analysis of Sound Codec ModelsCode3
Towards Building Multilingual Language Model for MedicineCode3
ChatMusician: Understanding and Generating Music Intrinsically with LLMCode3
Show:102550
← PrevPage 60 of 3547Next →