SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 29763000 of 661570 papers

TitleStatusHype
FlipSketch: Flipping Static Drawings to Text-Guided Sketch AnimationsCode3
WavChat: A Survey of Spoken Dialogue ModelsCode3
Model Inversion Attacks: A Survey of Approaches and CountermeasuresCode3
Caravan MultiMet: Extending Caravan with Multiple Weather Nowcasts and ForecastsCode3
Jailbreak Attacks and Defenses against Multimodal Generative Models: A SurveyCode3
InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse AutoencodersCode3
CameraHMR: Aligning People with PerspectiveCode3
MureObjectStitch: Multi-reference Image CompositionCode3
The Surprising Effectiveness of Test-Time Training for Few-Shot LearningCode3
General Geospatial Inference with a Population Dynamics Foundation ModelCode3
SplatFormer: Point Transformer for Robust 3D Gaussian SplattingCode3
Game-theoretic LLM: Agent Workflow for Negotiation GamesCode3
Effects of charging and discharging capabilities on trade-offs between model accuracy and computational efficiency in pumped thermal electricity storageCode3
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse ViewsCode3
ZipNN: Lossless Compression for AI ModelsCode3
DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile ManipulationCode3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI ApplicationsCode3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentCode3
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG SystemsCode3
Classification Done Right for Vision-Language Pre-TrainingCode3
ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal RateCode3
Drone Data Analytics for Measuring Traffic Metrics at Intersections in High-Density AreasCode3
Addressing Representation Collapse in Vector Quantized Models with One Linear LayerCode3
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and TrustworthinessCode3
Digitizing Touch with an Artificial Multimodal FingertipCode3
Show:102550
← PrevPage 120 of 26463Next →