SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3676-3700 of 177,340 papers

Title | Status | Hype
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization | Code | 3
Language-Codec: Bridging Discrete Codec Representations and Speech Language Models | Code | 3
ROLAND: Graph Learning Framework for Dynamic Graphs | Code | 3
DiC: Rethinking Conv3x3 Designs in Diffusion Models | Code | 3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Code | 3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Code | 3
MotionGPT: Human Motion as a Foreign Language | Code | 3
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly | Code | 3
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation | Code | 3
Efficient Agent Training for Computer Use | Code | 3
Agent Workflow Memory | Code | 3
LaViDa: A Large Diffusion Language Model for Multimodal Understanding | Code | 3
Aquila2 Technical Report | Code | 3
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning | Code | 3
DUFOMap: Efficient Dynamic Awareness Mapping | Code | 3
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | Code | 3
UnMarker: A Universal Attack on Defensive Image Watermarking | Code | 3
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models | Code | 3
PaliGemma 2: A Family of Versatile VLMs for Transfer | Code | 3
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents | Code | 3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Code | 3
Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing | Code | 3
StableIdentity: Inserting Anybody into Anywhere at First Sight | Code | 3
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences | Code | 3
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation | Code | 3
Page 148 of 7094