SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 876–900 of 177,339 papers

Title | Status | Hype
The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey | Code | 5
SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning | Code | 5
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions | Code | 5
Fake News Detection: It's All in the Data! | Code | 5
The BrowserGym Ecosystem for Web Agent Research | Code | 5
SCBench: A KV Cache-Centric Analysis of Long-Context Methods | Code | 5
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Code | 5
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Code | 5
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding | Code | 5
Can Foundation Models Wrangle Your Data? | Code | 5
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer | Code | 5
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid | Code | 5
Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Code | 5
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation | Code | 5
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Code | 5
SuperAnimal pretrained pose estimation models for behavioral analysis | Code | 5
Visual Identification of Problematic Bias in Large Label Spaces | Code | 5
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models | Code | 5
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers | Code | 5
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research | Code | 5
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning | Code | 5
FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Code | 5
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding | Code | 5
MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data | Code | 5
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention | Code | 5