SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1051–1100 of 659,983 papers

Title | Status | Hype
Uni-Mol Docking V2: Towards Realistic and Accurate Binding Pose Prediction | Code | 5
Showing Many Labels in Multi-label Classification Models: An Empirical Study of Adversarial Examples | Code | 5
IMAGDressing-v1: Customizable Virtual Dressing | Code | 5
ChatDBG: Augmenting Debugging with Large Language Models | Code | 5
Enabling Novel Mission Operations and Interactions with ROSA: The Robot Operating System Agent | Code | 5
RLHF Workflow: From Reward Modeling to Online RLHF | Code | 5
Generating Physically Stable and Buildable LEGO Designs from Text | Code | 5
A Survey on Knowledge Distillation of Large Language Models | Code | 5
Reservoir-enhanced Segment Anything Model for Subsurface Diagnosis | Code | 5
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head | Code | 5
Scalable Pre-training of Large Autoregressive Image Models | Code | 5
ReLoRA: High-Rank Training Through Low-Rank Updates | Code | 5
A Comprehensive Study of Knowledge Editing for Large Language Models | Code | 5
BigDL 2.0: Seamless Scaling of AI Pipelines from Laptops to Distributed Cluster | Code | 5
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention | Code | 5
TaskWeaver: A Code-First Agent Framework | Code | 5
StarVector: Generating Scalable Vector Graphics Code from Images and Text | Code | 5
APISR: Anime Production Inspired Real-World Anime Super-Resolution | Code | 5
Granite Code Models: A Family of Open Foundation Models for Code Intelligence | Code | 5
VGGSfM: Visual Geometry Grounded Deep Structure From Motion | Code | 5
Maia-2: A Unified Model for Human-AI Alignment in Chess | Code | 5
Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities | Code | 5
MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction | Code | 5
OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML | Code | 5
IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems | Code | 5
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass | Code | 5
LIMO: Less is More for Reasoning | Code | 5
The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey | Code | 5
SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning | Code | 5
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions | Code | 5
Fake News Detection: It's All in the Data! | Code | 5
The BrowserGym Ecosystem for Web Agent Research | Code | 5
SCBench: A KV Cache-Centric Analysis of Long-Context Methods | Code | 5
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Code | 5
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Code | 5
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding | Code | 5
Can Foundation Models Wrangle Your Data? | Code | 5
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer | Code | 5
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid | Code | 5
Tora: Trajectory-oriented Diffusion Transformer for Video Generation | Code | 5
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation | Code | 5
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Code | 5
SuperAnimal pretrained pose estimation models for behavioral analysis | Code | 5
Visual Identification of Problematic Bias in Large Label Spaces | Code | 5
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models | Code | 5
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers | Code | 5
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research | Code | 5
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning | Code | 5
FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Code | 5
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding | Code | 5
Page 22 of 13,200