SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 461470 of 659983 papers

TitleStatusHype
MiniCheck: Efficient Fact-Checking of LLMs on Grounding DocumentsCode7
Long-form music generation with latent diffusionCode7
Interactive Prompt Debugging with Sequence SalienceCode7
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer EnvironmentsCode7
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction ModelsCode7
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language ModelsCode7
AutoCodeRover: Autonomous Program ImprovementCode7
Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization ApproachCode7
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image GenerationCode7
Mini-Gemini: Mining the Potential of Multi-modality Vision Language ModelsCode7
Show:102550
← PrevPage 47 of 65999Next →