SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 451-475 of 659,983 papers

Title | Status | Hype
MambaOut: Do We Really Need Mamba for Vision? | Code | 7
AIOS Compiler: LLM as Interpreter for Natural Language Programming and Flow Programming of AI Agents | Code | 7
Mirage: A Multi-Level Superoptimizer for Tensor Programs | Code | 7
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers | Code | 7
xLSTM: Extended Long Short-Term Memory | Code | 7
Labeling supervised fine-tuning data with the scaling law | Code | 7
PuLID: Pure and Lightning ID Customization via Contrastive Alignment | Code | 7
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration | Code | 7
Better Synthetic Data by Retrieving and Transforming Existing Datasets | Code | 7
CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models | Code | 7
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Code | 7
Long-form music generation with latent diffusion | Code | 7
Interactive Prompt Debugging with Sequence Salience | Code | 7
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments | Code | 7
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models | Code | 7
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models | Code | 7
AutoCodeRover: Autonomous Program Improvement | Code | 7
Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization Approach | Code | 7
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation | Code | 7
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Code | 7
2D Gaussian Splatting for Geometrically Accurate Radiance Fields | Code | 7
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Code | 7
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Code | 7
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | Code | 7
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance | Code | 7