SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 19261950 of 177339 papers

TitleStatusHype
3D Scene Generation: A SurveyCode4
LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN proverCode4
Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning PerformanceCode4
AgentBench: Evaluating LLMs as AgentsCode4
Semantic-SAM: Segment and Recognize Anything at Any GranularityCode4
4D Gaussian Splatting for Real-Time Dynamic Scene RenderingCode4
InstanceDiffusion: Instance-level Control for Image GenerationCode4
Depth Any Video with Scalable Synthetic DataCode4
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic DataCode4
Quality-aware Masked Diffusion Transformer for Enhanced Music GenerationCode4
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D DetectionCode4
Simple and Effective Masked Diffusion Language ModelsCode4
Sample-Efficient Alignment for LLMsCode4
PVUW 2024 Challenge on Complex Video Understanding: Methods and ResultsCode4
SeeSR: Towards Semantics-Aware Real-World Image Super-ResolutionCode4
Sparse Tensor-based Point Cloud Attribute CompressionCode4
WavCraft: Audio Editing and Generation with Large Language ModelsCode4
Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and SoundCode4
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality CollaborationCode4
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat DataCode4
Video Seal: Open and Efficient Video WatermarkingCode4
MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh TokenizationCode4
TimeGPT-1Code4
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language ModelsCode4
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual ModelsCode4
Show:102550
← PrevPage 78 of 7094Next →