SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 3176–3200 of 661,570 papers

Title | Status | Hype
Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-Task | Code | 3
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching | Code | 3
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives | Code | 3
Attention Heads of Large Language Models: A Survey | Code | 3
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture | Code | 3
LinFusion: 1 GPU, 1 Minute, 16K Image | Code | 3
EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video | Code | 3
Affordance-based Robot Manipulation with Flow Matching | Code | 3
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems | Code | 3
ContextCite: Attributing Model Generation to Context | Code | 3
TinyAgent: Function Calling at the Edge | Code | 3
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification | Code | 3
VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Code | 3
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model | Code | 3
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners | Code | 3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Code | 3
LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance Propagation | Code | 3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Code | 3
OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Code | 3
The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Code | 3
A Survey of Camouflaged Object Detection and Beyond | Code | 3
Foundation Models for Music: A Survey | Code | 3
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Code | 3
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Code | 3
Recent Event Camera Innovations: A Survey | Code | 3