SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 27262750 of 177340 papers

TitleStatusHype
4D Panoptic Scene Graph GenerationCode3
Logit Standardization in Knowledge DistillationCode3
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile MethodologyCode3
Harnessing Temporal Causality for Advanced Temporal Action DetectionCode3
Simple and Effective Relation-based Embedding Propagation for Knowledge Representation LearningCode3
DifFace: Blind Face Restoration with Diffused Error ContractionCode3
Degradation-Guided One-Step Image Super-Resolution with Diffusion PriorsCode3
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge BasesCode3
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem SolvingCode3
Unifying Vision, Text, and Layout for Universal Document ProcessingCode3
LongBench: A Bilingual, Multitask Benchmark for Long Context UnderstandingCode3
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object DetectionCode3
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context MemoryCode3
Scalable Bayesian Learning with posteriorsCode3
PureForest: A Large-Scale Aerial Lidar and Aerial Imagery Dataset for Tree Species Classification in Monospecific ForestsCode3
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement LearningCode3
AutoGluon-Tabular: Robust and Accurate AutoML for Structured DataCode3
Towards Seamless Adaptation of Pre-trained Models for Visual Place RecognitionCode3
A Survey of Resource-efficient LLM and Multimodal Foundation ModelsCode3
TSLANet: Rethinking Transformers for Time Series Representation LearningCode3
Intuitive physics understanding emerges from self-supervised pretraining on natural videosCode3
Video Diffusion Alignment via Reward GradientsCode3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent SystemsCode3
Don't fear the unlabelled: safe semi-supervised learning via simple debiasingCode3
LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance PropagationCode3
Show:102550
← PrevPage 110 of 7094Next →