SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 6176–6200 of 474,278 papers

Title | Status | Hype
VABench: A Comprehensive Benchmark for Audio-Video Generation | | 0
H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos | | 0
Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs | | 0
Closing the Train-Test Gap in World Models for Gradient-Based Planning | | 0
AGORA: Adversarial Generation Of Real-time Animatable 3D Gaussian Head Avatars | | 0
Interpretable Embeddings with Sparse Autoencoders: A Data Analysis Toolkit | | 0
Push Smarter, Not Harder: Hierarchical RL-Diffusion Policy for Efficient Nonprehensile Manipulation | Code | 0
ARE: Scaling Up Agent Environments and Evaluations | | 0
TeleEgo: Benchmarking Egocentric AI Assistants in the Wild | | 0
Attention Sinks in Diffusion Language Models | | 0
VFM-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images | Code | 0
Deep Edge Filter: Return of the Human-Crafted Layer in Deep Learning | Code | 0
Adaptive Gradient Calibration for Single-Positive Multi-Label Learning in Remote Sensing Image Scene Classification | Code | 0
Dual Refinement Cycle Learning: Unsupervised Text Classification of Mamba and Community Detection on Text Attributed Graph | Code | 0
GLACIA: Instance-Aware Positional Reasoning for Glacial Lake Segmentation via Multimodal Large Language Model | Code | 0
Contrastive Learning for Semi-Supervised Deep Regression with Generalized Ordinal Rankings from Spectral Seriation | Code | 0
MelanomaNet: Explainable Deep Learning for Skin Lesion Classification | Code | 0
Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis | Code | 0
NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization | Code | 0
Local LLM Ensembles for Zero-shot Portuguese Named Entity Recognition | Code | 0
LxCIM: a new rank-based binary classifier performance metric invariant to local exchange of classes | Code | 0
Visual Heading Prediction for Autonomous Aerial Vehicles | Code | 0
Rethinking Chain-of-Thought Reasoning for Videos | Code | 0
Bring Your Dreams to Life: Continual Text-to-Video Customization | Code | 0
Decoupling Template Bias in CLIP: Harnessing Empty Prompts for Enhanced Few-Shot Learning | Code | 0
Page 248 of 18972