SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 2701-2750 of 177339 papers

Title | Status | Hype
Strassen Multisystolic Array Hardware Architectures | Code | 3
CoverM: Read alignment statistics for metagenomics | Code | 3
OneForecast: A Universal Framework for Global and Regional Weather Forecasting | Code | 3
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward | Code | 3
Improved 3D Point-Line Mapping Regression for Camera Relocalization | Code | 3
3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities | Code | 3
Delay-penalized transducer for low-latency streaming ASR | Code | 3
Intervention-Aware Forecasting: Breaking Historical Limits from a System Perspective | Code | 3
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis | Code | 3
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation | Code | 3
Deep Learning for Protein-Ligand Docking: Are We There Yet? | Code | 3
Autoregressive Image Generation using Residual Quantization | Code | 3
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression | Code | 3
AutoSurvey: Large Language Models Can Automatically Write Surveys | Code | 3
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion | Code | 3
A Survey on Large Language Model Acceleration based on KV Cache Management | Code | 3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Code | 3
Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction | Code | 3
CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction | Code | 3
Alignment of Diffusion Models: Fundamentals, Challenges, and Future | Code | 3
Learning with 3D rotations, a hitchhiker's guide to SO(3) | Code | 3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications | Code | 3
Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names | Code | 3
4D Panoptic Scene Graph Generation | Code | 3
Logit Standardization in Knowledge Distillation | Code | 3
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology | Code | 3
Harnessing Temporal Causality for Advanced Temporal Action Detection | Code | 3
Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning | Code | 3
DifFace: Blind Face Restoration with Diffused Error Contraction | Code | 3
Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Code | 3
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases | Code | 3
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving | Code | 3
Unifying Vision, Text, and Layout for Universal Document Processing | Code | 3
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding | Code | 3
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Code | 3
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory | Code | 3
Scalable Bayesian Learning with posteriors | Code | 3
PureForest: A Large-Scale Aerial Lidar and Aerial Imagery Dataset for Tree Species Classification in Monospecific Forests | Code | 3
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning | Code | 3
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data | Code | 3
Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition | Code | 3
A Survey of Resource-efficient LLM and Multimodal Foundation Models | Code | 3
TSLANet: Rethinking Transformers for Time Series Representation Learning | Code | 3
Intuitive physics understanding emerges from self-supervised pretraining on natural videos | Code | 3
Video Diffusion Alignment via Reward Gradients | Code | 3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Code | 3
Don't fear the unlabelled: safe semi-supervised learning via simple debiasing | Code | 3
LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance Propagation | Code | 3
Evaluating Large Language Models Trained on Code | Code | 3
Learning Inclusion Matching for Animation Paint Bucket Colorization | Code | 3
Page 55 of 3547