SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 36013625 of 661570 papers

TitleStatusHype
Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacousticsCode3
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio RepresentationsCode3
Evolve Cost-aware Acquisition Functions Using Large Language ModelsCode3
SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual ComprehensionCode3
GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian SplattingCode3
Improving Dictionary Learning with Gated Sparse AutoencodersCode3
Retrieval Head Mechanistically Explains Long-Context FactualityCode3
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion ModelsCode3
Taming Diffusion Probabilistic Models for Character ControlCode3
TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian SplattingCode3
SST: Multi-Scale Hybrid Mamba-Transformer Experts for Long-Short Range Time Series ForecastingCode3
FlashSpeech: Efficient Zero-Shot Speech SynthesisCode3
UniMERNet: A Universal Network for Real-World Mathematical Expression RecognitionCode3
ID-Animator: Zero-Shot Identity-Preserving Human Video GenerationCode3
From Matching to Generation: A Survey on Generative Information RetrievalCode3
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-MakingCode3
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of ExpertsCode3
SnapKV: LLM Knows What You are Looking for Before GenerationCode3
SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core FusionCode3
Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent TransformerCode3
A Survey on the Memory Mechanism of Large Language Model based AgentsCode3
DMesh: A Differentiable Mesh RepresentationCode3
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge BasesCode3
On-Demand Earth System Data CubesCode3
AutoScraper: A Progressive Understanding Web Agent for Web Scraper GenerationCode3
Show:102550
← PrevPage 145 of 26463Next →