SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 60266050 of 474278 papers

TitleStatusHype
ArtGS: Building Interactable Replicas of Complex Articulated Objects via Gaussian SplattingCode2
NeoBERT: A Next-Generation BERTCode2
Medical Hallucinations in Foundation Models and Their Impact on HealthcareCode2
RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-ThoughtsCode2
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human PreferenceCode2
LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented SearchersCode2
WebGames: Challenging General-Purpose Web-Browsing AI AgentsCode2
Rank1: Test-Time Compute for Reranking in Information RetrievalCode2
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision SupportCode2
SPECTRE: An FFT-Based Efficient Drop-In Replacement to Self-Attention for Long ContextsCode2
Benchmarking Retrieval-Augmented Generation in Multi-Modal ContextsCode2
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization AlignmentCode2
The GigaMIDI Dataset with Features for Expressive Music Performance DetectionCode2
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future DirectionsCode2
Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical SystemsCode2
Delta Decompression for MoE-based LLMs CompressionCode2
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language ModelsCode2
Introducing Visual Perception Token into Multimodal Large Language ModelCode2
PointSea: Point Cloud Completion via Self-structure AugmentationCode2
MegaLoc: One Retrieval to Place Them AllCode2
LongSpec: Long-Context Speculative Decoding with Efficient Drafting and VerificationCode2
A Survey on Industrial Anomalies SynthesisCode2
FreeTumor: Large-Scale Generative Tumor Synthesis in Computed Tomography Images for Improving Tumor RecognitionCode2
Audio-FLAN: A Preliminary ReleaseCode2
SalM2: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver AttentionCode2
Show:102550
← PrevPage 242 of 18972Next →