SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 76767700 of 474278 papers

TitleStatusHype
deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural NetworksCode2
ConFIG: Towards Conflict-free Training of Physics Informed Neural NetworksCode2
PartGS:Learning Part-aware 3D Representations by Fusing 2D Gaussians and SuperquadricsCode2
FLAME: Learning to Navigate with Multimodal LLM in Urban EnvironmentsCode2
BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language ModelCode2
LegalBench-RAG: A Benchmark for Retrieval-Augmented Generation in the Legal DomainCode2
TraDiffusion: Trajectory-Based Training-Free Image GenerationCode2
C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake DetectionCode2
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image UnderstandingCode2
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short DramaCode2
An Open-Source American Sign Language Fingerspell Recognition and Semantic Pose Retrieval InterfaceCode2
Segment Anything with Multiple ModalitiesCode2
Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian SplattingCode2
TC-RAG:Turing-Complete RAG's Case study on Medical LLM SystemsCode2
Selective Prompt Anchoring for Code GenerationCode2
EasyRec: Simple yet Effective Language Models for RecommendationCode2
Accelerating Giant Impact Simulations with Machine LearningCode2
MIA-Tuner: Adapting Large Language Models as Pre-training Text DetectorCode2
PCP-MAE: Learning to Predict Centers for Point Masked AutoencodersCode2
OpenCity: Open Spatio-Temporal Foundation Models for Traffic PredictionCode2
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event DetectionCode2
Efficient Autoregressive Audio Modeling via Next-Scale PredictionCode2
ECG-Chat: A Large ECG-Language Model for Cardiac Disease DiagnosisCode2
A Survey on Benchmarks of Multimodal Large Language ModelsCode2
xGen-MM (BLIP-3): A Family of Open Large Multimodal ModelsCode2
Show:102550
← PrevPage 308 of 18972Next →