SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 96019625 of 474278 papers

TitleStatusHype
Probabilistic Contrastive Learning for Long-Tailed Visual RecognitionCode2
LISO: Lidar-only Self-Supervised 3D Object DetectionCode2
The pitfalls of next-token predictionCode2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge EnhancementCode2
MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational PathologyCode2
EarthLoc: Astronaut Photography Localization by Indexing Earth from SpaceCode2
Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language ModelsCode2
CT2Rep: Automated Radiology Report Generation for 3D Medical ImagingCode2
Eliminating Warping Shakes for Unsupervised Online Video StitchingCode2
Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real SystemCode2
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer ReviewsCode2
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-FeedbackCode2
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?Code2
Ant Colony Sampling with GFlowNets for Combinatorial OptimizationCode2
ERA-CoT: Improving Chain-of-Thought through Entity Relationship AnalysisCode2
DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal InconsistencyCode2
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion ModelsCode2
V_kD: Improving Knowledge Distillation using Orthogonal ProjectionsCode2
RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code CompletionCode2
Poly Kernel Inception Network for Remote Sensing DetectionCode2
Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous DrivingCode2
MG-TSD: Multi-Granularity Time Series Diffusion Models with Guided Learning ProcessCode2
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object DetectionCode2
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking TechniquesCode2
S^2IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series ForecastingCode2
Show:102550
← PrevPage 385 of 18972Next →