SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 94019425 of 177340 papers

TitleStatusHype
EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health RecordsCode2
Multi-Modal Self-Supervised Learning for RecommendationCode2
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural CalibrationCode2
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene UnderstandingCode2
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code GenerationCode2
MoA: Mixture of Sparse Attention for Automatic Large Language Model CompressionCode2
iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvementCode2
DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text SpottingCode2
Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Bridge Diffusion ModelCode2
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest TransformerCode2
Exposing the Deception: Uncovering More Forgery Clues for Deepfake DetectionCode2
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation MasksCode2
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified PerspectiveCode2
LegalBench-RAG: A Benchmark for Retrieval-Augmented Generation in the Legal DomainCode2
ZoomNAS: Searching for Whole-body Human Pose Estimation in the WildCode2
UniFormer: Unifying Convolution and Self-attention for Visual RecognitionCode2
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent EvaluationCode2
Large-scale Multi-Modal Pre-trained Models: A Comprehensive SurveyCode2
Unsupervised Continual Anomaly Detection with Contrastively-learned PromptCode2
ChatIE: Zero-Shot Information Extraction via Chatting with ChatGPTCode2
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language ModelsCode2
Habitat 2.0: Training Home Assistants to Rearrange their HabitatCode2
UnlearnCanvas: Stylized Image Dataset for Enhanced Machine Unlearning Evaluation in Diffusion ModelsCode2
Real-time 3D-aware Portrait Video RelightingCode2
2.5 Years in Class: A Multimodal Textbook for Vision-Language PretrainingCode2
Show:102550
← PrevPage 377 of 7094Next →