SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 72767300 of 177340 papers

TitleStatusHype
PyTorch FSDP: Experiences on Scaling Fully Sharded Data ParallelCode2
MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language ModelsCode2
Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI ExpertsCode2
Contrastive Search Is What You Need For Neural Text GenerationCode2
MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object DetectorsCode2
Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior KnowledgeCode2
Open World Scene Graph Generation using Vision Language ModelsCode2
Exposure Bracketing Is All You Need For A High-Quality ImageCode2
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary SegmentationCode2
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation DataCode2
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language BenchmarkCode2
An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language ModelsCode2
zkLLM: Zero Knowledge Proofs for Large Language ModelsCode2
FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing ModelCode2
X^2-VLM: All-In-One Pre-trained Model For Vision-Language TasksCode2
Git-Theta: A Git Extension for Collaborative Development of Machine Learning ModelsCode2
Starting From Non-Parametric Networks for 3D Point Cloud AnalysisCode2
Foundational Large Language Models for Materials ResearchCode2
Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer VisionCode2
AdaParse: An Adaptive Parallel PDF Parsing and Resource Scaling EngineCode2
Re3: Generating Longer Stories With Recursive Reprompting and RevisionCode2
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point CloudsCode2
CNMBERT: A Model for Converting Hanyu Pinyin Abbreviations to Chinese CharactersCode2
Can Graph Learning Improve Planning in LLM-based Agents?Code2
Universal Segmentation at Arbitrary Granularity with Language InstructionCode2
Show:102550
← PrevPage 292 of 7094Next →