SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 92019225 of 177340 papers

TitleStatusHype
AdapterFusion: Non-Destructive Task Composition for Transfer LearningCode2
Language Models Can Improve Event Prediction by Few-Shot Abductive ReasoningCode2
DeepRobust: A PyTorch Library for Adversarial Attacks and DefensesCode2
Transformers as Policies for Variable Action EnvironmentsCode2
Simulation-Based Inference for Global Health DecisionsCode2
Implementation of UAV Coordination Based on a Hierarchical Multi-UAV Simulation PlatformCode2
Practical Continual Forgetting for Pre-trained Vision ModelsCode2
TabDPT: Scaling Tabular Foundation ModelsCode2
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
Learning to Compose Dynamic Tree Structures for Visual ContextsCode2
Generative Sparse Detection Networks for 3D Single-shot Object DetectionCode2
HelloBench: Evaluating Long Text Generation Capabilities of Large Language ModelsCode2
Generative causal testing to bridge data-driven models and scientific theories in language neuroscienceCode2
Few-shot Knowledge Transfer for Fine-grained Cartoon Face GenerationCode2
Simplifying Object Segmentation with PixelLib LibraryCode2
Boundary-Aware Segmentation Network for Mobile and Web ApplicationsCode2
Poisoning Attacks against Recommender Systems: A SurveyCode2
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and ToxicityCode2
Interpreting and Editing Vision-Language Representations to Mitigate HallucinationsCode2
MBRL-Lib: A Modular Library for Model-based Reinforcement LearningCode2
VideoRAG: Retrieval-Augmented Generation over Video CorpusCode2
Fast Transformers with Clustered AttentionCode2
Language Model CascadesCode2
TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement LearningCode2
Non-Metric Space Library ManualCode2
Show:102550
← PrevPage 369 of 7094Next →