SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 87018725 of 177340 papers

TitleStatusHype
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making AgentsCode2
With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text GenerationCode2
Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic RoomsCode2
SGTR+: End-to-end Scene Graph Generation with TransformerCode2
Neural deformation fields for template-based reconstruction of cortical surfaces from MRICode2
Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop TechnologiesCode2
Tyche: Stochastic In-Context Learning for Medical Image SegmentationCode2
ADMap: Anti-disturbance framework for reconstructing online vectorized HD mapCode2
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other ModalitiesCode2
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language ModelsCode2
SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion NetworksCode2
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic TasksCode2
Diffusion Facial Forgery DetectionCode2
FaKnow: A Unified Library for Fake News DetectionCode2
Weak-to-Strong Jailbreaking on Large Language ModelsCode2
On Prompt-Driven Safeguarding for Large Language ModelsCode2
ControlCap: Controllable Region-level CaptioningCode2
On the Challenges of Fuzzing Techniques via Large Language ModelsCode2
InfMAE: A Foundation Model in the Infrared ModalityCode2
A Single Simple Patch is All You Need for AI-generated Image DetectionCode2
Efficient and Effective Time-Series Forecasting with Spiking Neural NetworksCode2
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal InstructionsCode2
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot LearningCode2
Self-Supervised Contrastive Learning for Long-term ForecastingCode2
4D-Rotor Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic ScenesCode2
Show:102550
← PrevPage 349 of 7094Next →