SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 62516275 of 474278 papers

TitleStatusHype
On the Feasibility of Using LLMs to Autonomously Execute Multi-host Network AttacksCode2
LUCY: Linguistic Understanding and Control Yielding Early Stage of HerCode2
LLM-powered Multi-agent Framework for Goal-oriented Learning in Intelligent Tutoring SystemCode2
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware SparsityCode2
Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-ResolutionCode2
TopoNets: High Performing Vision and Language Models with Brain-Like TopographyCode2
MM-Retinal V2: Transfer an Elite Knowledge Spark into Fundus Vision-Language PretrainingCode2
Visual Generation Without GuidanceCode2
Universal Image Restoration Pre-training via Degradation ClassificationCode2
Baichuan-Omni-1.5 Technical ReportCode2
iFormer: Integrating ConvNet and Transformer for Mobile ApplicationCode2
GaussianToken: An Effective Image Tokenizer with 2D Gaussian SplattingCode2
TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video UnderstandingCode2
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement LearningCode2
Uni-Sign: Toward Unified Sign Language Understanding at ScaleCode2
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language ModelsCode2
Deeply Optimizing the SAT Solver for the IC3 AlgorithmCode2
STAMP: Scalable Task And Model-agnostic Collaborative PerceptionCode2
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image UnderstandingCode2
Advancing MRI Reconstruction: A Systematic Review of Deep Learning and Compressed Sensing IntegrationCode2
Bayesian Neural Networks for One-to-Many Mapping in Image EnhancementCode2
VideoShield: Regulating Diffusion-based Video Generation Models via WatermarkingCode2
Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy VideoCode2
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge GraphCode2
Spurious Forgetting in Continual Learning of Language ModelsCode2
Show:102550
← PrevPage 251 of 18972Next →