SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 17511775 of 661570 papers

TitleStatusHype
NeMo-Aligner: Scalable Toolkit for Efficient Model AlignmentCode4
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual ReasoningCode4
Self-Play Preference Optimization for Language Model AlignmentCode4
RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow EstimationCode4
A Survey on Diffusion Models for Time Series and Spatio-Temporal DataCode4
Visual Mamba: A Survey and New OutlooksCode4
Hallucination of Multimodal Large Language Models: A SurveyCode4
Mamba-FETrack: Frame-Event Tracking via State Space ModelCode4
MovieChat+: Question-aware Sparse Memory for Long Video Question AnsweringCode4
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense CaptioningCode4
Continual Learning of Large Language Models: A Comprehensive SurveyCode4
A Survey on Visual MambaCode4
Autonomous LLM-driven research from data to human-verifiable research papersCode4
FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient DescentCode4
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and GenerationCode4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language ModelsCode4
StyleBooth: Image Style Editing with Multimodal InstructionCode4
AgentKit: Structured LLM Reasoning with Dynamic GraphsCode4
State Space Model for New-Generation Network Alternative to Transformers: A SurveyCode4
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language ModelsCode4
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context LengthCode4
JetMoE: Reaching Llama2 Performance with 0.1M DollarsCode4
ControlNet++: Improving Conditional Controls with Efficient Consistency FeedbackCode4
RecurrentGemma: Moving Past Transformers for Efficient Open Language ModelsCode4
A Foundation Model for Zero-shot Logical Query ReasoningCode4
Show:102550
← PrevPage 71 of 26463Next →