SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 30513075 of 661570 papers

TitleStatusHype
SoftVQ-VAE: Efficient 1-Dimensional Continuous TokenizerCode3
Automating the Search for Artificial Life with Foundation ModelsCode3
RadGPT: Constructing 3D Image-Text Tumor DatasetsCode3
Hierarchical Lexical Graph for Enhanced Multi-Hop RetrievalCode3
Generalized Trajectory Scoring for End-to-end Multimodal PlanningCode3
General-Reasoner: Advancing LLM Reasoning Across All DomainsCode3
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation ModelsCode3
SNR-Aware Low-Light Image EnhancementCode3
Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language ModelsCode3
Scaling up Masked Diffusion Models on TextCode3
A Phylogenetic Approach to Genomic Language ModelingCode3
SplatFormer: Point Transformer for Robust 3D Gaussian SplattingCode3
Automated Hypothesis Validation with Agentic Sequential FalsificationsCode3
BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse ScenesCode3
The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic SpeechCode3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video UnderstandingCode3
AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language ModelsCode3
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation BenchmarkCode3
MP-SfM: Monocular Surface Priors for Robust Structure-from-MotionCode3
Recurrent Drafter for Fast Speculative Decoding in Large Language ModelsCode3
MAD-ICP: It Is All About Matching Data -- Robust and Informed LiDAR OdometryCode3
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare CopilotCode3
AER: Auto-Encoder with Regression for Time Series Anomaly DetectionCode3
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning AttentionCode3
Show:102550
← PrevPage 123 of 26463Next →