SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9701-9725 of 177340 papers

Title | Status | Hype
P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark | Code | 2
EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations | Code | 2
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer | Code | 2
3D Reconstruction of Spherical Images based on Incremental Structure from Motion | Code | 2
RVT: Robotic View Transformer for 3D Object Manipulation | Code | 2
To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning Acceleration | Code | 2
MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying | Code | 2
An Open-Source Knowledge Graph Ecosystem for the Life Sciences | Code | 2
Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation | Code | 2
A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning | Code | 2
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference | Code | 2
PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks | Code | 2
LP-MusicCaps: LLM-Based Pseudo Music Captioning | Code | 2
Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion | Code | 2
CDMamba: Incorporating Local Clues into Mamba for Remote Sensing Image Binary Change Detection | Code | 2
Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations | Code | 2
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations | Code | 2
FreeVA: Offline MLLM as Training-Free Video Assistant | Code | 2
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World | Code | 2
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Code | 2
WeatherBench 2: A benchmark for the next generation of data-driven global weather models | Code | 2
ConTextTab: A Semantics-Aware Tabular In-Context Learner | Code | 2
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering | Code | 2
FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation | Code | 2
PromptASR for contextualized ASR with controllable style | Code | 2
Page 389 of 7094