SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 88018850 of 661570 papers

TitleStatusHype
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language ModelsCode2
Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image GenerationCode2
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion ModelsCode2
LibAUC: A Deep Learning Library for X-Risk OptimizationCode2
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark DetectionCode2
Estimating heterogeneous treatment effects with right-censored data via causal survival forestsCode2
FasterViT: Fast Vision Transformers with Hierarchical AttentionCode2
Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language ModelsCode2
P2P: Automated Paper-to-Poster Generation and Fine-Grained BenchmarkCode2
EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree RepresentationsCode2
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph TransformerCode2
3D Reconstruction of Spherical Images based on Incremental Structure from MotionCode2
RVT: Robotic View Transformer for 3D Object ManipulationCode2
To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning AccelerationCode2
MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention QueryingCode2
An Open-Source Knowledge Graph Ecosystem for the Life SciencesCode2
Multimodality Helps Few-Shot 3D Point Cloud Semantic SegmentationCode2
A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual LearningCode2
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid InferenceCode2
PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural NetworksCode2
LP-MusicCaps: LLM-Based Pseudo Music CaptioningCode2
Phoneme Hallucinator: One-shot Voice Conversion via Set ExpansionCode2
CDMamba: Incorporating Local Clues into Mamba for Remote Sensing Image Binary Change DetectionCode2
Topical-Chat: Towards Knowledge-Grounded Open-Domain ConversationsCode2
SONAR: Sentence-Level Multimodal and Language-Agnostic RepresentationsCode2
FreeVA: Offline MLLM as Training-Free Video AssistantCode2
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-WorldCode2
LVD-2M: A Long-take Video Dataset with Temporally Dense CaptionsCode2
WeatherBench 2: A benchmark for the next generation of data-driven global weather modelsCode2
ConTextTab: A Semantics-Aware Tabular In-Context LearnerCode2
MTVQA: Benchmarking Multilingual Text-Centric Visual Question AnsweringCode2
FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model EvaluationCode2
PromptASR for contextualized ASR with controllable styleCode2
PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental SegmentationCode2
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone ControlCode2
RLLTE: Long-Term Evolution Project of Reinforcement LearningCode2
Smoothing Methods for Automatic Differentiation Across Conditional BranchesCode2
Generative Judge for Evaluating AlignmentCode2
Under pressure: learning-based analog gauge reading in the wildCode2
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech RecognitionCode2
Large Language Models as Zero-shot Dialogue State Tracker through Function CallingCode2
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion modelCode2
UniPAD: A Universal Pre-training Paradigm for Autonomous DrivingCode2
A Setwise Approach for Effective and Highly Efficient Zero-shot Ranking with Large Language ModelsCode2
Representation Learning with Large Language Models for RecommendationCode2
Atom: Low-bit Quantization for Efficient and Accurate LLM ServingCode2
EmT: A Novel Transformer for Generalized Cross-subject EEG Emotion RecognitionCode2
GraphCLIP: Enhancing Transferability in Graph Foundation Models for Text-Attributed GraphsCode2
ESVO2: Direct Visual-Inertial Odometry with Stereo Event CamerasCode2
A Survey of Graph Meets Large Language Model: Progress and Future DirectionsCode2
Show:102550
← PrevPage 177 of 13232Next →