SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1235112400 of 177340 papers

TitleStatusHype
STICKERCONV: Generating Multimodal Empathetic Responses from ScratchCode2
Revealing data leakage in protein interaction benchmarksCode2
Improving CLIP Training with Language RewritesCode2
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language ModelCode2
What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of GraphCode2
Dynamic 3D Point Cloud Sequences as 2D VideosCode2
PILOT: A Pre-Trained Model-Based Continual Learning ToolboxCode2
ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to DescribeCode2
PL-EVIO: Robust Monocular Event-based Visual Inertial Odometry with Point and Line FeaturesCode2
EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code RepositoriesCode2
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real WorldCode2
RangeUDF: Semantic Surface Reconstruction from 3D Point CloudsCode2
Finetuning Large Language Models for Vulnerability DetectionCode2
Beyond Any-Shot Adaptation: Predicting Optimization Outcome for Robustness Gains without Extra PayCode2
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image ModelsCode2
Retrieval with Learned SimilaritiesCode2
Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image AnalysisCode2
LATR: 3D Lane Detection from Monocular Images with TransformerCode2
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPTCode2
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPOCode2
Face2Diffusion for Fast and Editable Face PersonalizationCode2
Hibou: A Family of Foundational Vision Transformers for PathologyCode2
Diffusion models as plug-and-play priorsCode2
Does Refusal Training in LLMs Generalize to the Past Tense?Code2
LatteReview: A Multi-Agent Framework for Systematic Review Automation Using Large Language ModelsCode2
Introducing Visual Perception Token into Multimodal Large Language ModelCode2
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image SynthesisCode2
SeFlow: A Self-Supervised Scene Flow Method in Autonomous DrivingCode2
RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question AnsweringCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image GenerationCode2
Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix MultiplicationCode2
GaussianAD: Gaussian-Centric End-to-End Autonomous DrivingCode2
AvatarGen: a 3D Generative Model for Animatable Human AvatarsCode2
LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA CompositionCode2
Machine Learning Coarse-Grained Potentials of Protein ThermodynamicsCode2
MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected TrainingCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
JAX-FLUIDS: A fully-differentiable high-order computational fluid dynamics solver for compressible two-phase flowsCode2
QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy PredictionCode2
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot ExecutionCode2
SPD Learning for Covariance-Based Neuroimaging Analysis: Perspectives, Methods, and ChallengesCode2
X-Ray: A Sequential 3D Representation For GenerationCode2
BridgeData V2: A Dataset for Robot Learning at ScaleCode2
Controlled Text Generation via Language Model ArithmeticCode2
Diff-BGM: A Diffusion Model for Video Background Music GenerationCode2
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM AgentCode2
DVMSR: Distillated Vision Mamba for Efficient Super-ResolutionCode2
Open-Set Domain Adaptation for Semantic SegmentationCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Show:102550
← PrevPage 248 of 3547Next →