SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8551-8600 of 177,340 papers

Title | Status | Hype
End-to-End Modeling Hierarchical Time Series Using Autoregressive Transformer and Conditional Normalizing Flow based Reconciliation | Code | 2
H-CoT: Hijacking the Chain-of-Thought Safety Reasoning Mechanism to Jailbreak Large Reasoning Models, Including OpenAI o1/o3, DeepSeek-R1, and Gemini 2.0 Flash Thinking | Code | 2
CGI-Stereo: Accurate and Real-Time Stereo Matching via Context and Geometry Interaction | Code | 2
Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement | Code | 2
Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets | Code | 2
FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Code | 2
On Evaluating Adversarial Robustness of Large Vision-Language Models | Code | 2
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator | Code | 2
BackdoorBox: A Python Toolbox for Backdoor Learning | Code | 2
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks | Code | 2
Raising the Cost of Malicious AI-Powered Image Editing | Code | 2
YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for Real-time Spatio-temporal Action Detection | Code | 2
Delivering Arbitrary-Modal Semantic Segmentation | Code | 2
ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction | Code | 2
General Place Recognition Survey: Towards Real-World Autonomy | Code | 2
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation | Code | 2
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Code | 2
DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Code | 2
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Code | 2
SHERF: Generalizable Human NeRF from a Single Image | Code | 2
NOPE: Novel Object Pose Estimation from a Single Image | Code | 2
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer | Code | 2
SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling | Code | 2
On the Benefits of 3D Pose and Tracking for Human Action Recognition | Code | 2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation | Code | 2
Robust Multiview Point Cloud Registration with Reliable Pose Graph Initialization and History Reweighting | Code | 2
Detecting and Grounding Multi-Modal Media Manipulation | Code | 2
Large Language Models Post-training: Surveying Techniques from Alignment to Reasoning | Code | 2
Automatic Gradient Descent: Deep Learning without Hyperparameters | Code | 2
Diffusion Recommender Model | Code | 2
RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions | Code | 2
Heterogeneous-Agent Reinforcement Learning | Code | 2
Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra | Code | 2
SILVR: Guided Diffusion for Molecule Generation | Code | 2
JaxPruner: A concise library for sparsity research | Code | 2
NeuralKG-ind: A Python Library for Inductive Knowledge Graph Representation Learning | Code | 2
Huatuo-26M, a Large-scale Chinese Medical QA Dataset | Code | 2
MAMCA -- Optimal on Accuracy and Efficiency for Automatic Modulation Classification with Extended Signal Length | Code | 2
TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning | Code | 2
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training | Code | 2
DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images | Code | 2
Causal Document-Grounded Dialogue Pre-training | Code | 2
Variational Learning is Effective for Large Deep Networks | Code | 2
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach | Code | 2
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models | Code | 2
Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation | Code | 2
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models | Code | 2
LibAUC: A Deep Learning Library for X-Risk Optimization | Code | 2
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection | Code | 2
Estimating heterogeneous treatment effects with right-censored data via causal survival forests | Code | 2