SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 9451-9500 of 177340 papers

Title | Status | Hype
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Code | 2
Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Code | 2
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation | Code | 2
CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models | Code | 2
Uncertainty Modelling and Robust Observer Synthesis using the Koopman Operator | Code | 2
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation | Code | 2
Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models | Code | 2
Autoregressive Action Sequence Learning for Robotic Manipulation | Code | 2
ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality | Code | 2
GNSS/GPS Spoofing and Jamming Identification Using Machine Learning and Deep Learning | Code | 2
Towards Vision-Language Geo-Foundation Model: A Survey | Code | 2
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model | Code | 2
Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy, Advances, and Outlook | Code | 2
DRO: A Python Library for Distributionally Robust Optimization in Machine Learning | Code | 2
Model-Preserving Adaptive Rounding | Code | 2
Learning Trajectory-Aware Transformer for Video Super-Resolution | Code | 2
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Code | 2
MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers | Code | 2
Reasoning to Attend: Try to Understand How <SEG> Token Works | Code | 2
PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision | Code | 2
Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs | Code | 2
TableBank: A Benchmark Dataset for Table Detection and Recognition | Code | 2
No Language Left Behind: Scaling Human-Centered Machine Translation | Code | 2
EraRAG: Efficient and Incremental Retrieval Augmented Generation for Growing Corpora | Code | 2
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Code | 2
AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing | Code | 2
FlipAttack: Jailbreak LLMs via Flipping | Code | 2
PEDANTS: Cheap but Effective and Interpretable Answer Equivalence | Code | 2
SchNetPack 2.0: A neural network toolbox for atomistic machine learning | Code | 2
Closed-Form Factorization of Latent Semantics in GANs | Code | 2
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization | Code | 2
OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Code | 2
CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion | Code | 2
VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning | Code | 2
ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling | Code | 2
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model | Code | 2
TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning | Code | 2
Controllable 3D Outdoor Scene Generation via Scene Graphs | Code | 2
DDSP: Differentiable Digital Signal Processing | Code | 2
Coswara: A website application enabling COVID-19 screening by analysing respiratory sound samples and health symptoms | Code | 2
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion | Code | 2
RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets | Code | 2
Reevaluating Adversarial Examples in Natural Language | Code | 2
CTR-Driven Advertising Image Generation with Multimodal Large Language Models | Code | 2
Learning Few-Step Diffusion Models by Trajectory Distribution Matching | Code | 2
T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models | Code | 2
RM-R1: Reward Modeling as Reasoning | Code | 2
OBELiX: A Curated Dataset of Crystal Structures and Experimentally Measured Ionic Conductivities for Lithium Solid-State Electrolytes | Code | 2
pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models | Code | 2
Lemur: Harmonizing Natural Language and Code for Language Agents | Code | 2
Page 190 of 3547