SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 79518000 of 661570 papers

TitleStatusHype
Projecting Points to Axes: Oriented Object Detection via Point-Axis RepresentationCode2
β-DPO: Direct Preference Optimization with Dynamic βCode2
An Economic Framework for 6-DoF Grasp DetectionCode2
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image RetrievalCode2
WalkTheDog: Cross-Morphology Motion Alignment via Phase ManifoldsCode2
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal PerceptionCode2
Gradient Boosting Reinforcement LearningCode2
SALT: Introducing a Framework for Hierarchical Segmentations in Medical Imaging using Softmax for Arbitrary Label TreesCode2
MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view VideosCode2
Adaptive Parametric ActivationCode2
WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous DrivingCode2
AddressCLIP: Empowering Vision-Language Models for City-wide Image Address LocalizationCode2
Transformer Circuit Faithfulness Metrics are not RobustCode2
Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public DataCode2
Exploiting Scale-Variant Attention for Segmenting Small Medical ObjectsCode2
MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository DiscoveryCode2
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete DataCode2
Coherent and Multi-modality Image Inpainting via Latent Space OptimizationCode2
SaMoye: Zero-shot Singing Voice Conversion Model Based on Feature Disentanglement and EnhancementCode2
IRSAM: Advancing Segment Anything Model for Infrared Small Target DetectionCode2
Density Estimation via Binless Multidimensional IntegrationCode2
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph PriorCode2
Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain ShiftCode2
ViTime: A Visual Intelligence-Based Foundation Model for Time Series ForecastingCode2
GLBench: A Comprehensive Benchmark for Graph with Large Language ModelsCode2
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest TransformerCode2
Generative Image as Action ModelsCode2
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image SynthesisCode2
LitSearch: A Retrieval Benchmark for Scientific Literature SearchCode2
Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A SurveyCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement LearningCode2
Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and AnalysisCode2
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept ExtractionCode2
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention MapsCode2
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous ExplorationCode2
Decomposition Betters Tracking Everything EverywhereCode2
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion ModelsCode2
Vision language models are blind: Failing to translate detailed visual features into wordsCode2
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision MakingCode2
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible GuidanceCode2
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language ModelCode2
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive DistillationCode2
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature AttentionCode2
Hyperion - A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAMCode2
Etalon: Holistic Performance Evaluation Framework for LLM Inference SystemsCode2
Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X CommunicationsCode2
ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape DisentanglementCode2
MEEG and AT-DGNN: Improving EEG Emotion Recognition with Music Introducing and Graph-based LearningCode2
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight GenerationCode2
Show:102550
← PrevPage 160 of 13232Next →