SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 70017050 of 661570 papers

TitleStatusHype
SAW: Toward a Surgical Action World Model via Controllable and Scalable Video Generation0
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection DefensesCode0
Human-Centered Evaluation of an LLM-Based Process Modeling Copilot: A Mixed-Methods Study with Domain Experts0
IROSA: Interactive Robot Skill Adaptation using Natural Language0
Fourier Angle Alignment for Oriented Object Detection in Remote SensingCode0
AI Model Modulation with Logits Redistribution0
FastDSAC: Unlocking the Potential of Maximum Entropy RL in High-Dimensional Humanoid Control0
DRIFT-Net: A Spectral--Coupled Neural Operator for PDEs LearningCode0
A2Z-10M+: Geometric Deep Learning with A-to-Z BRep Annotations for AI-Assisted CAD Modeling and Reverse Engineering0
HSEmotion Team at ABAW-10 Competition: Facial Expression Recognition, Valence-Arousal Estimation, Action Unit Detection and Fine-Grained Violence Classification0
CognitionCapturerPro: Towards High-Fidelity Visual Decoding from EEG/MEG via Multi-modal Information and Asymmetric AlignmentCode0
DiffProxy: Multi-View Human Mesh Recovery via Diffusion-Generated Dense Proxies0
Beyond Static Instruction: A Multi-agent AI Framework for Adaptive Augmented Reality Robot Training0
Mask2Flow-TSE: Two-Stage Target Speaker Extraction with Masking and Flow Matching0
98 Faster LLM Routing Without a Dedicated GPU: Flash Attention, Prompt Compression, and Near-Streaming for the vLLM Semantic Router0
Dependency-Aware Parallel Decoding via Attention for Diffusion LLMs0
GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generation0
ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning0
The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts0
TubeMLLM: A Foundation Model for Topology Knowledge Exploration in Vessel-like Anatomy0
Thinking in Dynamics: How Multimodal Large Language Models Perceive, Track, and Reason Dynamics in Physical 4D World0
AccelAes: Accelerating Diffusion Transformers for Training-Free Aesthetic-Enhanced Image GenerationCode0
SPELL: Self-Play Reinforcement Learning for Evolving Long-Context Language ModelsCode0
GeoZero: Incentivizing Reasoning from Scratch on Geospatial ScenesCode0
Mitigating Latent Mismatch in cVAE-Based Singing Voice Synthesis via Flow MatchingCode0
VLM4Rec: Multimodal Semantic Representation for Recommendation with Large Vision-Language ModelsCode0
HIFICL: High-Fidelity In-Context Learning for Multimodal TasksCode0
CM-Bench: A Comprehensive Cross-Modal Feature Matching Benchmark Bridging Visible and Infrared ImagesCode0
A protocol for evaluating robustness to H&E staining variation in computational pathology modelsCode0
FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual PromptsCode0
Fair Lung Disease Diagnosis from Chest CT via Gender-Adversarial Attention Multiple Instance LearningCode0
SortScrews: A Dataset and Baseline for Real-time Screw ClassificationCode0
Think and Answer ME: Benchmarking and Exploring Multi-Entity Reasoning Grounding in Remote SensingCode0
Vision Verification Enhanced Fusion of VLMs for Efficient Visual ReasoningCode0
HFP-SAM: Hierarchical Frequency Prompted SAM for Efficient Marine Animal SegmentationCode0
UNIStainNet: Foundation-Model-Guided Virtual Staining of H&E to IHCCode0
IGASA: Integrated Geometry-Aware and Skip-Attention Modules for Enhanced Point Cloud RegistrationCode0
CVGL: Causal Learning and Geometric TopologyCode0
Reinforcement Learning for Diffusion LLMs with Entropy-Guided Step Selection and Stepwise AdvantagesCode0
Multiscale Structure-Guided Latent Diffusion for Multimodal MRI TranslationCode0
Swap-guided Preference Learning for Personalized Reinforcement Learning from Human FeedbackCode0
Automatic Labelling for Low-Light Pedestrian DetectionCode0
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited ViewsCode0
Parameterized Prompt for Incremental Object DetectionCode0
GraphPilot: Grounded Scene Graph Conditioning for Language-Based Autonomous DrivingCode0
AnatomiX, an Anatomy-Aware Grounded Multimodal Large Language Model for Chest X-Ray InterpretationCode0
BitDance: Scaling Autoregressive Generative Models with Binary TokensCode0
Follow the Saliency: Supervised Saliency for Retrieval-augmented Dense Video CaptioningCode0
SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion TransformerCode0
CMHANet: A Cross-Modal Hybrid Attention Network for Point Cloud RegistrationCode0
Show:102550
← PrevPage 141 of 13232Next →