SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1245112500 of 177340 papers

TitleStatusHype
Ensured: Explanations for Decreasing the Epistemic Uncertainty in PredictionsCode2
HAIR: Hypernetworks-based All-in-One Image RestorationCode2
Unifying Pairwise Interactions in Complex DynamicsCode2
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse WeatherCode2
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face DetectorCode2
Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-ExpertsCode2
GarmentLab: A Unified Simulation and Benchmark for Garment ManipulationCode2
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization MethodsCode2
MTGS: Multi-Traversal Gaussian SplattingCode2
What Was Your Prompt? A Remote Keylogging Attack on AI AssistantsCode2
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and PerceptionCode2
Discovery of 2D materials using Transformer Network based Generative DesignCode2
A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future DirectionsCode2
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System SafetyCode2
Parameter-Efficient Fine-Tuning with Discrete Fourier TransformCode2
CERT: Continual Pre-Training on Sketches for Library-Oriented Code GenerationCode2
HumanRF: High-Fidelity Neural Radiance Fields for Humans in MotionCode2
Misalignment-Robust Frequency Distribution Loss for Image TransformationCode2
TRESTLE: A Model of Concept Formation in Structured DomainsCode2
Synthesis of discrete-continuous quantum circuits with multimodal diffusion modelsCode2
SecAlign: Defending Against Prompt Injection with Preference OptimizationCode2
decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating PointsCode2
SegFace: Face Segmentation of Long-Tail ClassesCode2
Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization AbilitiesCode2
IMU-Aided Event-based Stereo Visual OdometryCode2
Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMsCode2
MotionLLaMA: A Unified Framework for Motion Synthesis and ComprehensionCode2
EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement LearningCode2
SE(3)-Stochastic Flow Matching for Protein Backbone GenerationCode2
AnyText2: Visual Text Generation and Editing With Customizable AttributesCode2
Smaller But Better: Unifying Layout Generation with Smaller Large Language ModelsCode2
Dialectal Coverage And Generalization in Arabic Speech RecognitionCode2
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object DetectionCode2
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and PlanningCode2
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust ControlCode2
FinMTEB: Finance Massive Text Embedding BenchmarkCode2
Online Vectorized HD Map Construction using GeometryCode2
Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-ResolutionCode2
TabDiff: a Multi-Modal Diffusion Model for Tabular Data GenerationCode2
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing ControlCode2
Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous DrivingCode2
Transformer tricks: Removing weights for skipless transformersCode2
Hyper-3DG: Text-to-3D Gaussian Generation via HypergraphCode2
Listen, Think, and UnderstandCode2
Pre-training Differentially Private Models with Limited Public DataCode2
Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine SamplingCode2
BARS: Towards Open Benchmarking for Recommender SystemsCode2
RAP: Retrieval-Augmented Personalization for Multimodal Large Language ModelsCode2
Optimal Invariant Bases for Atomistic Machine LearningCode2
Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image RegistrationCode2
Show:102550
← PrevPage 250 of 3547Next →