SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 9701–9750 of 661,570 papers

Title | Status | Hype
Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Code | 2
Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection | Code | 2
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models | Code | 2
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control | Code | 2
SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis | Code | 2
A Simple Baseline for Efficient Hand Mesh Reconstruction | Code | 2
Applied Causal Inference Powered by ML and AI | Code | 2
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Code | 2
Differentially Private Synthetic Data via Foundation Model APIs 2: Text | Code | 2
REAL-Colon: A dataset for developing real-world AI applications in colonoscopy | Code | 2
Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis | Code | 2
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation | Code | 2
OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy Prediction | Code | 2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Code | 2
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Code | 2
Face Swap via Diffusion Model | Code | 2
Dynamic 3D Point Cloud Sequences as 2D Videos | Code | 2
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning | Code | 2
On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving | Code | 2
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks | Code | 2
VNLP: Turkish NLP Package | Code | 2
Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing | Code | 2
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Code | 2
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data | Code | 2
Point Cloud Mamba: Point Cloud Learning via State Space Model | Code | 2
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding | Code | 2
Data Science Education in Undergraduate Physics: Lessons Learned from a Community of Practice | Code | 2
Deformable One-shot Face Stylization via DINO Semantic Guidance | Code | 2
SURE: SUrvey REcipes for building reliable and robust deep networks | Code | 2
A Modular and Robust Physics-Based Approach for Lensless Image Reconstruction | Code | 2
Dual-domain strip attention for image restoration | Code | 2
Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Code | 2
Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching | Code | 2
TempCompass: Do Video LLMs Really Understand Videos? | Code | 2
PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Code | 2
Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural Networks | Code | 2
Learning Commonality, Divergence and Variety for Unsupervised Visible-Infrared Person Re-identification | Code | 2
Curiosity-driven Red-teaming for Large Language Models | Code | 2
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Code | 2
A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Code | 2
How do Large Language Models Handle Multilingualism? | Code | 2
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Code | 2
NARUTO: Neural Active Reconstruction from Uncertain Target Observations | Code | 2
Deep learning for 3D human pose estimation and mesh recovery: A survey | Code | 2
Global and Local Prompts Cooperation via Optimal Transport for Federated Learning | Code | 2
Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Code | 2
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap | Code | 2
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers | Code | 2
DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Code | 2
A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation | Code | 2