SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 19,351–19,400 of 474,278 papers

Title | Status | Hype
Referring Video Object Segmentation via Language-aligned Track Selection | Code | 1
Explainable fault and severity classification for rolling element bearings using Kolmogorov-Arnold networks | Code | 1
How Much Can Time-related Features Enhance Time Series Forecasting? | Code | 1
MambaU-Lite: A Lightweight Model based on Mamba and Integrated Channel-Spatial Attention for Skin Lesion Segmentation | Code | 1
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Code | 1
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages | Code | 1
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Code | 1
Phaseformer: Phase-based Attention Mechanism for Underwater Image Restoration and Beyond | Code | 1
Dual-Branch Graph Transformer Network for 3D Human Mesh Reconstruction from Video | Code | 1
Hiding Faces in Plain Sight: Defending DeepFakes by Disrupting Face Detection | Code | 1
Improving Detail in Pluralistic Image Inpainting with Feature Dequantization | Code | 1
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos | Code | 1
Multi-Granularity Video Object Segmentation | Code | 1
Toward Real-Time Edge AI: Model-Agnostic Task-Oriented Communication with Visual Feature Alignment | Code | 1
Token Cropr: Faster ViTs for Quite a Few Tasks | Code | 1
VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information | Code | 1
SEED4D: A Synthetic Ego–Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Code | 1
DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation | Code | 1
Towards Unified Molecule-Enhanced Pathology Image Representation Learning via Integrating Spatial Transcriptomics | Code | 1
Free and Customizable Code Documentation with LLMs: A Fine-Tuning Approach | Code | 1
Particle-based 6D Object Pose Estimation from Point Clouds using Diffusion Models | Code | 1
EDTformer: An Efficient Decoder Transformer for Visual Place Recognition | Code | 1
SyncVIS: Synchronized Video Instance Segmentation | Code | 1
DMFourLLIE: Dual-Stage and Multi-Branch Fourier Network for Low-Light Image Enhancement | Code | 1
Visual Modality Prompt for Adapting Vision-Language Object Detectors | Code | 1
Vid-Morp: Video Moment Retrieval Pretraining from Unlabeled Videos in the Wild | Code | 1
Motion-Aware Optical Camera Communication with Event Cameras | Code | 1
MambaNUT: Nighttime UAV Tracking via Mamba-based Adaptive Curriculum Learning | Code | 1
Oracle-guided Dynamic User Preference Modeling for Sequential Recommendation | Code | 1
Unified Parameter-Efficient Unlearning for LLMs | Code | 1
DroidCall: A Dataset for LLM-powered Android Intent Invocation | Code | 1
LineGS: 3D Line Segment Representation on 3D Gaussian Splatting | Code | 1
DogLayout: Denoising Diffusion GAN for Discrete and Continuous Layout Generation | Code | 1
TAROT: Targeted Data Selection via Optimal Transport | Code | 1
AgriBench: A Hierarchical Agriculture Benchmark for Multimodal Large Language Models | Code | 1
Jailbreak Large Vision-Language Models Through Multi-Modal Linkage | Code | 1
Fine Tuning Large Language Models to Deliver CBT for Depression | Code | 1
PerLA: Perceptive 3D Language Assistant | Code | 1
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs | Code | 1
GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting | Code | 1
Multigraph Message Passing with Bi-Directional Multi-Edge Aggregations | Code | 1
V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow | Code | 1
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings | Code | 1
SDR-GNN: Spectral Domain Reconstruction Graph Neural Network for Incomplete Multimodal Learning in Conversational Emotion Recognition | Code | 1
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective | Code | 1
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook | Code | 1
Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation | Code | 1
Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning | Code | 1
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation | Code | 1
Another look at inference after prediction | Code | 1
Page 388 of 9486