SOTAVerified

Saliency Prediction

A saliency map is a model that predicts eye fixations on a visual scene. Saliency prediction is informed by the human visual attention mechanism and predicts the possibility of the human eyes to stay in a certain position in the scene.

Papers

Showing 125 of 268 papers

TitleStatusHype
CASE: Contrastive Activation for Saliency EstimationCode0
CSPENet: Contour-Aware and Saliency Priors Embedding Network for Infrared Small Target DetectionCode0
Modeling Saliency Dataset Bias0
LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs0
Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction0
DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction0
Uncertainty Guided Refinement for Fine-Grained Salient Object DetectionCode1
Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured MeshesCode1
A Deep Learning Framework for Visual Attention Prediction and Analysis of News InterfacesCode0
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues0
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues0
CASP: Consistency-aware Audio-induced Saliency Prediction Model for Omnidirectional Video0
Explainable Saliency: Articulating Reasoning with Contextual Prioritization0
Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D GraphicsCode0
Relevance-guided Audio Visual Fusion for Video Saliency Prediction0
Correlation of Object Detection Performance with Visual Saliency and Depth EstimationCode0
AIM 2024 Challenge on Video Saliency Prediction: Methods and ResultsCode1
Data Augmentation via Latent Diffusion for Saliency PredictionCode1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion GuidanceCode1
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion0
How Does Audio Influence Visual Attention in Omnidirectional Videos? Database and ModelCode0
Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study0
SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillationCode0
Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action RecognitionCode1
SUM: Saliency Unification through Mamba for Visual Attention ModelingCode2
Show:102550
← PrevPage 1 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TransalnetKL0.87Unverified
2TempSALKL0.71Unverified
3SSwin transformerKL0.65Unverified
4ECT-SALKL0.58Unverified
5SUMKL0.47Unverified
#ModelMetricClaimedVerifiedStatus
1SUMAUC0.88Unverified
2SalNAS-XL + Self-KDAUC0.87Unverified
3TempSALAUC0.87Unverified
4MDS-ViTNetAUC0.87Unverified
5TranSalNetAUC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1SUMAUC-Judd0.91Unverified
2TranSalNetAUC-Judd0.87Unverified
#ModelMetricClaimedVerifiedStatus
1SUMKL0.27Unverified