SOTAVerified

Saliency Prediction

Saliency prediction models where human observers fixate when viewing a visual scene. Informed by the human visual attention mechanism, a model outputs a saliency map that estimates, for each location in the scene, the probability that the eyes will fixate there.
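To make the "probability of fixation" reading concrete, here is a minimal sketch that rescales a grid of raw saliency scores so it can be read as a distribution over locations. The function names and the toy 3x3 grid are illustrative assumptions, not taken from any of the listed papers.

```python
def to_fixation_probability(saliency):
    """Rescale a 2-D grid of non-negative scores so that all cells sum to 1."""
    flat = [v for row in saliency for v in row]
    if min(flat) < 0:
        raise ValueError("saliency scores are assumed to be non-negative")
    total = sum(flat)
    if total == 0:
        # Flat map: fall back to a uniform distribution over all cells.
        return [[1.0 / len(flat)] * len(row) for row in saliency]
    return [[v / total for v in row] for row in saliency]


def most_likely_fixation(prob):
    """Return (row, col) of the cell with the highest fixation probability."""
    return max(
        ((r, c) for r, row in enumerate(prob) for c in range(len(row))),
        key=lambda rc: prob[rc[0]][rc[1]],
    )


# Hypothetical toy scene: the centre cell is the most salient.
toy_map = [[0.0, 1.0, 0.0],
           [1.0, 4.0, 1.0],
           [0.0, 1.0, 0.0]]
prob = to_fixation_probability(toy_map)
peak = most_likely_fixation(prob)
```

In this toy case the centre cell receives half of the total probability mass, so it is the predicted most likely fixation location.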

Papers

Showing 1-50 of 268 papers

Title | Status | Hype
SUM: Saliency Unification through Mamba for Visual Attention Modeling | Code | 2
Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction | Code | 1
AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Code | 1
Simultaneous Enhancement and Super-Resolution of Underwater Imagery for Improved Visual Perception | Code | 1
TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation | Code | 1
Unified Image and Video Saliency Modeling | Code | 1
FastSal: a Computationally Efficient Network for Visual Saliency Prediction | Code | 1
Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model | Code | 1
n-Reference Transfer Learning for Saliency Prediction | Code | 1
Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition | Code | 1
SVAM: Saliency-guided Visual Attention Modeling by Autonomous Underwater Robots | Code | 1
Tidying Deep Saliency Prediction Architectures | Code | 1
Generative Transformer for Accurate and Reliable Salient Object Detection | Code | 1
Uncertainty Inspired RGB-D Saliency Detection | Code | 1
Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos | Code | 1
GASP: Gated Attention For Saliency Prediction | Code | 1
Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection | Code | 1
Learning From Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection | Code | 1
Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes | Code | 1
n-Reference Transfer Learning for Saliency Prediction | Code | 1
Pyramidal Attention for Saliency Detection | Code | 1
Semantic Segmentation of Underwater Imagery: Dataset and Benchmark | Code | 1
STAViS: Spatio-Temporal AudioVisual Saliency Network | Code | 1
ATSal: An Attention Based Architecture for Saliency Prediction in 360 Videos | Code | 1
TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction | Code | 1
TempSAL - Uncovering Temporal Information for Deep Saliency Prediction | Code | 1
TranSalNet: Towards perceptually relevant visual saliency prediction | Code | 1
ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction | Code | 1
BASNet: Boundary-Aware Salient Object Detection | Code | 1
Uncertainty Guided Refinement for Fine-Grained Salient Object Detection | Code | 1
How is Gaze Influenced by Image Transformations? Dataset and Model | Code | 1
Does Text Attract Attention on E-Commerce Images: A Novel Saliency Prediction Dataset and Method | Code | 1
An Energy-Based Prior for Generative Saliency | Code | 1
Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis | Code | 1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance | Code | 1
Data Augmentation via Latent Diffusion for Saliency Prediction | Code | 1
BTS-Net: Bi-directional Transfer-and-Selection Network For RGB-D Salient Object Detection | Code | 1
DeepGaze IIE: Calibrated prediction in and out-of-domain for state-of-the-art saliency modeling | Code | 1
MaskSplit: Self-supervised Meta-learning for Few-shot Semantic Segmentation | Code | 1
MDS-ViTNet: Improving saliency prediction for Eye-Tracking with Vision Transformer | Code | 1
Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNet | Code | 1
Multi-task UNet: Jointly Boosting Saliency Prediction and Disease Classification on Chest X-ray Images | Code | 1
Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark | Code | 1
Panoramic Vision Transformer for Saliency Detection in 360° Videos | Code | 1
RSKDD-Net: Random Sample-based Keypoint Detector and Descriptor | Code | 1
Select, Supplement and Focus for RGB-D Saliency Detection | Code | 1
Specificity-preserving RGB-D Saliency Detection | Code | 1
Spherical Vision Transformer for 360-degree Video Saliency Prediction | Code | 1
Spatio-Temporal Self-Attention Network for Video Saliency Prediction | Code | 1
Video Saliency Prediction Using Enhanced Spatiotemporal Alignment Network | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Transalnet | KL | 0.87 | | Unverified
2 | TempSAL | KL | 0.71 | | Unverified
3 | SSwin transformer | KL | 0.65 | | Unverified
4 | ECT-SAL | KL | 0.58 | | Unverified
5 | SUM | KL | 0.47 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SUM | AUC | 0.88 | | Unverified
2 | SalNAS-XL + Self-KD | AUC | 0.87 | | Unverified
3 | TempSAL | AUC | 0.87 | | Unverified
4 | MDS-ViTNet | AUC | 0.87 | | Unverified
5 | TranSalNet | AUC | 0.87 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SUM | AUC-Judd | 0.91 | | Unverified
2 | TranSalNet | AUC-Judd | 0.87 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SUM | KL | 0.27 | | Unverified
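
For readers unfamiliar with the KL and AUC-Judd columns above, the sketch below shows one common way these scores are computed for saliency maps. It is a simplified illustration under stated assumptions: maps are flattened to plain lists, `kl_divergence` expects both inputs to sum to 1, and `auc_judd` uses the saliency values at fixated cells as thresholds. The page does not say which exact variant each listed result uses, so these functions are not guaranteed to reproduce the numbers in the tables. Lower KL is better; higher AUC is better.

```python
import math

EPS = 1e-12  # guards against log(0) and division by zero


def kl_divergence(pred, target):
    """KL(target || pred) for two flattened maps that each sum to 1."""
    return sum(t * math.log(EPS + t / (p + EPS)) for p, t in zip(pred, target))


def auc_judd(scores, fixated):
    """Area under the ROC curve, thresholding at the scores of fixated cells."""
    pos = sorted((s for s, f in zip(scores, fixated) if f), reverse=True)
    neg = [s for s, f in zip(scores, fixated) if not f]
    if not pos or not neg:
        raise ValueError("need at least one fixated and one non-fixated cell")
    tpr, fpr = [0.0], [0.0]
    for i, thresh in enumerate(pos, start=1):
        tpr.append(i / len(pos))  # fixated cells at or above the threshold
        fpr.append(sum(1 for s in neg if s >= thresh) / len(neg))
    tpr.append(1.0)
    fpr.append(1.0)
    # Trapezoidal integration of the true-positive rate over the false-positive rate.
    return sum((fpr[k] - fpr[k - 1]) * (tpr[k] + tpr[k - 1]) / 2
               for k in range(1, len(fpr)))


# Hypothetical 4-cell example: cells 0 and 2 were fixated.
fixated = [True, False, True, False]
good_scores = [0.9, 0.1, 0.8, 0.3]    # fixated cells score highest
mixed_scores = [0.2, 0.9, 0.8, 0.3]   # one fixated cell scores lowest

auc_good = auc_judd(good_scores, fixated)
auc_mixed = auc_judd(mixed_scores, fixated)
kl_uniform = kl_divergence([0.25] * 4, [0.5, 0.0, 0.5, 0.0])
```

Here `auc_good` reaches the maximum because every fixated cell outranks every non-fixated one, while `auc_mixed` falls to chance level. `kl_uniform` measures how far a uniform prediction is from a target that puts all mass on the two fixated cells.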