SOTAVerified

Multimodal Recommendation

The multimodal recommendation task involves developing systems that leverage and integrate multiple types of data—such as text, images, audio, and user interactions—to predict and suggest items that align with a user's preferences. Unlike traditional recommendation approaches that rely on a single data modality, multimodal recommendation harnesses the diverse information from various sources to create richer and more nuanced representations of both users and items. This integration enables the system to understand and capture complex relationships and attributes across different data types, thereby enhancing the accuracy and relevance of the recommendations. The primary goal is to provide personalized suggestions by effectively merging and processing heterogeneous data to better match users with items they are likely to engage with or find valuable.

Papers

Showing 125 of 59 papers

TitleStatusHype
Modality-Independent Graph Neural Networks with Global Transformers for Multimodal RecommendationCode2
A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future DirectionsCode2
Quadratic Interest Network for Multimodal Click-Through Rate PredictionCode1
COHESION: Composite Graph Convolutional Network with Dual-Stage Fusion for Multimodal RecommendationCode1
Generating with Fairness: A Modality-Diffused Counterfactual Framework for Incomplete Multimodal RecommendationsCode1
Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal RecommendationCode1
Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPsCode1
Train Once, Deploy Anywhere: Matryoshka Representation Learning for Multimodal RecommendationCode1
Harnessing Multimodal Large Language Models for Multimodal Sequential RecommendationCode1
Modality-Balanced Learning for Multimedia RecommendationCode1
GUME: Graphs and User Modalities Enhancement for Long-Tail Multimodal RecommendationCode1
End-to-end training of Multimodal Model and ranking ModelCode1
AlignRec: Aligning and Training in Multimodal RecommendationsCode1
Ducho 2.0: Towards a More Up-to-Date Unified Framework for the Extraction of Multimodal Features in RecommendationCode1
MENTOR: Multi-level Self-supervised Learning for Multimodal RecommendationCode1
Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation with InterpretabilityCode1
Mirror Gradient: Towards Robust Multimodal Recommender Systems via Exploring Flat Local MinimaCode1
LGMRec: Local and Global Graph Learning for Multimodal RecommendationCode1
Causality-Inspired Fair Representation Learning for Multimodal RecommendationCode1
LightGT: A Light Graph Transformer for Multimedia RecommendationCode1
Ducho: A Unified Framework for the Extraction of Multimodal Features in RecommendationCode1
Enhancing Dyadic Relations with Homogeneous Graphs for Multimodal RecommendationCode1
A Tale of Two Graphs: Freezing and Denoising Graph Structures for Multimodal RecommendationCode1
Mining Latent Structures for Multimedia RecommendationCode1
MDVT: Enhancing Multimodal Recommendation with Model-Agnostic Multimodal-Driven Virtual Triplets0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.