Multimodal Recommendation

The multimodal recommendation task involves developing systems that leverage and integrate multiple types of data—such as text, images, audio, and user interactions—to predict and suggest items that align with a user's preferences. Unlike traditional recommendation approaches that rely on a single data modality, multimodal recommendation harnesses the diverse information from various sources to create richer and more nuanced representations of both users and items. This integration enables the system to understand and capture complex relationships and attributes across different data types, thereby enhancing the accuracy and relevance of the recommendations. The primary goal is to provide personalized suggestions by effectively merging and processing heterogeneous data to better match users with items they are likely to engage with or find valuable.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 59 papers

Title	Date	Tasks	Status	Hype
Modality-Independent Graph Neural Networks with Global Transformers for Multimodal Recommendation	Dec 18, 2024	Graph LearningMulti-modal Recommendation	CodeCode Available	2
A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions	Feb 9, 2023	Multimodal RecommendationRecommendation Systems	CodeCode Available	2
Quadratic Interest Network for Multimodal Click-Through Rate Prediction	Apr 24, 2025	Click-Through Rate PredictionMultimodal Recommendation	CodeCode Available	1
COHESION: Composite Graph Convolutional Network with Dual-Stage Fusion for Multimodal Recommendation	Apr 6, 2025	Multimodal RecommendationRepresentation Learning	CodeCode Available	1
Generating with Fairness: A Modality-Diffused Counterfactual Framework for Incomplete Multimodal Recommendations	Jan 21, 2025	counterfactualFairness	CodeCode Available	1
Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation	Dec 19, 2024	Graph LearningMultimodal Recommendation	CodeCode Available	1
Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPs	Dec 16, 2024	Multimodal RecommendationRecommendation Systems	CodeCode Available	1
Train Once, Deploy Anywhere: Matryoshka Representation Learning for Multimodal Recommendation	Sep 25, 2024	Multimodal RecommendationRecommendation Systems	CodeCode Available	1
Harnessing Multimodal Large Language Models for Multimodal Sequential Recommendation	Aug 19, 2024	Large Language ModelMultimodal Large Language Model	CodeCode Available	1
Modality-Balanced Learning for Multimedia Recommendation	Jul 26, 2024	Collaborative Filteringcounterfactual	CodeCode Available	1
GUME: Graphs and User Modalities Enhancement for Long-Tail Multimodal Recommendation	Jul 17, 2024	Multimodal RecommendationRecommendation Systems	CodeCode Available	1
End-to-end training of Multimodal Model and ranking Model	Apr 9, 2024	Contrastive Learningmodel	CodeCode Available	1
AlignRec: Aligning and Training in Multimodal Recommendations	Mar 19, 2024	Multimodal Recommendation	CodeCode Available	1
Ducho 2.0: Towards a More Up-to-Date Unified Framework for the Extraction of Multimodal Features in Recommendation	Mar 7, 2024	BenchmarkingMultimodal Recommendation	CodeCode Available	1
MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation	Feb 29, 2024	cross-modal alignmentMultimodal Recommendation	CodeCode Available	1
Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation with Interpretability	Feb 25, 2024	Collaborative FilteringMultimodal Recommendation	CodeCode Available	1
Mirror Gradient: Towards Robust Multimodal Recommender Systems via Exploring Flat Local Minima	Feb 17, 2024	Multimodal RecommendationRecommendation Systems	CodeCode Available	1
LGMRec: Local and Global Graph Learning for Multimodal Recommendation	Dec 27, 2023	Graph EmbeddingGraph Learning	CodeCode Available	1
Causality-Inspired Fair Representation Learning for Multimodal Recommendation	Oct 26, 2023	AttributeCausal Inference	CodeCode Available	1
LightGT: A Light Graph Transformer for Multimedia Recommendation	Jul 18, 2023	Collaborative FilteringMicrovideo Recommendation	CodeCode Available	1
Ducho: A Unified Framework for the Extraction of Multimodal Features in Recommendation	Jun 29, 2023	Multimodal Recommendation	CodeCode Available	1
Enhancing Dyadic Relations with Homogeneous Graphs for Multimodal Recommendation	Jan 28, 2023	Graph LearningMultimodal Recommendation	CodeCode Available	1
A Tale of Two Graphs: Freezing and Denoising Graph Structures for Multimodal Recommendation	Nov 13, 2022	DenoisingGraph structure learning	CodeCode Available	1
Mining Latent Structures for Multimedia Recommendation	Apr 19, 2021	Collaborative FilteringMultimedia recommendation	CodeCode Available	1
MDVT: Enhancing Multimodal Recommendation with Model-Agnostic Multimodal-Driven Virtual Triplets	May 22, 2025	Model OptimizationMultimodal Recommendation	—Unverified	0

Show:10 25 50

← PrevPage 1 of 3Next →

No leaderboard results yet.