Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

Supervised representation learning: learning representations on task A using annotated data and used to solve task B
Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

Representation Learning: A Review and New Perspectives - Bengio et al. (2012)
A Few Words on Representation Learning - Thalles Silva

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 10580 papers

Title	Date	Tasks	Status	Hype
Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval	May 26, 2025	Contrastive Learningcross-modal alignment	CodeCode Available	1
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation	May 26, 2025	Data AugmentationDomain Generalization	CodeCode Available	1
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset	May 21, 2025	Instance SegmentationKnowledge Distillation	CodeCode Available	1
PyTDC: A multimodal machine learning training, evaluation, and inference platform for biomedical foundation models	May 8, 2025	BenchmarkingGraph Representation Learning	CodeCode Available	1
fastabx: A library for efficient computation of ABX discriminability	May 5, 2025	Representation Learning	CodeCode Available	1
SpectrumFM: A Foundation Model for Intelligent Spectrum Management	May 2, 2025	Anomaly DetectionFew-Shot Learning	CodeCode Available	1
Recursive KL Divergence Optimization: A Dynamic Framework for Representation Learning	Apr 30, 2025	Contrastive LearningDimensionality Reduction	CodeCode Available	1
TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and Imputation	Apr 26, 2025	ImputationMultivariate Time Series Forecasting	CodeCode Available	1
Quadratic Interest Network for Multimodal Click-Through Rate Prediction	Apr 24, 2025	Click-Through Rate PredictionMultimodal Recommendation	CodeCode Available	1
PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning	Apr 22, 2025	parameter-efficient fine-tuningRepresentation Learning	CodeCode Available	1
Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification	Apr 21, 2025	Exemplar-FreeKnowledge Distillation	CodeCode Available	1
Mitigating Degree Bias in Graph Representation Learning with Learnable Structural Augmentation and Structural Self-Attention	Apr 21, 2025	FairnessGraph Representation Learning	CodeCode Available	1
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning	Apr 18, 2025	Common Sense Reasoningimage-classification	CodeCode Available	1
NetTAG: A Multimodal RTL-and-Layout-Aligned Netlist Foundation Model via Text-Attributed Graph	Apr 12, 2025	Graph LearningRepresentation Learning	CodeCode Available	1
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging	Apr 11, 2025	AttributeComputational Efficiency	CodeCode Available	1
Robo-taxi Fleet Coordination at Scale via Reinforcement Learning	Apr 8, 2025	Computational EfficiencyGraph Representation Learning	CodeCode Available	1
COHESION: Composite Graph Convolutional Network with Dual-Stage Fusion for Multimodal Recommendation	Apr 6, 2025	Multimodal RecommendationRepresentation Learning	CodeCode Available	1
Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry	Apr 1, 2025	Representation Learning	CodeCode Available	1
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning	Apr 1, 2025	Representation LearningSelf-Supervised Learning	CodeCode Available	1
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization	Apr 1, 2025	Image GenerationImage Reconstruction	CodeCode Available	1
Pluggable Style Representation Learning for Multi-Style Transfer	Mar 26, 2025	Representation LearningStyle Transfer	CodeCode Available	1
EditCLIP: Representation Learning for Image Editing	Mar 26, 2025	Representation Learning	CodeCode Available	1
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning	Mar 25, 2025	HallucinationLanguage Modeling	CodeCode Available	1
MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning	Mar 24, 2025	parameter-efficient fine-tuningRepresentation Learning	CodeCode Available	1
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving	Mar 22, 2025	Autonomous DrivingRepresentation Learning	CodeCode Available	1
When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning	Mar 19, 2025	Representation LearningSelf-Supervised Learning	CodeCode Available	1
Advancing Medical Representation Learning Through High-Quality Data	Mar 18, 2025	Representation Learningzero-shot-classification	CodeCode Available	1
Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a Measurement	Mar 12, 2025	Graph Representation LearningNode Classification	CodeCode Available	1
REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding	Mar 10, 2025	Instruction FollowingKeypoint Detection	CodeCode Available	1
Dynamic Dictionary Learning for Remote Sensing Image Segmentation	Mar 9, 2025	Dictionary LearningImage Segmentation	CodeCode Available	1
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning	Mar 8, 2025	Deep Reinforcement LearningRepresentation Learning	CodeCode Available	1
Improve Representation for Imbalanced Regression through Geometric Constraints	Mar 2, 2025	Operator learningregression	CodeCode Available	1
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning	Mar 2, 2025	Large Language ModelMulti-Instance Retrieval	CodeCode Available	1
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention	Mar 1, 2025	ClusteringRepresentation Learning	CodeCode Available	1
Noise-Injected Spiking Graph Convolution for Energy-Efficient 3D Point Cloud Denoising	Feb 27, 2025	DenoisingRepresentation Learning	CodeCode Available	1
EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training	Feb 26, 2025	MambaRepresentation Learning	CodeCode Available	1
Escaping The Big Data Paradigm in Self-Supervised Representation Learning	Feb 25, 2025	Representation Learning	CodeCode Available	1
Understanding the Emergence of Multimodal Representation Alignment	Feb 22, 2025	Representation Learning	CodeCode Available	1
RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm	Feb 18, 2025	Representation LearningRetrieval	CodeCode Available	1
Myna: Masking-Based Contrastive Learning of Musical Representations	Feb 18, 2025	Contrastive LearningData Augmentation	CodeCode Available	1
Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning	Feb 17, 2025	Audio ClassificationAudio Tagging	CodeCode Available	1
Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model	Feb 15, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting	Feb 14, 2025	Multivariate Time Series ForecastingRepresentation Learning	CodeCode Available	1
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata	Feb 11, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning	Feb 10, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
From Pixels to Components: Eigenvector Masking for Visual Representation Learning	Feb 10, 2025	image-classificationImage Classification	CodeCode Available	1
Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models	Feb 9, 2025	Audio-Visual Speech RecognitionAutomatic Speech Recognition	CodeCode Available	1
Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation Learning	Feb 8, 2025	Graph AttentionRepresentation Learning	CodeCode Available	1
Intent Representation Learning with Large Language Model for Recommendation	Feb 5, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification	Feb 4, 2025	Cell SegmentationDecoder	CodeCode Available	1

Show:10 25 50

← PrevPage 7 of 212Next →

All datasets SciDocs Animals-10 CIFAR10 Circle Data Sports10

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SciNCL	Avg.	81.8	—	Unverified
2	SPECTER	Avg.	80	—	Unverified
3	Citeomatic	Avg.	76	—	Unverified
4	Sci-DeCLUTR	Avg.	66.6	—	Unverified
5	SciBERT	Avg.	59.6	—	Unverified
6	BioBERT	Avg.	58.8	—	Unverified
7	CiteBERT	Avg.	58.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	top_model_weights_with_3d_2	1:1 Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Resnet 18	Accuracy (%)	97.05	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Morphological Network	Accuracy	97.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Max Margin Contrastive	Silhouette Score	0.56	—	Unverified