Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

Supervised representation learning: learning representations on task A using annotated data and used to solve task B
Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

Representation Learning: A Review and New Perspectives - Bengio et al. (2012)
A Few Words on Representation Learning - Thalles Silva

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 10580 papers

Title	Date	Tasks	Status	Hype
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard	Jun 13, 2023	Information RetrievalRepresentation Learning	CodeCode Available	4
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models	Jan 30, 2023	Generative Visual Question AnsweringImage Captioning	CodeCode Available	4
A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions	Jun 15, 2022	ClusteringDeep Clustering	CodeCode Available	4
ControlVAE: Tuning, Analytical Properties, and Performance Analysis	Oct 31, 2020	DisentanglementImage Generation	CodeCode Available	4
Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis	May 16, 2025	Continual LearningRepresentation Learning	CodeCode Available	3
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain	May 12, 2025	Multivariate Time Series ForecastingRepresentation Learning	CodeCode Available	3
Elucidating the Design Space of Multimodal Protein Language Models	Apr 15, 2025	DiversityRepresentation Learning	CodeCode Available	3
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation	Apr 11, 2025	DecoderImage Generation	CodeCode Available	3
Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction	Mar 22, 2025	Graph AttentionPrediction	CodeCode Available	3
NdLinear Is All You Need for Representation Learning	Mar 21, 2025	AllRepresentation Learning	CodeCode Available	3

Show:10 25 50

← PrevPage 3 of 1058Next →

All datasets SciDocs Animals-10 CIFAR10 Circle Data Sports10

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SciNCL	Avg.	81.8	—	Unverified
2	SPECTER	Avg.	80	—	Unverified
3	Citeomatic	Avg.	76	—	Unverified
4	Sci-DeCLUTR	Avg.	66.6	—	Unverified
5	SciBERT	Avg.	59.6	—	Unverified
6	BioBERT	Avg.	58.8	—	Unverified
7	CiteBERT	Avg.	58.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	top_model_weights_with_3d_2	1:1 Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Resnet 18	Accuracy (%)	97.05	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Morphological Network	Accuracy	97.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Max Margin Contrastive	Silhouette Score	0.56	—	Unverified