SOTAVerified

Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models: they encode the input and project it into a different subspace, and the resulting representation is then usually passed to a simple model, such as a linear classifier, to solve the task at hand.
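
As a rough illustration of this pipeline, the sketch below freezes a pretrained encoder and trains a linear classifier on its representations (the standard "linear probe" setup). It assumes PyTorch and torchvision are available; the ResNet-18 backbone, feature dimension, class count, and hyperparameters are illustrative choices, not a prescribed recipe.

```python
# Minimal linear-probe sketch: a frozen pretrained encoder supplies
# representations, and only a linear classifier on top is trained.
import torch
import torch.nn as nn
from torchvision import models

# Pretrained backbone used purely as a representation extractor.
backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
backbone.fc = nn.Identity()          # drop the original classification head
backbone.eval()
for p in backbone.parameters():
    p.requires_grad = False          # freeze the learned representation

num_classes = 10                     # placeholder for the downstream task
probe = nn.Linear(512, num_classes)  # ResNet-18 features are 512-dimensional
optimizer = torch.optim.Adam(probe.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

def train_step(images, labels):
    with torch.no_grad():
        features = backbone(images)  # (batch, 512) representations
    logits = probe(features)
    loss = criterion(logits, labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```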

Representation learning can be divided into:

  • Supervised representation learning: learning representations on task A using annotated data, then reusing them to solve task B.
  • Unsupervised representation learning: learning representations on a task using unlabelled data. These representations are then used to address downstream tasks, reducing the need for annotated data when learning new tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks (a minimal sketch follows this list).
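
As a hedged sketch of reusing unsupervised representations for a downstream task: sentence embeddings from a pretrained BERT feed a simple logistic-regression classifier. It assumes the Hugging Face transformers library and scikit-learn; the bert-base-uncased checkpoint, mean pooling, and the toy sentiment data are illustrative assumptions.

```python
# Reuse pretrained (unsupervised) text representations for a downstream task.
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import LogisticRegression

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")
encoder.eval()

def embed(sentences):
    """Mean-pool the last hidden states into one vector per sentence."""
    batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = encoder(**batch).last_hidden_state      # (batch, seq, 768)
    mask = batch["attention_mask"].unsqueeze(-1)         # ignore padding tokens
    return ((hidden * mask).sum(1) / mask.sum(1)).numpy()

# Tiny illustrative downstream task: the labelled examples are placeholders.
texts = ["great movie", "terrible plot", "loved it", "boring and slow"]
labels = [1, 0, 1, 0]
clf = LogisticRegression(max_iter=1000).fit(embed(texts), labels)
print(clf.predict(embed(["what a fantastic film"])))
```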

More recently, self-supervised learning (SSL) has become one of the main drivers of unsupervised representation learning in fields like computer vision and NLP.
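
One popular SSL recipe is contrastive learning. Below is a minimal sketch of a simplified SimCLR-style NT-Xent loss (assuming PyTorch; the batch size, projection dimension, and temperature are illustrative): projections of two augmented views of the same input are pulled together, while the other samples in the batch act as negatives.

```python
# Simplified SimCLR-style NT-Xent contrastive loss.
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, temperature=0.5):
    """Contrastive loss over two batches of projections z1, z2 of shape (N, D)."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    z = torch.cat([z1, z2], dim=0)                 # (2N, D) stacked views
    sim = z @ z.t() / temperature                  # cosine similarities
    n = z1.size(0)
    mask = torch.eye(2 * n, dtype=torch.bool)
    sim.masked_fill_(mask, float("-inf"))          # exclude self-similarity
    # The positive for sample i is its other view, located n positions away.
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)])
    return F.cross_entropy(sim, targets)

# Usage: z1 and z2 would be projections of two augmentations of the same batch.
z1, z2 = torch.randn(8, 128), torch.randn(8, 128)
print(nt_xent(z1, z2).item())
```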


(Image credit: Visualizing and Understanding Convolutional Networks)

Papers


Title | Status | Hype
VMamba: Visual State Space Model | Code | 7
Full Scaling Automation for Sustainable Development of Green Data Centers | Code | 7
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech | Code | 6
Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments | Code | 5
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Code | 5
Point Transformer V3: Simpler Faster Stronger | Code | 5
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs | Code | 5
A Time Series is Worth 64 Words: Long-term Forecasting with Transformers | Code | 5
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages | Code | 5
Masked Completion via Structured Diffusion with White-Box Transformers | Code | 5
Self-Supervised Pre-Training for Table Structure Recognition Transformer | Code | 4
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything | Code | 4
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard | Code | 4
ControlVAE: Tuning, Analytical Properties, and Performance Analysis | Code | 4
Morphological Prototyping for Unsupervised Slide Representation Learning in Computational Pathology | Code | 4
Lightweight Pixel Difference Networks for Efficient Visual Representation Learning | Code | 4
A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions | Code | 4
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining | Code | 4
2D Matryoshka Sentence Embeddings | Code | 4
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Code | 4
Sundial: A Family of Highly Capable Time Series Foundation Models | Code | 4
SVFR: A Unified Framework for Generalized Video Face Restoration | Code | 4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Code | 4
Multi-label Cluster Discrimination for Visual Representation Learning | Code | 4
ROLAND: Graph Learning Framework for Dynamic Graphs | Code | 3
Robust and Efficient Medical Imaging with Self-Supervision | Code | 3
Common Sense Reasoning for Deepfake Detection | Code | 3
Probabilistic Forecasting with Temporal Convolutional Neural Network | Code | 3
Addressing Representation Collapse in Vector Quantized Models with One Linear Layer | Code | 3
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders | Code | 3
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain | Code | 3
Point Transformer V3: Simpler, Faster, Stronger | Code | 3
Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis | Code | 3
SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity | Code | 3
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss | Code | 3
Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction | Code | 3
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models | Code | 3
HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image Analysis | Code | 3
Momentum Contrast for Unsupervised Visual Representation Learning | Code | 3
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization | Code | 3
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting | Code | 3
A Survey on Self-Supervised Learning for Non-Sequential Tabular Data | Code | 3
Foundation Models for Music: A Survey | Code | 3
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding | Code | 3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding | Code | 3
Evaluating representation learning on the protein structure universe | Code | 3
EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG Signals | Code | 3
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Code | 3
Elucidating the Design Space of Multimodal Protein Language Models | Code | 3
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs | Code | 3

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SciNCL | Avg. | 81.8 | | Unverified
2 | SPECTER | Avg. | 80 | | Unverified
3 | Citeomatic | Avg. | 76 | | Unverified
4 | Sci-DeCLUTR | Avg. | 66.6 | | Unverified
5 | SciBERT | Avg. | 59.6 | | Unverified
6 | BioBERT | Avg. | 58.8 | | Unverified
7 | CiteBERT | Avg. | 58.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | top_model_weights_with_3d_2 | 1:1 Accuracy | 0.75 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Resnet 18 | Accuracy (%) | 97.05 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Morphological Network | Accuracy | 97.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Max Margin Contrastive | Silhouette Score | 0.56 | | Unverified