Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

Supervised representation learning: learning representations on task A using annotated data and used to solve task B
Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

Representation Learning: A Review and New Perspectives - Bengio et al. (2012)
A Few Words on Representation Learning - Thalles Silva

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 10580 papers

Title	Date	Tasks	Status	Hype
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models	May 26, 2023	GSM8KMultimodal Reasoning	CodeCode Available	3
Uni-QSAR: an Auto-ML Tool for Molecular Property Prediction	Apr 24, 2023	Drug DiscoveryModel Selection	CodeCode Available	3
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders	Jan 2, 2023	Object DetectionRepresentation Learning	CodeCode Available	3
ROLAND: Graph Learning Framework for Dynamic Graphs	Aug 15, 2022	Graph LearningGraph Representation Learning	CodeCode Available	3
Vision Transformers: From Semantic Segmentation to Dense Prediction	Jul 19, 2022	image-classificationImage Classification	CodeCode Available	3
Robust and Efficient Medical Imaging with Self-Supervision	May 19, 2022	DiagnosticRepresentation Learning	CodeCode Available	3
Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning	May 13, 2022	Knowledge GraphsRelation	CodeCode Available	3
UNetFormer: A Unified Vision Transformer Model and Pre-Training Framework for 3D Medical Image Segmentation	Apr 1, 2022	Brain Tumor SegmentationImage Segmentation	CodeCode Available	3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training	Aug 7, 2021	Contrastive LearningLanguage Modeling	CodeCode Available	3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding	Oct 23, 2020	Language ModelingLanguage Modelling	CodeCode Available	3
Momentum Contrast for Unsupervised Visual Representation Learning	Nov 13, 2019	Contrastive LearningImage Classification	CodeCode Available	3
Probabilistic Forecasting with Temporal Convolutional Neural Network	Jun 11, 2019	Multivariate Time Series ForecastingProbabilistic Time Series Forecasting	CodeCode Available	3
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion	Jul 8, 2025	3D geometryDomain Generalization	CodeCode Available	2
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model	Jun 26, 2025	Representation LearningRetrieval	CodeCode Available	2
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting	Jun 11, 2025	DiversityRepresentation Learning	CodeCode Available	2
BaryIR: Learning Multi-Source Unified Representation in Continuous Barycenter Space for Generalizable All-in-One Image Restoration	May 27, 2025	AllImage Restoration	CodeCode Available	2
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models	May 15, 2025	General KnowledgePrompt Engineering	CodeCode Available	2
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization	May 6, 2025	Active Speaker DetectionAudio-Visual Speech Recognition	CodeCode Available	2
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves	May 5, 2025	Image GenerationRepresentation Learning	CodeCode Available	2
Representation Learning for Tabular Data: A Comprehensive Survey	Apr 17, 2025	Representation LearningSurvey	CodeCode Available	2
SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining	Mar 25, 2025	Autonomous DrivingComputational Efficiency	CodeCode Available	2
Manify: A Python Library for Learning Non-Euclidean Representations	Mar 12, 2025	Representation Learning	CodeCode Available	2
MMRL: Multi-Modal Representation Learning for Vision-Language Models	Mar 11, 2025	Prompt EngineeringRepresentation Learning	CodeCode Available	2
LongProLIP: A Probabilistic Vision-Language Model with Long Context Text	Mar 11, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
USP: Unified Self-Supervised Pretraining for Image Generation and Understanding	Mar 8, 2025	Image GenerationRepresentation Learning	CodeCode Available	2
Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation	Mar 3, 2025	Representation LearningRetrieval	CodeCode Available	2
Sanity Checking Causal Representation Learning on a Simple Real-World System	Feb 27, 2025	Representation Learning	CodeCode Available	2
Cluster and Predict Latents Patches for Improved Masked Image Modeling	Feb 12, 2025	Representation Learning	CodeCode Available	2
TimeKAN: KAN-based Frequency Decomposition Learning Architecture for Long-term Time Series Forecasting	Feb 10, 2025	Representation LearningTime Series	CodeCode Available	2
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes	Jan 7, 2025	Mixture-of-ExpertsRepresentation Learning	CodeCode Available	2
Personalized Representation from Personalized Generation	Dec 20, 2024	Contrastive LearningImage Generation	CodeCode Available	2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning	Dec 16, 2024	DeepFake Detectiondiffusion-generated faces detection	CodeCode Available	2
Gramian Multimodal Representation Learning and Alignment	Dec 16, 2024	Contrastive LearningRepresentation Learning	CodeCode Available	2
Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach	Dec 16, 2024	Representation LearningRetrieval	CodeCode Available	2
DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis	Dec 16, 2024	DisentanglementMultimodal Sentiment Analysis	CodeCode Available	2
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Dec 5, 2024	Image ComprehensionRepresentation Learning	CodeCode Available	2
SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Dec 5, 2024	Domain AdaptationDomain Generalization	CodeCode Available	2
CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View Graphs	Nov 21, 2024	Clone DetectionCode Search	CodeCode Available	2
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models	Nov 21, 2024	image-classificationImage Classification	CodeCode Available	2
Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis	Nov 4, 2024	Contrastive LearningDiversity	CodeCode Available	2
VecCity: A Taxonomy-guided Library for Map Entity Representation Learning	Oct 31, 2024	Representation Learning	CodeCode Available	2
PaPaGei: Open Foundation Models for Optical Physiological Signals	Oct 27, 2024	Contrastive LearningDomain Generalization	CodeCode Available	2
TIPS: Text-Image Pretraining with Spatial Awareness	Oct 21, 2024	Depth EstimationImage Captioning	CodeCode Available	2
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios	Oct 18, 2024	Anomaly ClassificationAnomaly Detection	CodeCode Available	2
Explanation-Preserving Augmentation for Semi-Supervised Graph Representation Learning	Oct 16, 2024	Graph ClassificationGraph Representation Learning	CodeCode Available	2
Multiview Scene Graph	Oct 15, 2024	DecoderObject	CodeCode Available	2
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement	Oct 15, 2024	DisentanglementInductive Bias	CodeCode Available	2
MatMamba: A Matryoshka State Space Model	Oct 9, 2024	modelRepresentation Learning	CodeCode Available	2
Compositional Entailment Learning for Hyperbolic Vision-Language Models	Oct 9, 2024	Language ModellingRepresentation Learning	CodeCode Available	2
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection	Oct 1, 2024	3D Anomaly DetectionAnomaly Detection	CodeCode Available	2

Show:10 25 50

← PrevPage 2 of 212Next →

All datasets SciDocs Animals-10 CIFAR10 Circle Data Sports10

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SciNCL	Avg.	81.8	—	Unverified
2	SPECTER	Avg.	80	—	Unverified
3	Citeomatic	Avg.	76	—	Unverified
4	Sci-DeCLUTR	Avg.	66.6	—	Unverified
5	SciBERT	Avg.	59.6	—	Unverified
6	BioBERT	Avg.	58.8	—	Unverified
7	CiteBERT	Avg.	58.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	top_model_weights_with_3d_2	1:1 Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Resnet 18	Accuracy (%)	97.05	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Morphological Network	Accuracy	97.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Max Margin Contrastive	Silhouette Score	0.56	—	Unverified