Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

Supervised representation learning: learning representations on task A using annotated data and used to solve task B
Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

Representation Learning: A Review and New Perspectives - Bengio et al. (2012)
A Few Words on Representation Learning - Thalles Silva

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 10580 papers

Title	Date	Tasks	Status	Hype	Score
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation	Sep 18, 2023	3D geometryDecoder	CodeCode Available	2	5
Delving into Inter-Image Invariance for Unsupervised Visual Representations	Aug 26, 2020	Contrastive LearningPseudo Label	CodeCode Available	2	5
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision	Dec 26, 2023	Deep LearningNeRF	CodeCode Available	2	5
DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis	Dec 16, 2024	DisentanglementMultimodal Sentiment Analysis	CodeCode Available	2	5
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer	Jun 13, 2024	Face Image QualityFace Image Quality Assessment	CodeCode Available	2	5
Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study	Apr 10, 2024	Representation LearningTime Series	CodeCode Available	2	5
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images	Jun 17, 2024	GPUObject	CodeCode Available	2	5
Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image Analysis	Mar 12, 2024	Graph Representation LearningRepresentation Learning	CodeCode Available	2	5
A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics	Jun 1, 2023	DiagnosticRepresentation Learning	CodeCode Available	2	5
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer	Jun 9, 2022	Autonomous DrivingGPU	CodeCode Available	2	5
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings	Apr 21, 2022	Contrastive LearningLanguage Modeling	CodeCode Available	2	5
DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation	Jul 22, 2020	Representation LearningVector Graphics	CodeCode Available	2	5
Deep Reinforcement Learning for Multi-Agent Interaction	Aug 2, 2022	BIG-bench Machine LearningCausal Inference	CodeCode Available	2	5
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer	Jun 12, 2024	Image DehazingNonhomogeneous Image Dehazing	CodeCode Available	2	5
DiffMM: Multi-Modal Diffusion Model for Recommendation	Jun 17, 2024	Contrastive Learningmodel	CodeCode Available	2	5
Explanation-Preserving Augmentation for Semi-Supervised Graph Representation Learning	Oct 16, 2024	Graph ClassificationGraph Representation Learning	CodeCode Available	2	5
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning	Feb 28, 2024	Contrastive LearningDecision Making	CodeCode Available	2	5
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios	Oct 18, 2024	Anomaly ClassificationAnomaly Detection	CodeCode Available	2	5
Deconstructing Denoising Diffusion Models for Self-Supervised Learning	Jan 25, 2024	DenoisingImage Generation	CodeCode Available	2	5
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion	Jul 8, 2025	3D geometryDomain Generalization	CodeCode Available	2	5
4D Contrastive Superflows are Dense 3D Representation Learners	Jul 8, 2024	Autonomous DrivingContrastive Learning	CodeCode Available	2	5
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence	May 24, 2023	Dense Pixel Correspondence EstimationRepresentation Learning	CodeCode Available	2	5
DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly Detection	Jun 17, 2023	Anomaly DetectionContrastive Learning	CodeCode Available	2	5
Decoupling Representation Learning from Reinforcement Learning	Sep 14, 2020	Data AugmentationDeep Reinforcement Learning	CodeCode Available	2	5
Diffusion Models and Representation Learning: A Survey	Jun 30, 2024	DenoisingRepresentation Learning	CodeCode Available	2	5
Geometry-Complete Diffusion for 3D Molecule Generation and Optimization	Feb 8, 2023	3D Molecule GenerationDenoising	CodeCode Available	2	5
Gramian Multimodal Representation Learning and Alignment	Dec 16, 2024	Contrastive LearningRepresentation Learning	CodeCode Available	2	5
Graph4Rec: A Universal Toolkit with Graph Neural Networks for Recommender Systems	Dec 2, 2021	graph constructionGraph Neural Network	CodeCode Available	2	5
Graph Neural Networks for Natural Language Processing: A Survey	Jun 10, 2021	Decodergraph construction	CodeCode Available	2	5
Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and Directions	Jan 4, 2024	Representation LearningSurvey	CodeCode Available	2	5
Effective Data Augmentation With Diffusion Models	Feb 7, 2023	Data AugmentationDiversity	CodeCode Available	2	5
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram	Feb 2, 2024	DiagnosticECG Classification	CodeCode Available	2	5
Hierarchical Fine-Grained Image Forgery Detection and Localization	Mar 30, 2023	AttributeClassification	CodeCode Available	2	5
Hierarchical Open-vocabulary Universal Image Segmentation	Jul 3, 2023	Image ComprehensionImage Segmentation	CodeCode Available	2	5
Crafting Better Contrastive Views for Siamese Representation Learning	Feb 7, 2022	Contrastive LearningObject Localization	CodeCode Available	2	5
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning	Jun 5, 2024	Audio ClassificationClassification	CodeCode Available	2	5
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model	Jun 26, 2025	Representation LearningRetrieval	CodeCode Available	2	5
Knowledge Representation Learning: A Quantitative Review	Dec 28, 2018	General ClassificationInformation Retrieval	CodeCode Available	2	5
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding	Nov 29, 2022	3D Open-Vocabulary Instance SegmentationContrastive Learning	CodeCode Available	2	5
Language-Driven Representation Learning for Robotics	Feb 24, 2023	Contrastive LearningImitation Learning	CodeCode Available	2	5
Learning to Prompt for Vision-Language Models	Sep 2, 2021	Domain GeneralizationFew-shot Age Estimation	CodeCode Available	2	5
Learning Vision from Models Rivals Learning Vision from Data	Dec 28, 2023	Contrastive LearningImage Captioning	CodeCode Available	2	5
Counterfactual Learning on Graphs: A Survey	Apr 3, 2023	counterfactualFairness	CodeCode Available	2	5
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition	Feb 29, 2024	Representation LearningVisual Place Recognition	CodeCode Available	2	5
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis	Nov 16, 2022	Image GenerationRepresentation Learning	CodeCode Available	2	5
Manify: A Python Library for Learning Non-Euclidean Representations	Mar 12, 2025	Representation Learning	CodeCode Available	2	5
Masked Autoencoders As Spatiotemporal Learners	May 18, 2022	Inductive BiasRepresentation Learning	CodeCode Available	2	5
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding	Nov 15, 2023	Highlight DetectionMoment Retrieval	CodeCode Available	2	5
Matryoshka Query Transformer for Large Vision-Language Models	May 29, 2024	Language ModellingRepresentation Learning	CodeCode Available	2	5
CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting	Feb 3, 2022	Contrastive LearningRepresentation Learning	CodeCode Available	2	5

Show:10 25 50

← PrevPage 4 of 212Next →

All datasets SciDocs Animals-10 CIFAR10 Circle Data Sports10

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SciNCL	Avg.	81.8	—	Unverified
2	SPECTER	Avg.	80	—	Unverified
3	Citeomatic	Avg.	76	—	Unverified
4	Sci-DeCLUTR	Avg.	66.6	—	Unverified
5	SciBERT	Avg.	59.6	—	Unverified
6	BioBERT	Avg.	58.8	—	Unverified
7	CiteBERT	Avg.	58.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	top_model_weights_with_3d_2	1:1 Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Resnet 18	Accuracy (%)	97.05	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Morphological Network	Accuracy	97.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Max Margin Contrastive	Silhouette Score	0.56	—	Unverified