Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models: they typically encode the input into features that are projected into a different subspace. These representations are then usually passed on to a simple model, for instance a linear classifier, which is trained on top of them.
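The idea of training a linear classifier on top of fixed representations can be sketched as follows. This is a minimal illustration under stated assumptions, not a reference implementation: the "encoder" is a hypothetical stand-in (a fixed random projection with a ReLU), the data is a synthetic two-class toy set, and the linear classifier is fitted with ridge-regularized least squares on one-hot targets rather than with gradient descent.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data (assumption): two well-separated Gaussian blobs in 2-D.
n = 200
X = np.vstack([rng.normal(-2.0, 0.5, size=(n, 2)),
               rng.normal(+2.0, 0.5, size=(n, 2))])
y = np.concatenate([np.zeros(n, dtype=int), np.ones(n, dtype=int)])

# Hypothetical frozen encoder: a fixed random projection followed by ReLU.
# In practice this would be a pretrained network whose weights are not updated.
W_enc = rng.normal(size=(2, 16))

def encode(x):
    """Map raw inputs to 16-dimensional representations."""
    return np.maximum(x @ W_enc, 0.0)

def fit_linear_probe(feats, labels, num_classes, ridge=1e-3):
    """Fit a linear classifier (with bias) by ridge-regularized least squares."""
    F = np.hstack([feats, np.ones((len(feats), 1))])  # append bias column
    Y = np.eye(num_classes)[labels]                   # one-hot targets
    A = F.T @ F + ridge * np.eye(F.shape[1])
    return np.linalg.solve(A, F.T @ Y)

def predict(feats, W):
    """Predict the class with the highest linear score."""
    F = np.hstack([feats, np.ones((len(feats), 1))])
    return np.argmax(F @ W, axis=1)

# Train the probe on 300 examples and evaluate on the remaining 100.
perm = rng.permutation(len(X))
train, test = perm[:300], perm[300:]
W_probe = fit_linear_probe(encode(X[train]), y[train], num_classes=2)
accuracy = float(np.mean(predict(encode(X[test]), W_probe) == y[test]))
print(f"linear-probe accuracy: {accuracy:.2f}")
```

Only `W_probe` is fitted here; the encoder weights `W_enc` stay fixed, which is what makes the probe a measure of how linearly separable the representations are.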

Representation learning can be divided into:

Supervised representation learning: representations are learned on task A using annotated data and then used to solve task B.
Unsupervised representation learning: representations are learned on a task in an unsupervised way (label-free data). They are then used to address downstream tasks, reducing the need for annotated data when learning new tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.
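The label-free workflow described above can be illustrated with a small sketch. Everything here is an assumption made for illustration: PCA (computed via SVD) stands in for the unsupervised representation learner, the data is synthetic, and a nearest-centroid rule plays the role of the downstream model trained on only five labeled examples per class.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 20

# Assumption: the class signal lies along one random unit direction in 20-D.
direction = rng.normal(size=dim)
direction /= np.linalg.norm(direction)

def sample(n_per_class):
    """Draw n_per_class points for each of two classes (toy data)."""
    noise = rng.normal(0.0, 0.5, size=(2 * n_per_class, dim))
    y = np.concatenate([np.zeros(n_per_class, dtype=int),
                        np.ones(n_per_class, dtype=int)])
    signs = np.where(y == 0, -3.0, 3.0)[:, None]
    return noise + signs * direction, y

# Step 1: learn the representation from unlabeled data only (labels discarded).
X_unlabeled, _ = sample(250)
mean = X_unlabeled.mean(axis=0)
_, _, Vt = np.linalg.svd(X_unlabeled - mean, full_matrices=False)
components = Vt[:2]  # top-2 principal directions

def encode(x):
    """Project inputs onto the learned principal directions."""
    return (x - mean) @ components.T

# Step 2: solve the downstream task with very few labels (5 per class).
X_small, y_small = sample(5)
Z_small = encode(X_small)
centroids = np.stack([Z_small[y_small == c].mean(axis=0) for c in (0, 1)])

# Step 3: evaluate with a nearest-centroid rule in representation space.
X_test, y_test = sample(100)
dists = np.linalg.norm(encode(X_test)[:, None, :] - centroids[None], axis=2)
accuracy = float(np.mean(np.argmin(dists, axis=1) == y_test))
print(f"downstream accuracy with 10 labels: {accuracy:.2f}")
```

The design choice to highlight is the split of data: the 500 unlabeled points shape the representation, while the 10 labeled points are used only to place the class centroids.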

More recently, self-supervised learning (SSL) has become one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.
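One common family of SSL objectives is contrastive: two augmented views of the same example should have similar representations, while views of different examples should not. The sketch below computes an InfoNCE-style loss with NumPy. It is an illustration under assumptions, not the method of any specific paper listed on this page: the embeddings are random vectors, the "augmentation" is small additive noise, and the temperature value is arbitrary.

```python
import numpy as np

def info_nce(z_a, z_b, temperature=0.1):
    """InfoNCE-style loss: row i of z_a should match row i of z_b."""
    # L2-normalize so that dot products are cosine similarities.
    z_a = z_a / np.linalg.norm(z_a, axis=1, keepdims=True)
    z_b = z_b / np.linalg.norm(z_b, axis=1, keepdims=True)
    logits = z_a @ z_b.T / temperature
    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positives sit on the diagonal; average their negative log-probability.
    return float(-np.mean(np.diag(log_prob)))

rng = np.random.default_rng(0)
z = rng.normal(size=(8, 32))  # hypothetical embeddings of 8 examples

# Two "views" of each example: the same embedding plus small noise.
view_a = z + 0.05 * rng.normal(size=z.shape)
view_b = z + 0.05 * rng.normal(size=z.shape)

aligned = info_nce(view_a, view_b)
# Shifting the rows pairs every example with the wrong partner.
mismatched = info_nce(view_a, np.roll(view_b, 1, axis=0))
print(f"aligned pairs: {aligned:.4f}  mismatched pairs: {mismatched:.4f}")
```

The loss is low when matching views are each other's nearest neighbours in the batch and high when they are not, which is the signal an encoder would be trained to minimize without any labels.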

Here are some additional readings to go deeper on the task:

Representation Learning: A Review and New Perspectives - Bengio et al. (2012)
A Few Words on Representation Learning - Thalles Silva

(Image credit: Visualizing and Understanding Convolutional Networks)

Papers


Showing 151–200 of 10580 papers

Title	Date	Tasks	Status	Hype
Cross-view Masked Diffusion Transformers for Person Image Synthesis	Feb 2, 2024	Denoising, Image Generation	Code Available	2
Graph Domain Adaptation: Challenges, Progress and Prospects	Feb 1, 2024	Domain Adaptation, GRAPH DOMAIN ADAPTATION	Code Available	2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing	Jan 29, 2024	GPU, Representation Learning	Code Available	2
Deconstructing Denoising Diffusion Models for Self-Supervised Learning	Jan 25, 2024	Denoising, Image Generation	Code Available	2
Rethinking Patch Dependence for Masked Autoencoders	Jan 25, 2024	Decoder, Instance Segmentation	Code Available	2
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model	Jan 17, 2024	GPU, Image Classification	Code Available	2
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition	Jan 11, 2024	Contrastive Learning, Dynamic Facial Expression Recognition	Code Available	2
End-to-end Learnable Clustering for Intent Learning in Recommendation	Jan 11, 2024	Clustering, Contrastive Learning	Code Available	2
Singer Identity Representation Learning using Self-Supervised Techniques	Jan 10, 2024	Domain Generalization, Representation Learning	Code Available	2
Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, Geometry	Jan 7, 2024	Data Augmentation, Drug Discovery	Code Available	2
Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and Directions	Jan 4, 2024	Representation Learning, Survey	Code Available	2
ChangeCLIP: Remote sensing change detection with multimodal vision-language representation learning	Jan 4, 2024	Change Detection, Decoder	Code Available	2
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond	Dec 31, 2023	Representation Learning, Self-Supervised Learning	Code Available	2
Learning Vision from Models Rivals Learning Vision from Data	Dec 28, 2023	Contrastive Learning, Image Captioning	Code Available	2
One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts	Dec 28, 2023	All, Anatomy	Code Available	2
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision	Dec 26, 2023	Deep Learning, NeRF	Code Available	2
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning	Dec 19, 2023	Contrastive Learning, Denoising	Code Available	2
BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics	Dec 12, 2023	Information Retrieval, Representation Learning	Code Available	2
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding	Nov 15, 2023	Highlight Detection, Moment Retrieval	Code Available	2
SpectralGPT: Spectral Remote Sensing Foundation Model	Nov 13, 2023	Change Detection, model	Code Available	2
High-Performance Transformers for Table Structure Recognition Need Early Convolutions	Nov 9, 2023	Decoder, Representation Learning	Code Available	2
Representation Learning with Large Language Models for Recommendation	Oct 24, 2023	Recommendation Systems, Representation Learning	Code Available	2
Pre-training Music Classification Models via Music Source Separation	Oct 24, 2023	Classification, Genre classification	Code Available	2
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation	Sep 18, 2023	3D geometry, Decoder	Code Available	2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation	Aug 15, 2023	3D Object Detection, Autonomous Driving	Code Available	2
Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic Networks	Aug 13, 2023	Graph Representation Learning, Link Prediction	Code Available	2
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection	Aug 10, 2023	Object, object-detection	Code Available	2
PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning	Aug 8, 2023	Representation Learning	Code Available	2
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training	Jul 31, 2023	Organ Segmentation, Representation Learning	Code Available	2
Hierarchical Open-vocabulary Universal Image Segmentation	Jul 3, 2023	Image Comprehension, Image Segmentation	Code Available	2
DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly Detection	Jun 17, 2023	Anomaly Detection, Contrastive Learning	Code Available	2
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models	Jun 15, 2023	Representation Learning, Transfer Learning	Code Available	2
Fast Training of Diffusion Models with Masked Transformers	Jun 15, 2023	Decoder, Denoising	Code Available	2
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting	Jun 14, 2023	Multivariate Time Series Forecasting, Representation Learning	Code Available	2
FasterViT: Fast Vision Transformers with Hierarchical Attention	Jun 9, 2023	Image Classification, object-detection	Code Available	2
MolFM: A Multimodal Molecular Foundation Model	Jun 6, 2023	Cross-Modal Retrieval, Knowledge Graphs	Code Available	2
A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics	Jun 1, 2023	Diagnostic, Representation Learning	Code Available	2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning	May 31, 2023	Decision Making, General Knowledge	Code Available	2
Dink-Net: Neural Clustering on Large Graphs	May 28, 2023	Clustering, Graph Clustering	Code Available	2
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence	May 24, 2023	Dense Pixel Correspondence Estimation, Representation Learning	Code Available	2
Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product Networks	May 17, 2023	Graph Classification, Graph Representation Learning	Code Available	2
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding	May 14, 2023	3D Classification, 3D Point Cloud Classification	Code Available	2
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding	May 1, 2023	3D Object Detection, Monocular Depth Estimation	Code Available	2
NeuralKG-ind: A Python Library for Inductive Knowledge Graph Representation Learning	Apr 28, 2023	Graph Representation Learning, Knowledge Graphs	Code Available	2
Unicom: Universal and Compact Representation Learning for Image Retrieval	Apr 12, 2023	Image Classification, Image Retrieval	Code Available	2
Counterfactual Learning on Graphs: A Survey	Apr 3, 2023	counterfactual, Fairness	Code Available	2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation	Apr 3, 2023	3D Object Detection, Autonomous Driving	Code Available	2
Hierarchical Fine-Grained Image Forgery Detection and Localization	Mar 30, 2023	Attribute, Classification	Code Available	2
A Systematic Study of Joint Representation Learning on Protein Sequences and Structures	Mar 11, 2023	Contrastive Learning, Protein Function Prediction	Code Available	2
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners	Mar 3, 2023	Few-Shot Learning, Representation Learning	Code Available	2


Datasets: SciDocs, Animals-10, CIFAR10, Circle Data, Sports10

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SciNCL	Avg.	81.8	—	Unverified
2	SPECTER	Avg.	80	—	Unverified
3	Citeomatic	Avg.	76	—	Unverified
4	Sci-DeCLUTR	Avg.	66.6	—	Unverified
5	SciBERT	Avg.	59.6	—	Unverified
6	BioBERT	Avg.	58.8	—	Unverified
7	CiteBERT	Avg.	58.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	top_model_weights_with_3d_2	1:1 Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Resnet 18	Accuracy (%)	97.05	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Morphological Network	Accuracy	97.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Max Margin Contrastive	Silhouette Score	0.56	—	Unverified