Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

Supervised representation learning: learning representations on task A using annotated data and used to solve task B
Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

Representation Learning: A Review and New Perspectives - Bengio et al. (2012)
A Few Words on Representation Learning - Thalles Silva

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 10580 papers

Title	Date	Tasks	Status	Hype
QAEncoder: Towards Aligned Representation Learning in Question Answering System	Sep 30, 2024	Document EmbeddingQuestion Answering	CodeCode Available	2
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection	Sep 26, 2024	Event DetectionRepresentation Learning	CodeCode Available	2
Progressive Representation Learning for Real-Time UAV Tracking	Sep 25, 2024	ObjectObject Tracking	CodeCode Available	2
SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training	Aug 15, 2024	Continual Learningimage-classification	CodeCode Available	2
Multistain Pretraining for Slide Representation Learning in Pathology	Aug 5, 2024	Representation LearningSelf-Supervised Learning	CodeCode Available	2
NAVIX: Scaling MiniGrid Environments with JAX	Jul 28, 2024	CPUDeep Reinforcement Learning	CodeCode Available	2
Contrastive Learning of Asset Embeddings from Financial Time Series	Jul 26, 2024	Contrastive LearningManagement	CodeCode Available	2
Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation	Jul 26, 2024	Knowledge DistillationQuestion Answering	CodeCode Available	2
Representation Learning and Identity Adversarial Training for Facial Behavior Understanding	Jul 15, 2024	Facial Action Unit DetectionFacial Expression Recognition (FER)	CodeCode Available	2
Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation	Jul 11, 2024	object-detectionObject Detection	CodeCode Available	2
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer	Jul 10, 2024	DecoderHandwritten Mathmatical Expression Recognition	CodeCode Available	2
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data	Jul 10, 2024	Contrastive Learningmultimodal interaction	CodeCode Available	2
4D Contrastive Superflows are Dense 3D Representation Learners	Jul 8, 2024	Autonomous DrivingContrastive Learning	CodeCode Available	2
HiDe-PET: Continual Learning via Hierarchical Decomposition of Parameter-Efficient Tuning	Jul 7, 2024	Continual LearningRepresentation Learning	CodeCode Available	2
Diffusion Models and Representation Learning: A Survey	Jun 30, 2024	DenoisingRepresentation Learning	CodeCode Available	2
TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning	Jun 21, 2024	FairnessGeographic Question Answering	CodeCode Available	2
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images	Jun 17, 2024	GPUObject	CodeCode Available	2
DiffMM: Multi-Modal Diffusion Model for Recommendation	Jun 17, 2024	Contrastive Learningmodel	CodeCode Available	2
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer	Jun 13, 2024	Face Image QualityFace Image Quality Assessment	CodeCode Available	2
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer	Jun 12, 2024	Image DehazingNonhomogeneous Image Dehazing	CodeCode Available	2
RWKV-CLIP: A Robust Vision-Language Representation Learner	Jun 11, 2024	Image-text RetrievalRepresentation Learning	CodeCode Available	2
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning	Jun 5, 2024	Audio ClassificationClassification	CodeCode Available	2
Learning Manipulation by Predicting Interaction	Jun 1, 2024	Representation Learning	CodeCode Available	2
Matryoshka Query Transformer for Large Vision-Language Models	May 29, 2024	Language ModellingRepresentation Learning	CodeCode Available	2
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention	May 28, 2024	GPURepresentation Learning	CodeCode Available	2
SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals	May 28, 2024	Contrastive LearningRepresentation Learning	CodeCode Available	2
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model	May 20, 2024	Audio ClassificationGPU	CodeCode Available	2
Transcriptomics-guided Slide Representation Learning in Computational Pathology	May 19, 2024	Contrastive LearningRepresentation Learning	CodeCode Available	2
Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask	May 9, 2024	Anomaly DetectionImputation	CodeCode Available	2
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning	May 4, 2024	Earth Observationimage-classification	CodeCode Available	2
Benchmarking Representations for Speech, Music, and Acoustic Events	May 2, 2024	Audio ClassificationBenchmarking	CodeCode Available	2
Vim4Path: Self-Supervised Vision Mamba for Histopathology Images	Apr 20, 2024	DiagnosticMamba	CodeCode Available	2
ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier Transformer	Apr 18, 2024	Image Shadow Removalobject-detection	CodeCode Available	2
VideoSAGE: Video Summarization with Graph Representation Learning	Apr 14, 2024	Graph Representation LearningNode Classification	CodeCode Available	2
MindBridge: A Cross-Subject Brain Decoding Framework	Apr 11, 2024	Brain DecodingData Augmentation	CodeCode Available	2
Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study	Apr 10, 2024	Representation LearningTime Series	CodeCode Available	2
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields	Apr 1, 2024	3D Object DetectionNeRF	CodeCode Available	2
Omni-Kernel Network for Image Restoration	Mar 24, 2024	DeblurringImage Defocus Deblurring	CodeCode Available	2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning	Mar 13, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image Analysis	Mar 12, 2024	Graph Representation LearningRepresentation Learning	CodeCode Available	2
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture	Mar 12, 2024	Motion MagnificationRepresentation Learning	CodeCode Available	2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement	Mar 11, 2024	Clinical KnowledgeDescriptive	CodeCode Available	2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation	Mar 3, 2024	ObjectRepresentation Learning	CodeCode Available	2
Dual-domain strip attention for image restoration	Mar 1, 2024	DeblurringDenoising	CodeCode Available	2
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition	Feb 29, 2024	Representation LearningVisual Place Recognition	CodeCode Available	2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning	Feb 28, 2024	Contrastive LearningDecision Making	CodeCode Available	2
CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision	Feb 26, 2024	Representation LearningTransfer Learning	CodeCode Available	2
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked Inputs	Feb 17, 2024	EEGEEG Signal Classification	CodeCode Available	2
Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning	Feb 7, 2024	Contrastive LearningPrediction	CodeCode Available	2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram	Feb 2, 2024	DiagnosticECG Classification	CodeCode Available	2

Show:10 25 50

← PrevPage 3 of 212Next →

All datasets SciDocs Animals-10 CIFAR10 Circle Data Sports10

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SciNCL	Avg.	81.8	—	Unverified
2	SPECTER	Avg.	80	—	Unverified
3	Citeomatic	Avg.	76	—	Unverified
4	Sci-DeCLUTR	Avg.	66.6	—	Unverified
5	SciBERT	Avg.	59.6	—	Unverified
6	BioBERT	Avg.	58.8	—	Unverified
7	CiteBERT	Avg.	58.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	top_model_weights_with_3d_2	1:1 Accuracy	0.75	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Resnet 18	Accuracy (%)	97.05	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Morphological Network	Accuracy	97.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Max Margin Contrastive	Silhouette Score	0.56	—	Unverified