SOTAVerified

Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

  • Supervised representation learning: learning representations on task A using annotated data and used to solve task B
  • Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Showing 151200 of 10580 papers

TitleStatusHype
Cross-view Masked Diffusion Transformers for Person Image SynthesisCode2
Graph Domain Adaptation: Challenges, Progress and ProspectsCode2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote SensingCode2
Deconstructing Denoising Diffusion Models for Self-Supervised LearningCode2
Rethinking Patch Dependence for Masked AutoencodersCode2
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space ModelCode2
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion RecognitionCode2
End-to-end Learnable Clustering for Intent Learning in RecommendationCode2
Singer Identity Representation Learning using Self-Supervised TechniquesCode2
Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, GeometryCode2
Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and DirectionsCode2
ChangeCLIP: Remote sensing change detection with multimodal vision-language representation learningCode2
Masked Modeling for Self-supervised Representation Learning on Vision and BeyondCode2
Learning Vision from Models Rivals Learning Vision from DataCode2
One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text PromptsCode2
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D VisionCode2
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive LearningCode2
BIRB: A Generalization Benchmark for Information Retrieval in BioacousticsCode2
Correlation-Guided Query-Dependency Calibration for Video Temporal GroundingCode2
SpectralGPT: Spectral Remote Sensing Foundation ModelCode2
High-Performance Transformers for Table Structure Recognition Need Early ConvolutionsCode2
Representation Learning with Large Language Models for RecommendationCode2
Pre-training Music Classification Models via Music Source SeparationCode2
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View RepresentationCode2
Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic NetworksCode2
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object DetectionCode2
PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation LearningCode2
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-trainingCode2
Hierarchical Open-vocabulary Universal Image SegmentationCode2
DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly DetectionCode2
Segment Any Point Cloud Sequences by Distilling Vision Foundation ModelsCode2
Fast Training of Diffusion Models with Masked TransformersCode2
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series ForecastingCode2
FasterViT: Fast Vision Transformers with Hierarchical AttentionCode2
MolFM: A Multimodal Molecular Foundation ModelCode2
A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnosticsCode2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation LearningCode2
Dink-Net: Neural Clustering on Large GraphsCode2
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic CorrespondenceCode2
Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product NetworksCode2
ULIP-2: Towards Scalable Multimodal Pre-training for 3D UnderstandingCode2
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene UnderstandingCode2
NeuralKG-ind: A Python Library for Inductive Knowledge Graph Representation LearningCode2
Unicom: Universal and Compact Representation Learning for Image RetrievalCode2
Counterfactual Learning on Graphs: A SurveyCode2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth EstimationCode2
Hierarchical Fine-Grained Image Forgery Detection and LocalizationCode2
A Systematic Study of Joint Representation Learning on Protein Sequences and StructuresCode2
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot LearnersCode2
Show:102550
← PrevPage 4 of 212Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SciNCLAvg.81.8Unverified
2SPECTERAvg.80Unverified
3CiteomaticAvg.76Unverified
4Sci-DeCLUTRAvg.66.6Unverified
5SciBERTAvg.59.6Unverified
6BioBERTAvg.58.8Unverified
7CiteBERTAvg.58.8Unverified
#ModelMetricClaimedVerifiedStatus
1top_model_weights_with_3d_21:1 Accuracy0.75Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet 18Accuracy (%)97.05Unverified
#ModelMetricClaimedVerifiedStatus
1Morphological NetworkAccuracy97.3Unverified
#ModelMetricClaimedVerifiedStatus
1Max Margin ContrastiveSilhouette Score0.56Unverified