SOTAVerified

Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

  • Supervised representation learning: learning representations on task A using annotated data and used to solve task B
  • Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Showing 51100 of 10580 papers

TitleStatusHype
NdLinear Is All You Need for Representation LearningCode3
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image AnalysisCode3
Multi-Modality Representation Learning for Antibody-Antigen Interactions PredictionCode3
OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed DomainCode3
Probabilistic Forecasting with Temporal Convolutional Neural NetworkCode3
HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image AnalysisCode3
Momentum Contrast for Unsupervised Visual Representation LearningCode3
A Survey on Self-Supervised Learning for Non-Sequential Tabular DataCode3
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingCode3
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian SplattingCode3
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image GenerationCode3
Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation HypothesisCode3
FasterViT: Fast Vision Transformers with Hierarchical AttentionCode2
Fast Training of Diffusion Models with Masked TransformersCode2
Explanation-Preserving Augmentation for Semi-Supervised Graph Representation LearningCode2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation LearningCode2
Feed-Forward SceneDINO for Unsupervised Semantic Scene CompletionCode2
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point CloudsCode2
A Systematic Study of Joint Representation Learning on Protein Sequences and StructuresCode2
End-to-end Learnable Clustering for Intent Learning in RecommendationCode2
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive LearningCode2
Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic NetworksCode2
Effective Data Augmentation With Diffusion ModelsCode2
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerCode2
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked InputsCode2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic SegmentationCode2
MogaNet: Multi-order Gated Aggregation NetworkCode2
All in One: Exploring Unified Video-Language Pre-trainingCode2
Dual-domain strip attention for image restorationCode2
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
Domino: Discovering Systematic Errors with Cross-Modal EmbeddingsCode2
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-trainingCode2
Dink-Net: Neural Clustering on Large GraphsCode2
Divot: Diffusion Powers Video Tokenizer for Comprehension and GenerationCode2
Do Transformers Really Perform Bad for Graph Representation?Code2
Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image AnalysisCode2
Delving into Inter-Image Invariance for Unsupervised Visual RepresentationsCode2
Decoupling Representation Learning from Reinforcement LearningCode2
Deconstructing Denoising Diffusion Models for Self-Supervised LearningCode2
Deep Reinforcement Learning for Multi-Agent InteractionCode2
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
Cross-view Masked Diffusion Transformers for Person Image SynthesisCode2
DeepSVG: A Hierarchical Generative Network for Vector Graphics AnimationCode2
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional TransformerCode2
Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote SensingCode2
DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly DetectionCode2
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D VisionCode2
DLF: Disentangled-Language-Focused Multimodal Sentiment AnalysisCode2
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided TransformerCode2
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place RecognitionCode2
Show:102550
← PrevPage 2 of 212Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SciNCLAvg.81.8Unverified
2SPECTERAvg.80Unverified
3CiteomaticAvg.76Unverified
4Sci-DeCLUTRAvg.66.6Unverified
5SciBERTAvg.59.6Unverified
6BioBERTAvg.58.8Unverified
7CiteBERTAvg.58.8Unverified
#ModelMetricClaimedVerifiedStatus
1top_model_weights_with_3d_21:1 Accuracy0.75Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet 18Accuracy (%)97.05Unverified
#ModelMetricClaimedVerifiedStatus
1Morphological NetworkAccuracy97.3Unverified
#ModelMetricClaimedVerifiedStatus
1Max Margin ContrastiveSilhouette Score0.56Unverified