SOTAVerified

Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

  • Supervised representation learning: learning representations on task A using annotated data and used to solve task B
  • Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Showing 301350 of 10580 papers

TitleStatusHype
Modality Curation: Building Universal Embeddings for Advanced Multimodal Information RetrievalCode1
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image SegmentationCode1
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark DatasetCode1
PyTDC: A multimodal machine learning training, evaluation, and inference platform for biomedical foundation modelsCode1
fastabx: A library for efficient computation of ABX discriminabilityCode1
SpectrumFM: A Foundation Model for Intelligent Spectrum ManagementCode1
Recursive KL Divergence Optimization: A Dynamic Framework for Representation LearningCode1
TSRM: A Lightweight Temporal Feature Encoding Architecture for Time Series Forecasting and ImputationCode1
Quadratic Interest Network for Multimodal Click-Through Rate PredictionCode1
PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud LearningCode1
Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identificationCode1
Mitigating Degree Bias in Graph Representation Learning with Learnable Structural Augmentation and Structural Self-AttentionCode1
CheXWorld: Exploring Image World Modeling for Radiograph Representation LearningCode1
NetTAG: A Multimodal RTL-and-Layout-Aligned Netlist Foundation Model via Text-Attributed GraphCode1
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical ImagingCode1
Robo-taxi Fleet Coordination at Scale via Reinforcement LearningCode1
COHESION: Composite Graph Convolutional Network with Dual-Stage Fusion for Multimodal RecommendationCode1
Learning to Normalize on the SPD Manifold under Bures-Wasserstein GeometryCode1
SMILE: Infusing Spatial and Motion Semantics in Masked Video LearningCode1
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and QuantizationCode1
Pluggable Style Representation Learning for Multi-Style TransferCode1
EditCLIP: Representation Learning for Image EditingCode1
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive FinetuningCode1
MoST: Efficient Monarch Sparse Tuning for 3D Representation LearningCode1
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous DrivingCode1
When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation LearningCode1
Advancing Medical Representation Learning Through High-Quality DataCode1
Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a MeasurementCode1
REF-VLM: Triplet-Based Referring Paradigm for Unified Visual DecodingCode1
Dynamic Dictionary Learning for Remote Sensing Image SegmentationCode1
Studying the Interplay Between the Actor and Critic Representations in Reinforcement LearningCode1
Improve Representation for Imbalanced Regression through Geometric ConstraintsCode1
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation LearningCode1
MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and RetentionCode1
Noise-Injected Spiking Graph Convolution for Energy-Efficient 3D Point Cloud DenoisingCode1
EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-trainingCode1
Escaping The Big Data Paradigm in Self-Supervised Representation LearningCode1
Understanding the Emergence of Multimodal Representation AlignmentCode1
RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation ParadigmCode1
Myna: Masking-Based Contrastive Learning of Musical RepresentationsCode1
Masked Latent Prediction and Classification for Self-Supervised Audio Representation LearningCode1
Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language ModelCode1
AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series ForecastingCode1
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed MetadataCode1
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation LearningCode1
From Pixels to Components: Eigenvector Masking for Visual Representation LearningCode1
Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation ModelsCode1
Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation LearningCode1
Intent Representation Learning with Large Language Model for RecommendationCode1
Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and ClassificationCode1
Show:102550
← PrevPage 7 of 212Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SciNCLAvg.81.8Unverified
2SPECTERAvg.80Unverified
3CiteomaticAvg.76Unverified
4Sci-DeCLUTRAvg.66.6Unverified
5SciBERTAvg.59.6Unverified
6BioBERTAvg.58.8Unverified
7CiteBERTAvg.58.8Unverified
#ModelMetricClaimedVerifiedStatus
1top_model_weights_with_3d_21:1 Accuracy0.75Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet 18Accuracy (%)97.05Unverified
#ModelMetricClaimedVerifiedStatus
1Morphological NetworkAccuracy97.3Unverified
#ModelMetricClaimedVerifiedStatus
1Max Margin ContrastiveSilhouette Score0.56Unverified