SOTAVerified

Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

  • Supervised representation learning: learning representations on task A using annotated data and used to solve task B
  • Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Showing 51100 of 10580 papers

TitleStatusHype
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language ModelsCode3
Uni-QSAR: an Auto-ML Tool for Molecular Property PredictionCode3
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked AutoencodersCode3
ROLAND: Graph Learning Framework for Dynamic GraphsCode3
Vision Transformers: From Semantic Segmentation to Dense PredictionCode3
Robust and Efficient Medical Imaging with Self-SupervisionCode3
Simple and Effective Relation-based Embedding Propagation for Knowledge Representation LearningCode3
UNetFormer: A Unified Vision Transformer Model and Pre-Training Framework for 3D Medical Image SegmentationCode3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingCode3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language UnderstandingCode3
Momentum Contrast for Unsupervised Visual Representation LearningCode3
Probabilistic Forecasting with Temporal Convolutional Neural NetworkCode3
Feed-Forward SceneDINO for Unsupervised Semantic Scene CompletionCode2
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding ModelCode2
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian SplattingCode2
BaryIR: Learning Multi-Source Unified Representation in Continuous Barycenter Space for Generalizable All-in-One Image RestorationCode2
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language ModelsCode2
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative SynchronizationCode2
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by ThemselvesCode2
Representation Learning for Tabular Data: A Comprehensive SurveyCode2
SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data PretrainingCode2
Manify: A Python Library for Learning Non-Euclidean RepresentationsCode2
MMRL: Multi-Modal Representation Learning for Vision-Language ModelsCode2
LongProLIP: A Probabilistic Vision-Language Model with Long Context TextCode2
USP: Unified Self-Supervised Pretraining for Image Generation and UnderstandingCode2
Beyond Matryoshka: Revisiting Sparse Coding for Adaptive RepresentationCode2
Sanity Checking Causal Representation Learning on a Simple Real-World SystemCode2
Cluster and Predict Latents Patches for Improved Masked Image ModelingCode2
TimeKAN: KAN-based Frequency Decomposition Learning Architecture for Long-term Time Series ForecastingCode2
LiMoE: Mixture of LiDAR Representation Learners from Automotive ScenesCode2
Personalized Representation from Personalized GenerationCode2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation LearningCode2
Gramian Multimodal Representation Learning and AlignmentCode2
Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning ApproachCode2
DLF: Disentangled-Language-Focused Multimodal Sentiment AnalysisCode2
Divot: Diffusion Powers Video Tokenizer for Comprehension and GenerationCode2
SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation LearningCode2
CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View GraphsCode2
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language ModelsCode2
Learning General-Purpose Biomedical Volume Representations using Randomized SynthesisCode2
VecCity: A Taxonomy-guided Library for Map Entity Representation LearningCode2
PaPaGei: Open Foundation Models for Optical Physiological SignalsCode2
TIPS: Text-Image Pretraining with Spatial AwarenessCode2
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial ScenariosCode2
Explanation-Preserving Augmentation for Semi-Supervised Graph Representation LearningCode2
Multiview Scene GraphCode2
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model DisentanglementCode2
MatMamba: A Matryoshka State Space ModelCode2
Compositional Entailment Learning for Hyperbolic Vision-Language ModelsCode2
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly DetectionCode2
Show:102550
← PrevPage 2 of 212Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SciNCLAvg.81.8Unverified
2SPECTERAvg.80Unverified
3CiteomaticAvg.76Unverified
4Sci-DeCLUTRAvg.66.6Unverified
5SciBERTAvg.59.6Unverified
6BioBERTAvg.58.8Unverified
7CiteBERTAvg.58.8Unverified
#ModelMetricClaimedVerifiedStatus
1top_model_weights_with_3d_21:1 Accuracy0.75Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet 18Accuracy (%)97.05Unverified
#ModelMetricClaimedVerifiedStatus
1Morphological NetworkAccuracy97.3Unverified
#ModelMetricClaimedVerifiedStatus
1Max Margin ContrastiveSilhouette Score0.56Unverified