SOTAVerified

Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

  • Supervised representation learning: learning representations on task A using annotated data and used to solve task B
  • Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Showing 101150 of 10580 papers

TitleStatusHype
QAEncoder: Towards Aligned Representation Learning in Question Answering SystemCode2
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event DetectionCode2
Progressive Representation Learning for Real-Time UAV TrackingCode2
SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-trainingCode2
Multistain Pretraining for Slide Representation Learning in PathologyCode2
NAVIX: Scaling MiniGrid Environments with JAXCode2
Contrastive Learning of Asset Embeddings from Financial Time SeriesCode2
Towards A Generalizable Pathology Foundation Model via Unified Knowledge DistillationCode2
Representation Learning and Identity Adversarial Training for Facial Behavior UnderstandingCode2
Projecting Points to Axes: Oriented Object Detection via Point-Axis RepresentationCode2
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest TransformerCode2
TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete DataCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
HiDe-PET: Continual Learning via Hierarchical Decomposition of Parameter-Efficient TuningCode2
Diffusion Models and Representation Learning: A SurveyCode2
TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation LearningCode2
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
DiffMM: Multi-Modal Diffusion Model for RecommendationCode2
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided TransformerCode2
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional TransformerCode2
RWKV-CLIP: A Robust Vision-Language Representation LearnerCode2
Audio Mamba: Bidirectional State Space Model for Audio Representation LearningCode2
Learning Manipulation by Predicting InteractionCode2
Matryoshka Query Transformer for Large Vision-Language ModelsCode2
ViG: Linear-complexity Visual Sequence Learning with Gated Linear AttentionCode2
SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory SignalsCode2
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space ModelCode2
Transcriptomics-guided Slide Representation Learning in Computational PathologyCode2
Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting MaskCode2
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation LearningCode2
Benchmarking Representations for Speech, Music, and Acoustic EventsCode2
Vim4Path: Self-Supervised Vision Mamba for Histopathology ImagesCode2
ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier TransformerCode2
VideoSAGE: Video Summarization with Graph Representation LearningCode2
MindBridge: A Cross-Subject Brain Decoding FrameworkCode2
Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case StudyCode2
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance FieldsCode2
Omni-Kernel Network for Image RestorationCode2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image AnalysisCode2
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic ArchitectureCode2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge EnhancementCode2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic SegmentationCode2
Dual-domain strip attention for image restorationCode2
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place RecognitionCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
CLAP: Learning Transferable Binary Code Representations with Natural Language SupervisionCode2
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked InputsCode2
Multi-Patch Prediction: Adapting LLMs for Time Series Representation LearningCode2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of ElectrocardiogramCode2
Show:102550
← PrevPage 3 of 212Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SciNCLAvg.81.8Unverified
2SPECTERAvg.80Unverified
3CiteomaticAvg.76Unverified
4Sci-DeCLUTRAvg.66.6Unverified
5SciBERTAvg.59.6Unverified
6BioBERTAvg.58.8Unverified
7CiteBERTAvg.58.8Unverified
#ModelMetricClaimedVerifiedStatus
1top_model_weights_with_3d_21:1 Accuracy0.75Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet 18Accuracy (%)97.05Unverified
#ModelMetricClaimedVerifiedStatus
1Morphological NetworkAccuracy97.3Unverified
#ModelMetricClaimedVerifiedStatus
1Max Margin ContrastiveSilhouette Score0.56Unverified