SOTAVerified

Representation Learning

Representation Learning is a process in machine learning where algorithms extract meaningful patterns from raw data to create representations that are easier to understand and process. These representations can be designed for interpretability, reveal hidden features, or be used for transfer learning. They are valuable across many fundamental machine learning tasks like image classification and retrieval.

Deep neural networks can be considered representation learning models that typically encode information which is projected into a different subspace. These representations are then usually passed on to a linear classifier to, for instance, train a classifier.

Representation learning can be divided into:

  • Supervised representation learning: learning representations on task A using annotated data and used to solve task B
  • Unsupervised representation learning: learning representations on a task in an unsupervised way (label-free data). These are then used to address downstream tasks and reducing the need for annotated data when learning news tasks. Powerful models like GPT and BERT leverage unsupervised representation learning to tackle language tasks.

More recently, self-supervised learning (SSL) is one of the main drivers behind unsupervised representation learning in fields like computer vision and NLP.

Here are some additional readings to go deeper on the task:

( Image credit: Visualizing and Understanding Convolutional Networks )

Papers

Showing 151200 of 10580 papers

TitleStatusHype
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
Delving into Inter-Image Invariance for Unsupervised Visual RepresentationsCode2
DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D VisionCode2
DLF: Disentangled-Language-Focused Multimodal Sentiment AnalysisCode2
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided TransformerCode2
Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case StudyCode2
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
Dynamic Graph Representation with Knowledge-aware Attention for Histopathology Whole Slide Image AnalysisCode2
A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnosticsCode2
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerCode2
DiffCSE: Difference-based Contrastive Learning for Sentence EmbeddingsCode2
DeepSVG: A Hierarchical Generative Network for Vector Graphics AnimationCode2
Deep Reinforcement Learning for Multi-Agent InteractionCode2
DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional TransformerCode2
DiffMM: Multi-Modal Diffusion Model for RecommendationCode2
Explanation-Preserving Augmentation for Semi-Supervised Graph Representation LearningCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial ScenariosCode2
Deconstructing Denoising Diffusion Models for Self-Supervised LearningCode2
Feed-Forward SceneDINO for Unsupervised Semantic Scene CompletionCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic CorrespondenceCode2
DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly DetectionCode2
Decoupling Representation Learning from Reinforcement LearningCode2
Diffusion Models and Representation Learning: A SurveyCode2
Geometry-Complete Diffusion for 3D Molecule Generation and OptimizationCode2
Gramian Multimodal Representation Learning and AlignmentCode2
Graph4Rec: A Universal Toolkit with Graph Neural Networks for Recommender SystemsCode2
Graph Neural Networks for Natural Language Processing: A SurveyCode2
Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and DirectionsCode2
Effective Data Augmentation With Diffusion ModelsCode2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of ElectrocardiogramCode2
Hierarchical Fine-Grained Image Forgery Detection and LocalizationCode2
Hierarchical Open-vocabulary Universal Image SegmentationCode2
Crafting Better Contrastive Views for Siamese Representation LearningCode2
Audio Mamba: Bidirectional State Space Model for Audio Representation LearningCode2
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding ModelCode2
Knowledge Representation Learning: A Quantitative ReviewCode2
PLA: Language-Driven Open-Vocabulary 3D Scene UnderstandingCode2
Language-Driven Representation Learning for RoboticsCode2
Learning to Prompt for Vision-Language ModelsCode2
Learning Vision from Models Rivals Learning Vision from DataCode2
Counterfactual Learning on Graphs: A SurveyCode2
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place RecognitionCode2
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image SynthesisCode2
Manify: A Python Library for Learning Non-Euclidean RepresentationsCode2
Masked Autoencoders As Spatiotemporal LearnersCode2
Correlation-Guided Query-Dependency Calibration for Video Temporal GroundingCode2
Matryoshka Query Transformer for Large Vision-Language ModelsCode2
CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series ForecastingCode2
Show:102550
← PrevPage 4 of 212Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SciNCLAvg.81.8Unverified
2SPECTERAvg.80Unverified
3CiteomaticAvg.76Unverified
4Sci-DeCLUTRAvg.66.6Unverified
5SciBERTAvg.59.6Unverified
6BioBERTAvg.58.8Unverified
7CiteBERTAvg.58.8Unverified
#ModelMetricClaimedVerifiedStatus
1top_model_weights_with_3d_21:1 Accuracy0.75Unverified
#ModelMetricClaimedVerifiedStatus
1Resnet 18Accuracy (%)97.05Unverified
#ModelMetricClaimedVerifiedStatus
1Morphological NetworkAccuracy97.3Unverified
#ModelMetricClaimedVerifiedStatus
1Max Margin ContrastiveSilhouette Score0.56Unverified