SOTAVerified

Contrastive Learning

Contrastive Learning is a deep learning technique for unsupervised representation learning. The goal is to learn an embedding space in which similar instances lie close together while dissimilar instances lie far apart.

It has been shown to be effective in various computer vision and natural language processing tasks, including image retrieval, zero-shot learning, and cross-modal retrieval. In these tasks, the learned representations can be used as features for downstream tasks such as classification and clustering.

(Image credit: Schroff et al. 2015)
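The objective described above is commonly instantiated as an InfoNCE-style loss: each anchor is pulled toward its paired positive and pushed away from the other samples in the batch. A minimal NumPy sketch is below; the function name, temperature value, and test data are illustrative, not taken from any specific paper.

```python
import numpy as np

def info_nce_loss(anchors, positives, temperature=0.1):
    """Contrastive (InfoNCE-style) loss: for each anchor, its paired
    positive should score higher than every other sample in the batch."""
    # L2-normalize so dot products become cosine similarities
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = (a @ p.T) / temperature  # (N, N): anchor i vs. every positive
    # Cross-entropy with the matching pair on the diagonal (stable log-softmax)
    m = logits.max(axis=1, keepdims=True)
    log_softmax = logits - m - np.log(np.exp(logits - m).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_softmax)))

# Correctly aligned pairs (anchor i matches positive i) give a low loss;
# deliberately mismatched pairs give a much higher one.
rng = np.random.default_rng(0)
z = rng.normal(size=(8, 16))
aligned = info_nce_loss(z, z + 0.01 * rng.normal(size=(8, 16)))
shuffled = info_nce_loss(z, np.roll(z, 1, axis=0))
```

In practice the anchor/positive pairs come from two augmented views of the same input (or, in cross-modal settings such as CLIP, from an image and its caption), and the temperature is treated as a tunable or learned hyperparameter.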

Papers

Showing 1–50 of 6661 papers

Title | Status | Hype
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | Code | 7
Scaling Vision Pre-Training to 4K Resolution | Code | 7
PowerPM: Foundation Model for Power Systems | Code | 7
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Code | 7
Rethinking the Sample Relations for Few-Shot Classification | Code | 7
What's Behind the Mask: Understanding Masked Graph Modeling for Graph Autoencoders | Code | 6
Secrets of RLHF in Large Language Models Part II: Reward Modeling | Code | 5
Time-series attribution maps with regularized contrastive learning | Code | 5
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders | Code | 5
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Code | 5
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training | Code | 5
Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation | Code | 4
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners | Code | 4
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Code | 4
Prototypical Verbalizer for Prompt-based Few-shot Tuning | Code | 4
GLIPv2: Unifying Localization and Vision-Language Understanding | Code | 4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine | Code | 4
InternVideo: General Video Foundation Models via Generative and Discriminative Learning | Code | 4
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Code | 4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Code | 4
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval | Code | 4
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities | Code | 4
Multi-label Cluster Discrimination for Visual Representation Learning | Code | 4
When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes | Code | 3
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Code | 3
VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis | Code | 3
Sigmoid Loss for Language Image Pre-Training | Code | 3
Visual Causal Scene Refinement for Video Question Answering | Code | 3
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | Code | 3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training | Code | 3
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery | Code | 3
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents | Code | 3
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization | Code | 3
Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools Segmentation | Code | 3
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation | Code | 3
Large Language Model based Long-tail Query Rewriting in Taobao Search | Code | 3
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors | Code | 3
Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection | Code | 3
A Survey on Self-Supervised Learning for Non-Sequential Tabular Data | Code | 3
Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative Representations | Code | 3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
Focused Transformer: Contrastive Training for Context Scaling | Code | 3
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion | Code | 3
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations | Code | 3
Momentum Contrast for Unsupervised Visual Representation Learning | Code | 3
ECG-FM: An Open Electrocardiogram Foundation Model | Code | 3
FruitNeRF++: A Generalized Multi-Fruit Counting Method Utilizing Contrastive Learning and Neural Radiance Fields | Code | 3
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration | Code | 2
Think Twice Before You Act: Enhancing Agent Behavioral Safety with Thought Correction | Code | 2
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation | Code | 2
Page 1 of 134

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ResNet50 | ImageNet Top-1 Accuracy | 73.6 | | Unverified
2 | ResNet50 | ImageNet Top-1 Accuracy | 73 | | Unverified
3 | ResNet50 | ImageNet Top-1 Accuracy | 71.1 | | Unverified
4 | ResNet50 | ImageNet Top-1 Accuracy | 69.3 | | Unverified
5 | ResNet50 (v2) | ImageNet Top-1 Accuracy | 67.6 | | Unverified
6 | ResNet50 (v2) | ImageNet Top-1 Accuracy | 63.8 | | Unverified
7 | ResNet50 | ImageNet Top-1 Accuracy | 63.6 | | Unverified
8 | ResNet50 | ImageNet Top-1 Accuracy | 61.5 | | Unverified
9 | ResNet50 | ImageNet Top-1 Accuracy | 61.5 | | Unverified
10 | ResNet50 (4×) | ImageNet Top-1 Accuracy | 61.3 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | | 10..5sec | 1 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | IPCL (ResNet18) | Accuracy (Top-1) | 84.77 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | IPCL (ResNet18) | Accuracy (Top-1) | 85.55 | | Unverified