SOTAVerified

Contrastive Learning

Contrastive Learning is a deep learning technique for unsupervised representation learning. The goal is to learn a representation of data such that similar instances are close together in the representation space, while dissimilar instances are far apart. In practice, "similar" pairs are often two augmented views of the same instance (positives), while other instances in the batch serve as negatives.

It has been shown to be effective in various computer vision and natural language processing tasks, including image retrieval, zero-shot learning, and cross-modal retrieval. In these tasks, the learned representations can be used as features for downstream tasks such as classification and clustering.
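As a concrete illustration of the idea above, here is a minimal NumPy sketch of an InfoNCE-style contrastive loss, the objective family used by many of the methods listed below. The function name, the temperature value, and the batch layout are illustrative choices, not taken from any specific paper on this page.

```python
import numpy as np

def info_nce_loss(z_i, z_j, temperature=0.1):
    """Minimal InfoNCE-style contrastive loss (illustrative sketch).

    z_i, z_j: (N, D) embeddings of two augmented views of the same N
    instances; row k of z_i and row k of z_j form a positive pair, and
    all other rows act as negatives.
    """
    # L2-normalize so dot products become cosine similarities.
    z_i = z_i / np.linalg.norm(z_i, axis=1, keepdims=True)
    z_j = z_j / np.linalg.norm(z_j, axis=1, keepdims=True)

    # (N, N) similarity matrix; positives sit on the diagonal.
    logits = z_i @ z_j.T / temperature

    # Log-softmax over each row, then take the diagonal (positive) terms.
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_prob))
```

Minimizing this loss pulls each positive pair together relative to the in-batch negatives, which is exactly the "similar close, dissimilar far" objective described above; lower temperatures sharpen the penalty on hard negatives.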

(Image credit: Schroff et al. 2015)

Papers

Showing 1–25 of 6,661 papers

| Title | Status | Hype |
|---|---|---|
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Code | 7 |
| Scaling Vision Pre-Training to 4K Resolution | Code | 7 |
| PowerPM: Foundation Model for Power Systems | Code | 7 |
| T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | Code | 7 |
| Rethinking the Sample Relations for Few-Shot Classification | Code | 7 |
| What's Behind the Mask: Understanding Masked Graph Modeling for Graph Autoencoders | Code | 6 |
| LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders | Code | 5 |
| Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training | Code | 5 |
| Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Code | 5 |
| Secrets of RLHF in Large Language Models Part II: Reward Modeling | Code | 5 |
| Time-series attribution maps with regularized contrastive learning | Code | 5 |
| LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Code | 4 |
| InternVideo: General Video Foundation Models via Generative and Discriminative Learning | Code | 4 |
| LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Code | 4 |
| Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation | Code | 4 |
| FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Code | 4 |
| StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners | Code | 4 |
| Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval | Code | 4 |
| AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities | Code | 4 |
| MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine | Code | 4 |
| GLIPv2: Unifying Localization and Vision-Language Understanding | Code | 4 |
| Multi-label Cluster Discrimination for Visual Representation Learning | Code | 4 |
| Prototypical Verbalizer for Prompt-based Few-shot Tuning | Code | 4 |
| Large Language Model based Long-tail Query Rewriting in Taobao Search | Code | 3 |
| Large-Scale 3D Medical Image Pre-training with Geometric Context Priors | Code | 3 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ResNet50 | ImageNet Top-1 Accuracy | 73.6 | | Unverified |
| 2 | ResNet50 | ImageNet Top-1 Accuracy | 73 | | Unverified |
| 3 | ResNet50 | ImageNet Top-1 Accuracy | 71.1 | | Unverified |
| 4 | ResNet50 | ImageNet Top-1 Accuracy | 69.3 | | Unverified |
| 5 | ResNet50 (v2) | ImageNet Top-1 Accuracy | 67.6 | | Unverified |
| 6 | ResNet50 (v2) | ImageNet Top-1 Accuracy | 63.8 | | Unverified |
| 7 | ResNet50 | ImageNet Top-1 Accuracy | 63.6 | | Unverified |
| 8 | ResNet50 | ImageNet Top-1 Accuracy | 61.5 | | Unverified |
| 9 | ResNet50 | ImageNet Top-1 Accuracy | 61.5 | | Unverified |
| 10 | ResNet50 (4×) | ImageNet Top-1 Accuracy | 61.3 | | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | | 10..5sec | 1 | | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IPCL (ResNet18) | Accuracy (Top-1) | 84.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IPCL (ResNet18) | Accuracy (Top-1) | 85.55 | | Unverified |