SOTAVerified

Contrastive Learning

Contrastive Learning is a deep learning technique for unsupervised representation learning. The goal is to learn a representation of data such that similar instances are close together in the representation space, while dissimilar instances are far apart.

It has been shown to be effective in various computer vision and natural language processing tasks, including image retrieval, zero-shot learning, and cross-modal retrieval. In these tasks, the learned representations can be used as features for downstream tasks such as classification and clustering.

(Image credit: Schroff et al. 2015)

Papers

Showing 101150 of 6661 papers

TitleStatusHype
A Unified Framework for 3D Scene UnderstandingCode2
FedTGP: Trainable Global Prototypes with Adaptive-Margin-Enhanced Contrastive Learning for Data and Model Heterogeneity in Federated LearningCode2
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object DetectionCode2
Learn From Zoom: Decoupled Supervised Contrastive Learning For WCE Image ClassificationCode2
BatchFormer: Learning to Explore Sample Relationships for Robust Representation LearningCode2
A Systematic Study of Joint Representation Learning on Protein Sequences and StructuresCode2
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report GenerationCode2
LightGCL: Simple Yet Effective Graph Contrastive Learning for RecommendationCode2
End-to-end Learnable Clustering for Intent Learning in RecommendationCode2
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive LearningCode2
MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report GenerationCode2
Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image AnalysisCode2
Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal LossCode2
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature MimickingCode2
Exploring Contrastive Learning for Multimodal Detection of Misogynistic MemesCode2
ECG-Chat: A Large ECG-Language Model for Cardiac Disease DiagnosisCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
Multimodal Industrial Anomaly Detection via Hybrid FusionCode2
A Simple Framework for Contrastive Learning of Visual RepresentationsCode2
Multi-View Reasoning: Consistent Contrastive Learning for Math Word ProblemCode2
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
MWFormer: Multi-Weather Image Restoration Using Degradation-Aware TransformersCode2
Neural-Driven Image EditingCode2
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEGCode2
One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text PromptsCode2
One Train for Two Tasks: An Encrypted Traffic Classification Framework Using Supervised Contrastive LearningCode2
DiffCSE: Difference-based Contrastive Learning for Sentence EmbeddingsCode2
Detecting and Grounding Multi-Modal Media ManipulationCode2
DiffMM: Multi-Modal Diffusion Model for RecommendationCode2
Egocentric Video-Language PretrainingCode2
Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern AnalysisCode2
Decoding speech perception from non-invasive brain recordingsCode2
DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly DetectionCode2
Decoupling Static and Hierarchical Motion Perception for Referring Video SegmentationCode2
CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series ForecastingCode2
Contrastive Search Is What You Need For Neural Text GenerationCode2
Crafting Better Contrastive Views for Siamese Representation LearningCode2
Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic SegmentationCode2
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud UnderstandingCode2
DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical AlignmentCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
A Self-Supervised Descriptor for Image Copy DetectionCode2
A DeNoising FPN With Transformer R-CNN for Tiny Object DetectionCode2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained TasksCode2
Detecting and Grounding Multi-Modal Media Manipulation and BeyondCode2
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive LearningCode2
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature DistillationCode2
DNABERT-S: Pioneering Species Differentiation with Species-Aware DNA EmbeddingsCode2
EasyRec: Simple yet Effective Language Models for RecommendationCode2
Cross-lingual and Multilingual CLIPCode2
Show:102550
← PrevPage 3 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet50ImageNet Top-1 Accuracy73.6Unverified
2ResNet50ImageNet Top-1 Accuracy73Unverified
3ResNet50ImageNet Top-1 Accuracy71.1Unverified
4ResNet50ImageNet Top-1 Accuracy69.3Unverified
5ResNet50 (v2)ImageNet Top-1 Accuracy67.6Unverified
6ResNet50 (v2)ImageNet Top-1 Accuracy63.8Unverified
7ResNet50ImageNet Top-1 Accuracy63.6Unverified
8ResNet50ImageNet Top-1 Accuracy61.5Unverified
9ResNet50ImageNet Top-1 Accuracy61.5Unverified
10ResNet50 (4×)ImageNet Top-1 Accuracy61.3Unverified
#ModelMetricClaimedVerifiedStatus
110..5sec1Unverified
#ModelMetricClaimedVerifiedStatus
1IPCL (ResNet18)Accuracy (Top-1)84.77Unverified
#ModelMetricClaimedVerifiedStatus
1IPCL (ResNet18)Accuracy (Top-1)85.55Unverified