SOTAVerified

Contrastive Learning

Contrastive Learning is a deep learning technique for unsupervised representation learning. The goal is to learn a representation of data such that similar instances are close together in the representation space, while dissimilar instances are far apart.

It has been shown to be effective in various computer vision and natural language processing tasks, including image retrieval, zero-shot learning, and cross-modal retrieval. In these tasks, the learned representations can be used as features for downstream tasks such as classification and clustering.

(Image credit: Schroff et al. 2015)

Papers

Showing 326350 of 6661 papers

TitleStatusHype
Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object DetectionCode1
Co-clustering for Federated Recommender SystemCode1
Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited ModalitiesCode1
Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action SegmentationCode1
EchoFM: Foundation Model for Generalizable Echocardiogram AnalysisCode1
Robust Variational Contrastive Learning for Partially View-unaligned ClusteringCode1
Progressive Compositionality In Text-to-Image Generative ModelsCode1
Self-supervised contrastive learning performs non-linear system identificationCode1
UniMTS: Unified Pre-training for Motion Time SeriesCode1
SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation Semantic Segmentation in Remote SensingCode1
Multi-granularity Contrastive Cross-modal Collaborative Generation for End-to-End Long-term Video Question AnsweringCode1
Bridging Gaps: Federated Multi-View Clustering in Heterogeneous Hybrid ViewsCode1
Continuous Contrastive Learning for Long-Tailed Semi-Supervised RecognitionCode1
Beyond Redundancy: Information-aware Unsupervised Multiplex Graph Structure LearningCode1
Semi-LLIE: Semi-supervised Contrastive Learning with Mamba-based Low-light Image EnhancementCode1
DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare DataCode1
Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI GenerationCode1
Scene-Text Grounding for Text-Based Video Question AnsweringCode1
PromSec: Prompt Optimization for Secure Generation of Functional Source Code with Large Language Models (LLMs)Code1
PRAGA: Prototype-aware Graph Adaptive Aggregation for Spatial Multi-modal Omics AnalysisCode1
Robust image representations with counterfactual contrastive learningCode1
Finetuning CLIP to Reason about Pairwise DifferencesCode1
Weakly-supervised Camera Localization by Ground-to-satellite Image RegistrationCode1
Dual-stream Feature Augmentation for Domain GeneralizationCode1
Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual LearningCode1
Show:102550
← PrevPage 14 of 267Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet50ImageNet Top-1 Accuracy73.6Unverified
2ResNet50ImageNet Top-1 Accuracy73Unverified
3ResNet50ImageNet Top-1 Accuracy71.1Unverified
4ResNet50ImageNet Top-1 Accuracy69.3Unverified
5ResNet50 (v2)ImageNet Top-1 Accuracy67.6Unverified
6ResNet50 (v2)ImageNet Top-1 Accuracy63.8Unverified
7ResNet50ImageNet Top-1 Accuracy63.6Unverified
8ResNet50ImageNet Top-1 Accuracy61.5Unverified
9ResNet50ImageNet Top-1 Accuracy61.5Unverified
10ResNet50 (4×)ImageNet Top-1 Accuracy61.3Unverified
#ModelMetricClaimedVerifiedStatus
110..5sec1Unverified
#ModelMetricClaimedVerifiedStatus
1IPCL (ResNet18)Accuracy (Top-1)84.77Unverified
#ModelMetricClaimedVerifiedStatus
1IPCL (ResNet18)Accuracy (Top-1)85.55Unverified