SOTAVerified

Contrastive Learning

Contrastive Learning is a deep learning technique for unsupervised representation learning. The goal is to learn a representation of data such that similar instances are close together in the representation space, while dissimilar instances are far apart.

It has been shown to be effective in various computer vision and natural language processing tasks, including image retrieval, zero-shot learning, and cross-modal retrieval. In these tasks, the learned representations can be used as features for downstream tasks such as classification and clustering.

(Image credit: Schroff et al. 2015)

Papers

Showing 801850 of 6661 papers

TitleStatusHype
Temporal Contrastive Learning for Video Temporal Reasoning in Large Vision-Language Models0
Future Sight and Tough Fights: Revolutionizing Sequential Recommendation with FENRecCode0
Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning0
UniLoc: Towards Universal Place Recognition Using Any Single Modality0
Gramian Multimodal Representation Learning and AlignmentCode2
SE-GCL: An Event-Based Simple and Effective Graph Contrastive Learning for Text Representation0
Leveraging Retrieval-Augmented Tags for Large Vision-Language Understanding in Complex Scenes0
Multi-Graph Co-Training for Capturing User Intent in Session-based RecommendationCode0
RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models0
Edge Contrastive Learning: An Augmentation-Free Graph Contrastive Learning ModelCode0
Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models0
CATALOG: A Camera Trap Language-guided Contrastive Learning ModelCode0
ExeChecker: Where Did I Go Wrong?Code0
MulSMo: Multimodal Stylized Motion Generation by Bidirectional Control Flow0
TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes0
UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging ModalitiesCode2
A dual contrastive framework0
Label-template based Few-Shot Text Classification with Contrastive Learning0
Single-View Graph Contrastive Learning with Soft Neighborhood AwarenessCode0
DomCLP: Domain-wise Contrastive Learning with Prototype Mixup for Unsupervised Domain GeneralizationCode0
Dynamic Contrastive Knowledge Distillation for Efficient Image RestorationCode1
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature DecorrelationCode1
Residual Channel Boosts Contrastive Learning for Radio Frequency Fingerprint Identification0
Multi-level Matching Network for Multimodal Entity LinkingCode0
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images0
Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation0
Why Does Dropping Edges Usually Outperform Adding Edges in Graph Contrastive Learning?Code0
Mitigating Out-of-Entity Errors in Named Entity Recognition: A Sentence-Level Strategy0
Dynamic Modality-Camera Invariant Clustering for Unsupervised Visible-Infrared Person Re-identification0
Fine-grained graph representation learning for heterogeneous mobile networks with attentive fusion and contrastive learning0
AmCLR: Unified Augmented Learning for Cross-Modal RepresentationsCode0
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning0
Learning Self-Supervised Audio-Visual Representations for Sound Recommendations0
Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance0
Multi-Scale Contrastive Learning for Video Temporal Grounding0
Fine-grained Text to Image Synthesis0
StyleMaster: Stylize Your Video with Artistic Generation and Translation0
Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence0
Compositional Zero-Shot Learning with Contextualized Cues and Adaptive Contrastive Training0
Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation0
Measuring Pre-training Data Quality without Labels for Time Series Foundation Models0
A Self-Learning Multimodal Approach for Fake News Detection0
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation0
MG-3D: Multi-Grained Knowledge-Enhanced 3D Medical Vision-Language Pre-trainingCode0
From Deterministic to Probabilistic: A Novel Perspective on Domain Generalization for Medical Image Segmentation0
Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category DiscoveryCode0
A New Perspective on Time Series Anomaly Detection: Faster Patch-based Broad Learning System0
Compositional Image Retrieval via Instruction-Aware Contrastive LearningCode0
Unifying Dual-Space Embedding for Entity Alignment via Contrastive LearningCode0
SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB ImagesCode0
Show:102550
← PrevPage 17 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet50ImageNet Top-1 Accuracy73.6Unverified
2ResNet50ImageNet Top-1 Accuracy73Unverified
3ResNet50ImageNet Top-1 Accuracy71.1Unverified
4ResNet50ImageNet Top-1 Accuracy69.3Unverified
5ResNet50 (v2)ImageNet Top-1 Accuracy67.6Unverified
6ResNet50 (v2)ImageNet Top-1 Accuracy63.8Unverified
7ResNet50ImageNet Top-1 Accuracy63.6Unverified
8ResNet50ImageNet Top-1 Accuracy61.5Unverified
9ResNet50ImageNet Top-1 Accuracy61.5Unverified
10ResNet50 (4×)ImageNet Top-1 Accuracy61.3Unverified
#ModelMetricClaimedVerifiedStatus
110..5sec1Unverified
#ModelMetricClaimedVerifiedStatus
1IPCL (ResNet18)Accuracy (Top-1)84.77Unverified
#ModelMetricClaimedVerifiedStatus
1IPCL (ResNet18)Accuracy (Top-1)85.55Unverified