SOTAVerified

Triplet

Papers

Showing 51100 of 1626 papers

TitleStatusHype
Read, Watch and Scream! Sound Generation from Text and VideoCode1
Unified Dual-Intent Translation for Joint Modeling of Search and RecommendationCode1
Leveraging Predicate and Triplet Learning for Scene Graph GenerationCode1
CaLa: Complementary Association Learning for Augmenting Composed Image RetrievalCode1
Revisiting Deep Audio-Text Retrieval Through the Lens of TransportationCode1
PAC-Bayesian Generalization Bounds for Knowledge Graph Representation LearningCode1
DACAD: Domain Adaptation Contrastive Learning for Anomaly Detection in Multivariate Time SeriesCode1
Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and NegativesCode1
EndoViT: pretraining vision transformers on a large collection of endoscopic imagesCode1
Knowledge-Enhanced Dual-stream Zero-shot Composed Image RetrievalCode1
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuningCode1
PCR-99: A Practical Method for Point Cloud Registration with 99 Percent OutliersCode1
Event-level Knowledge EditingCode1
Learning to Extract Structured Entities Using Language ModelsCode1
Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet ExtractionCode1
Video Harmonization with Triplet Spatio-Temporal Variation PatternsCode1
Knowledge Graph Error Detection with Contrastive Confidence AdaptionCode1
Collapse-Aware Triplet Decoupling for Adversarially Robust Image RetrievalCode1
InteractDiffusion: Interaction Control in Text-to-Image Diffusion ModelsCode1
Differentiable Registration of Images and LiDAR Point Clouds with VoxelPoint-to-Pixel MatchingCode1
Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and CaptionsCode1
DUnE: Dataset for Unified EditingCode1
MSCMNet: Multi-scale Semantic Correlation Mining for Visible-Infrared Person Re-IdentificationCode1
Neural-Logic Human-Object Interaction DetectionCode1
Mirror: A Universal Framework for Various Information Extraction TasksCode1
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single ImageCode1
Large Language Models are Temporal and Causal Reasoners for Video Question AnsweringCode1
CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet ExtractionCode1
LLM4SGG: Large Language Models for Weakly Supervised Scene Graph GenerationCode1
ESA: External Space Attention Aggregation for Image-Text RetrievalCode1
AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place RecognitionCode1
SCALE: Synergized Collaboration of Asymmetric Language Translation EnginesCode1
A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent AttentionCode1
Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive LearningCode1
Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive LearningCode1
Realistic Website Fingerprinting By Augmenting Network TraceCode1
Zero-Shot Scene Graph Generation via Triplet Calibration and ReductionCode1
Patent image retrieval using transformer-based deep metric learningCode1
CoVR-2: Automatic Data Construction for Composed Video RetrievalCode1
TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly DetectionCode1
Noisy-Correspondence Learning for Text-to-Image Person Re-identificationCode1
Compositional Feature Augmentation for Unbiased Scene Graph GenerationCode1
Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative EliminationCode1
T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing ImagesCode1
Learning Multi-modal Representations by Watching Hundreds of Surgical Video LecturesCode1
Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person RetrievalCode1
PrimeNet: Pre-Training for Irregular Multivariate Time SeriesCode1
A semantically enhanced dual encoder for aspect sentiment triplet extractionCode1
Instruct-ReID: A Multi-purpose Person Re-identification Task with InstructionsCode1
Few-Shot Open-Set Learning for On-Device Customization of KeyWord Spotting SystemsCode1
Show:102550
← PrevPage 2 of 33Next →

No leaderboard results yet.