| Human-like object concept representations emerge naturally in multimodal large language models | Jul 1, 2024 | Triplet | Unverified | 0 |
| Enhanced Data Transfer Cooperating with Artificial Triplets for Scene Graph Generation | Jun 27, 2024 | Graph Generation, Scene Graph Generation | Unverified | 0 |
| Optimization of Autonomous Driving Image Detection Based on RFAConv and Triplet Attention | Jun 25, 2024 | Autonomous Driving, image-classification | Unverified | 0 |
| SetBERT: Enhancing Retrieval Performance for Boolean Logic and Set Operation Queries | Jun 25, 2024 | Retrieval, Sentence | Unverified | 0 |
| UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos | Jun 24, 2024 | Triplet, Video Summarization | Code Available | 0 |
| Multi-threshold Deep Metric Learning for Facial Expression Recognition | Jun 24, 2024 | Facial Expression Recognition, Facial Expression Recognition (FER) | Unverified | 0 |
| Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss | Jun 21, 2024 | Contrastive Learning, Machine Translation | Unverified | 0 |
| Camera-Invariant Meta-Learning Network for Single-Camera-Training Person Re-identification | Jun 21, 2024 | Domain Generalization, Meta-Learning | Unverified | 0 |
| Surgical Triplet Recognition via Diffusion Model | Jun 19, 2024 | Action Triplet Recognition, Denoising | Unverified | 0 |
| Recurrence over Video Frames (RoVF) for the Re-identification of Meerkats | Jun 18, 2024 | Triplet | Unverified | 0 |
| ChaosMining: A Benchmark to Evaluate Post-Hoc Local Attribution Methods in Low SNR Environments | Jun 17, 2024 | feature selection, Triplet | Code Available | 0 |
| SEFraud: Graph-based Self-Explainable Fraud Detection via Interpretative Mask Learning | Jun 17, 2024 | Fraud Detection, Triplet | Unverified | 0 |
| SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction | Jun 17, 2024 | Aspect Sentiment Triplet Extraction, Triplet | Code Available | 0 |
| Identifying Query-Relevant Neurons in Large Language Models for Long-Form Texts | Jun 16, 2024 | Decoder, Form | Code Available | 0 |
| An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval | Jun 13, 2024 | Contrastive Learning, Image Retrieval | Code Available | 2 |
| Conditional Similarity Triplets Enable Covariate-Informed Representations of Single-Cell Data | Jun 12, 2024 | Triplet | Code Available | 0 |
| Relational Proxy Loss for Audio-Text based Keyword Spotting | Jun 8, 2024 | Keyword Spotting, Metric Learning | Unverified | 0 |
| Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry | Jun 6, 2024 | Depth Estimation, Monocular Depth Estimation | Unverified | 0 |
| Leveraging Predicate and Triplet Learning for Scene Graph Generation | Jun 4, 2024 | Graph Generation, Relation | Code Available | 1 |
| Making Recommender Systems More Knowledgeable: A Framework to Incorporate Side Information | Jun 2, 2024 | Recommendation Systems, Triplet | Code Available | 0 |
| Bilinear-Convolutional Neural Network Using a Matrix Similarity-based Joint Loss Function for Skin Disease Classification | Jun 2, 2024 | Triplet | Unverified | 0 |
| OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation | May 30, 2024 | 3D Instance Segmentation, 3D Open-Vocabulary Instance Segmentation | Unverified | 0 |
| SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation | May 29, 2024 | Image Generation, Image Retrieval | Unverified | 0 |
| Offline Regularised Reinforcement Learning for Large Language Models Alignment | May 29, 2024 | Decoder, Models Alignment | Unverified | 0 |
| CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval | May 29, 2024 | Cross-Modal Retrieval, Image Retrieval | Code Available | 1 |
| Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification | May 28, 2024 | Person Re-Identification, Triplet | Code Available | 2 |
| FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | May 28, 2024 | Novel View Synthesis, Triplet | Code Available | 2 |
| EMERGE: Integrating RAG for Improved Multimodal EHR Predictive Modeling | May 27, 2024 | Knowledge Graphs, RAG | Unverified | 0 |
| ProtFAD: Introducing function-aware domains as implicit modality towards protein function prediction | May 24, 2024 | Contrastive Learning, Protein Function Prediction | Code Available | 0 |
| Enhancing Understanding Through Wildlife Re-Identification | May 17, 2024 | Metric Learning, Triplet | Unverified | 0 |
| Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation | May 16, 2024 | AudioCaps, Event Detection | Code Available | 1 |
| Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming Encryption | May 16, 2024 | Metric Learning, Triplet | Unverified | 0 |
| FORESEE: Multimodal and Multi-view Representation Learning for Robust Prediction of Cancer Survival | May 13, 2024 | Denoising, Prognosis | Unverified | 0 |
| PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning | May 10, 2024 | Decoder, Generalization Bounds | Code Available | 1 |
| Context-Aware Clustering using Large Language Models | May 2, 2024 | Clustering, Language Modeling | Unverified | 0 |
| FITA: Fine-grained Image-Text Aligner for Radiology Report Generation | May 2, 2024 | Descriptive, Triplet | Unverified | 0 |
| Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers | May 1, 2024 | Denoising, Diagnostic | Unverified | 0 |
| A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Apr 30, 2024 | Triplet | Unverified | 0 |
| Leak Proof CMap; a framework for training and evaluation of cell line agnostic L1000 similarity methods | Apr 29, 2024 | Benchmarking, Drug Discovery | Code Available | 0 |
| Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering | Apr 27, 2024 | Binary Classification, Language Modeling | Unverified | 0 |
| VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations | Apr 25, 2024 | Image to text, Sensitivity | Code Available | 0 |
| Semantic distance organizes social knowledge: Insights from semantic dementia and cross-modal conceptual space | Apr 23, 2024 | Anatomy, Triplet | Unverified | 0 |
| Hierarchical localization with panoramic views and triplet loss functions | Apr 22, 2024 | Image Retrieval, Position | Code Available | 0 |
| Towards Robust and Interpretable EMG-based Hand Gesture Recognition using Deep Metric Meta Learning | Apr 17, 2024 | Electromyography (EMG), Gesture Recognition | Unverified | 0 |
| DACAD: Domain Adaptation Contrastive Learning for Anomaly Detection in Multivariate Time Series | Apr 17, 2024 | Anomaly Detection, Contrastive Learning | Code Available | 1 |
| Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives | Apr 17, 2024 | Contrastive Learning, Image Retrieval | Code Available | 1 |
| Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering | Apr 16, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency | Apr 15, 2024 | Decoder, Language Modelling | Code Available | 0 |
| Learning with Noisy Correspondence | Apr 13, 2024 | Cross-Modal Retrieval, Cross-modal retrieval with noisy correspondence | Unverified | 0 |