| CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval | May 29, 2024 | Cross-Modal Retrieval, Image Retrieval | Code Available | 1 |
| Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification | May 28, 2024 | Person Re-Identification, Triplet | Code Available | 2 |
| FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | May 28, 2024 | Novel View Synthesis, Triplet | Code Available | 2 |
| EMERGE: Integrating RAG for Improved Multimodal EHR Predictive Modeling | May 27, 2024 | Knowledge Graphs, RAG | Unverified | 0 |
| ProtFAD: Introducing function-aware domains as implicit modality towards protein function prediction | May 24, 2024 | Contrastive Learning, Protein Function Prediction | Code Available | 0 |
| Enhancing Understanding Through Wildlife Re-Identification | May 17, 2024 | Metric Learning, Triplet | Unverified | 0 |
| Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation | May 16, 2024 | AudioCaps, Event Detection | Code Available | 1 |
| Unveiling the Potential: Harnessing Deep Metric Learning to Circumvent Video Streaming Encryption | May 16, 2024 | Metric Learning, Triplet | Unverified | 0 |
| FORESEE: Multimodal and Multi-view Representation Learning for Robust Prediction of Cancer Survival | May 13, 2024 | Denoising, Prognosis | Unverified | 0 |
| PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning | May 10, 2024 | Decoder, Generalization Bounds | Code Available | 1 |
| Context-Aware Clustering using Large Language Models | May 2, 2024 | Clustering, Language Modeling | Unverified | 0 |
| FITA: Fine-grained Image-Text Aligner for Radiology Report Generation | May 2, 2024 | Descriptive, Triplet | Unverified | 0 |
| Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers | May 1, 2024 | Denoising, Diagnostic | Unverified | 0 |
| A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Apr 30, 2024 | Triplet | Unverified | 0 |
| Leak Proof CMap; a framework for training and evaluation of cell line agnostic L1000 similarity methods | Apr 29, 2024 | Benchmarking, Drug Discovery | Code Available | 0 |
| Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering | Apr 27, 2024 | Binary Classification, Language Modeling | Unverified | 0 |
| VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations | Apr 25, 2024 | Image to text, Sensitivity | Code Available | 0 |
| Semantic distance organizes social knowledge: Insights from semantic dementia and cross-modal conceptual space | Apr 23, 2024 | Anatomy, Triplet | Unverified | 0 |
| Hierarchical localization with panoramic views and triplet loss functions | Apr 22, 2024 | Image Retrieval, Position | Code Available | 0 |
| Towards Robust and Interpretable EMG-based Hand Gesture Recognition using Deep Metric Meta Learning | Apr 17, 2024 | Electromyography (EMG), Gesture Recognition | Unverified | 0 |
| DACAD: Domain Adaptation Contrastive Learning for Anomaly Detection in Multivariate Time Series | Apr 17, 2024 | Anomaly Detection, Contrastive Learning | Code Available | 1 |
| Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives | Apr 17, 2024 | Contrastive Learning, Image Retrieval | Code Available | 1 |
| Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering | Apr 16, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Negation Triplet Extraction with Syntactic Dependency and Semantic Consistency | Apr 15, 2024 | Decoder, Language Modelling | Code Available | 0 |
| Learning with Noisy Correspondence | Apr 13, 2024 | Cross-Modal Retrieval, Cross-modal retrieval with noisy correspondence | Unverified | 0 |