| Scale Up Composed Image Retrieval Learning via Modification Text Generation | Feb 21, 2025 | Image Retrieval, Retrieval | Unverified | 0 |
| Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | Feb 20, 2025 | Depth Estimation, Knowledge Distillation | Unverified | 0 |
| Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition | Feb 17, 2025 | Re-Ranking, Triplet | Code Available | 1 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 1 |
| Phantom: Subject-consistent video generation via cross-modal alignment | Feb 16, 2025 | cross-modal alignment, Human-Domain Subject-to-Video | Code Available | 5 |
| KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG | Feb 13, 2025 | Knowledge Graphs, Large Language Model | Code Available | 2 |
| End-to-End triplet loss based fine-tuning for network embedding in effective PII detection | Feb 13, 2025 | feature selection, Large Language Model | Unverified | 0 |
| GenIAS: Generator for Instantiating Anomalies in time Series | Feb 12, 2025 | Anomaly Detection, Diversity | Unverified | 0 |
| SNAT-YOLO: Efficient Cross-Layer Aggregation Network for Edge-Oriented Gangue Detection | Feb 9, 2025 | Triplet | Unverified | 0 |
| The Complexity of Learning Sparse Superposed Features with Feedback | Feb 8, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| PSM-SQL: Progressive Schema Learning with Multi-granularity Semantics for Text-to-SQL | Feb 7, 2025 | Text to SQL, Text-To-SQL | Unverified | 0 |
| Boundary-Driven Table-Filling with Cross-Granularity Contrastive Learning for Aspect Sentiment Triplet Extraction | Feb 4, 2025 | Aspect Sentiment Triplet Extraction, Contrastive Learning | Unverified | 0 |
| Patch Triplet Similarity Purification for Guided Real-World Low-Dose CT Image Denoising | Feb 1, 2025 | Denoising, Image Denoising | Unverified | 0 |
| Joint Power and Spectrum Orchestration for D2D Semantic Communication Underlying Energy-Efficient Cellular Networks | Jan 30, 2025 | Management, Semantic Communication | Unverified | 0 |
| Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction | Jan 24, 2025 | Aspect Sentiment Triplet Extraction, Boundary Detection | Unverified | 0 |
| YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Jan 23, 2025 | Multi-Object Tracking, object-detection | Code Available | 2 |
| Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation | Jan 22, 2025 | counterfactual, Image Generation | Unverified | 0 |
| Hybrid Losses for Hierarchical Embedding Learning | Jan 22, 2025 | Multi-Task Learning, Retrieval | Code Available | 0 |
| The Dual-use Dilemma in LLMs: Do Empowering Ethical Capacities Make a Degraded Utility? | Jan 20, 2025 | Data Augmentation, Question Answering | Unverified | 0 |
| Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis | Jan 16, 2025 | Decoder, Image Captioning | Code Available | 0 |
| Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding Learning | Jan 16, 2025 | Metric Learning, Representation Learning | Unverified | 0 |
| FARE: A Deep Learning-Based Framework for Radar-based Face Recognition and Out-of-distribution Detection | Jan 14, 2025 | Classification, Face Recognition | Unverified | 0 |
| ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training | Jan 13, 2025 | Anomaly Detection, Knowledge Graphs | Code Available | 0 |
| SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval | Jan 12, 2025 | Image Retrieval, Retrieval | Unverified | 0 |
| UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation | Jan 10, 2025 | Decoder, Graph Generation | Unverified | 0 |