| FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction | Mar 29, 2025 | 3DGS, Indoor Scene Reconstruction | Code Available | 2 |
| Tune It Up: Music Genre Transfer and Prediction | Mar 27, 2025 | Music Genre Transfer, Music Style Transfer | Code Available | 0 |
| Learnable Sequence Augmenter for Triplet Contrastive Learning in Sequential Recommendation | Mar 26, 2025 | Contrastive Learning, Self-Supervised Learning | Unverified | 0 |
| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive Learning, Intent Recognition | Unverified | 0 |
| CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation | Mar 25, 2025 | Triplet | Unverified | 0 |
| fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models | Mar 25, 2025 | Action Recognition, Surgical phase recognition | Unverified | 0 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image Retrieval, Language Modeling | Code Available | 1 |
| LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning | Mar 23, 2025 | Continual Learning, Exemplar-Free | Code Available | 1 |
| EMPLACE: Self-Supervised Urban Scene Change Detection | Mar 22, 2025 | Change Detection, Scene Change Detection | Code Available | 0 |
| What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation? | Mar 20, 2025 | Decoder, Graph Generation | Unverified | 0 |
| Edgeworth Expansion for Semi-hard Triplet Loss | Mar 17, 2025 | Triplet | Unverified | 0 |
| Oscillatory Signatures of Parkinson's Disease: Central and Parietal EEG Alterations Across Multiple Frequency Bands | Mar 16, 2025 | Diagnostic, EEG | Unverified | 0 |
| Multi-Domain Biometric Recognition using Body Embeddings | Mar 13, 2025 | Person Identification, Person Re-Identification | Unverified | 0 |
| DreamRelation: Relation-Centric Video Customization | Mar 10, 2025 | Relation, Triplet | Unverified | 0 |
| A Graph-based Verification Framework for Fact-Checking | Mar 10, 2025 | Fact Checking, graph construction | Unverified | 0 |
| REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding | Mar 10, 2025 | Instruction Following, Keypoint Detection | Code Available | 1 |
| Structural Entropy Guided Unsupervised Graph Out-Of-Distribution Detection | Mar 5, 2025 | Contrastive Learning, Graph Classification | Code Available | 0 |
| MuCo-KGC: Multi-Context-Aware Knowledge Graph Completion | Mar 5, 2025 | Knowledge Graph Completion, Knowledge Graphs | Unverified | 0 |
| Radar Pulse Deinterleaving with Transformer Based Deep Metric Learning | Mar 4, 2025 | Metric Learning, Triplet | Unverified | 0 |
| Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond | Mar 3, 2025 | Infrared And Visible Image Fusion, Scene Understanding | Unverified | 0 |
| Unbiased Video Scene Graph Generation via Visual and Semantic Dual Debiasing | Mar 1, 2025 | Graph Generation, Scene Graph Generation | Unverified | 0 |
| Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish | Feb 27, 2025 | Aspect Sentiment Triplet Extraction, Sentiment Analysis | Unverified | 0 |
| TRIX: A More Expressive Model for Zero-shot Domain Transfer in Knowledge Graphs | Feb 26, 2025 | Knowledge Graph Completion, Knowledge Graphs | Code Available | 0 |
| MuCoS: Efficient Drug-Target Prediction through Multi-Context-Aware Sampling | Feb 25, 2025 | Drug Discovery, Prediction | Unverified | 0 |
| Supervised contrastive learning from weakly-labeled audio segments for musical version matching | Feb 24, 2025 | Contrastive Learning, Triplet | Unverified | 0 |
| Scale Up Composed Image Retrieval Learning via Modification Text Generation | Feb 21, 2025 | Image Retrieval, Retrieval | Unverified | 0 |
| Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | Feb 20, 2025 | Depth Estimation, Knowledge Distillation | Unverified | 0 |
| Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition | Feb 17, 2025 | Re-Ranking, Triplet | Code Available | 1 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 1 |
| Phantom: Subject-consistent video generation via cross-modal alignment | Feb 16, 2025 | cross-modal alignment, Human-Domain Subject-to-Video | Code Available | 5 |
| KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG | Feb 13, 2025 | Knowledge Graphs, Large Language Model | Code Available | 2 |
| End-to-End triplet loss based fine-tuning for network embedding in effective PII detection | Feb 13, 2025 | feature selection, Large Language Model | Unverified | 0 |
| GenIAS: Generator for Instantiating Anomalies in time Series | Feb 12, 2025 | Anomaly Detection, Diversity | Unverified | 0 |
| SNAT-YOLO: Efficient Cross-Layer Aggregation Network for Edge-Oriented Gangue Detection | Feb 9, 2025 | Triplet | Unverified | 0 |
| The Complexity of Learning Sparse Superposed Features with Feedback | Feb 8, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| PSM-SQL: Progressive Schema Learning with Multi-granularity Semantics for Text-to-SQL | Feb 7, 2025 | Text to SQL, Text-To-SQL | Unverified | 0 |
| Boundary-Driven Table-Filling with Cross-Granularity Contrastive Learning for Aspect Sentiment Triplet Extraction | Feb 4, 2025 | Aspect Sentiment Triplet Extraction, Contrastive Learning | Unverified | 0 |
| Patch Triplet Similarity Purification for Guided Real-World Low-Dose CT Image Denoising | Feb 1, 2025 | Denoising, Image Denoising | Unverified | 0 |
| Joint Power and Spectrum Orchestration for D2D Semantic Communication Underlying Energy-Efficient Cellular Networks | Jan 30, 2025 | Management, Semantic Communication | Unverified | 0 |
| Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction | Jan 24, 2025 | Aspect Sentiment Triplet Extraction, Boundary Detection | Unverified | 0 |
| YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Jan 23, 2025 | Multi-Object Tracking, object-detection | Code Available | 2 |
| Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation | Jan 22, 2025 | counterfactual, Image Generation | Unverified | 0 |
| Hybrid Losses for Hierarchical Embedding Learning | Jan 22, 2025 | Multi-Task Learning, Retrieval | Code Available | 0 |
| The Dual-use Dilemma in LLMs: Do Empowering Ethical Capacities Make a Degraded Utility? | Jan 20, 2025 | Data Augmentation, Question Answering | Unverified | 0 |
| Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis | Jan 16, 2025 | Decoder, Image Captioning | Code Available | 0 |
| Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding Learning | Jan 16, 2025 | Metric Learning, Representation Learning | Unverified | 0 |
| FARE: A Deep Learning-Based Framework for Radar-based Face Recognition and Out-of-distribution Detection | Jan 14, 2025 | Classification, Face Recognition | Unverified | 0 |
| ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training | Jan 13, 2025 | Anomaly Detection, Knowledge Graphs | Code Available | 0 |
| SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval | Jan 12, 2025 | Image Retrieval, Retrieval | Unverified | 0 |
| UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation | Jan 10, 2025 | Decoder, Graph Generation | Unverified | 0 |