| FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction | Mar 29, 2025 | 3DGSIndoor Scene Reconstruction | CodeCode Available | 2 |
| Tune It Up: Music Genre Transfer and Prediction | Mar 27, 2025 | Music Genre TransferMusic Style Transfer | CodeCode Available | 0 |
| Learnable Sequence Augmenter for Triplet Contrastive Learning in Sequential Recommendation | Mar 26, 2025 | Contrastive LearningSelf-Supervised Learning | —Unverified | 0 |
| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive LearningIntent Recognition | —Unverified | 0 |
| fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models | Mar 25, 2025 | Action RecognitionSurgical phase recognition | —Unverified | 0 |
| CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation | Mar 25, 2025 | Triplet | —Unverified | 0 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning | Mar 23, 2025 | Continual LearningExemplar-Free | CodeCode Available | 1 |
| EMPLACE: Self-Supervised Urban Scene Change Detection | Mar 22, 2025 | Change DetectionScene Change Detection | CodeCode Available | 0 |
| What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation? | Mar 20, 2025 | DecoderGraph Generation | —Unverified | 0 |