| Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | Dec 17, 2024 | Denoising | CodeCode Available | 2 |
| WiFo: Wireless Foundation Model for Channel Prediction | Dec 12, 2024 | modelMulti-Task Learning | —Unverified | 0 |
| Towards Open-Vocabulary Video Semantic Segmentation | Dec 12, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Dec 12, 2024 | Image ComprehensionImage Generation | —Unverified | 0 |
| Disentanglement and Compositionality of Letter Identity and Letter Position in Variational Auto-Encoder Vision Models | Dec 11, 2024 | DisentanglementPosition | —Unverified | 0 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Dec 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation | Dec 11, 2024 | MambaSegmentation | CodeCode Available | 1 |
| Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Dec 11, 2024 | GPUImage Segmentation | CodeCode Available | 0 |
| ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning | Dec 10, 2024 | Evolutionary AlgorithmsLifelong learning | CodeCode Available | 0 |
| Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation | Dec 9, 2024 | Domain AdaptationImage Segmentation | CodeCode Available | 1 |