| Aligning LLM Agents by Learning Latent Preference from User Edits | Apr 23, 2024 | DescriptiveLanguage Modelling | CodeCode Available | 1 |
| Mixture of Low-rank Experts for Transferable AI-Generated Image Detection | Apr 7, 2024 | Descriptiveparameter-efficient fine-tuning | CodeCode Available | 1 |
| A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) | Apr 2, 2024 | Descriptive | CodeCode Available | 1 |
| Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery | Mar 12, 2024 | DescriptiveRetrieval | CodeCode Available | 1 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | AttributeDescriptive | CodeCode Available | 1 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings | Jan 28, 2024 | Contrastive LearningDescriptive | CodeCode Available | 1 |
| Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training | Jan 4, 2024 | DescriptiveImage Captioning | CodeCode Available | 1 |
| VideoStudio: Generating Consistent-Content and Multi-Scene Videos | Jan 2, 2024 | DescriptiveVideo Generation | CodeCode Available | 1 |
| SPU-PMD: Self-Supervised Point Cloud Upsampling via Progressive Mesh Deformation | Jan 1, 2024 | Descriptivepoint cloud upsampling | CodeCode Available | 1 |