| SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation | Apr 3, 2024 | AttributeSemantic Segmentation | CodeCode Available | 1 |
| Development of Compositionality and Generalization through Interactive Learning of Language and Action of Robots | Mar 29, 2024 | Attribute | CodeCode Available | 1 |
| U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation | Mar 29, 2024 | AttributeDisentanglement | CodeCode Available | 1 |
| Attribute First, then Generate: Locally-attributable Grounded Text Generation | Mar 25, 2024 | AttributeDocument Summarization | CodeCode Available | 1 |
| Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval | Mar 24, 2024 | AttributeImage Retrieval | CodeCode Available | 1 |
| Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing | Mar 20, 2024 | Attribute | CodeCode Available | 1 |
| Reinforcement Learning with Token-level Feedback for Controllable Text Generation | Mar 18, 2024 | Attributereinforcement-learning | CodeCode Available | 1 |
| PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset | Mar 17, 2024 | AttributeCommon Sense Reasoning | CodeCode Available | 1 |
| ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models | Mar 17, 2024 | Attributenamed-entity-recognition | CodeCode Available | 1 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | AttributeDescriptive | CodeCode Available | 1 |