| Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Aug 2, 2024 | AttributeHallucination | CodeCode Available | 1 |
| Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation | Aug 2, 2024 | AttributeAudio Generation | CodeCode Available | 1 |
| Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Aug 1, 2024 | AttributeOptical Character Recognition | CodeCode Available | 1 |
| LADDER: Language Driven Slice Discovery and Error Rectification | Jul 31, 2024 | AttributeClustering | CodeCode Available | 1 |
| Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection | Jul 30, 2024 | Attribute | CodeCode Available | 1 |
| Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing | Jul 26, 2024 | AttributeLanguage Modelling | CodeCode Available | 1 |
| MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMs | Jul 23, 2024 | Attribute | CodeCode Available | 1 |
| TimeInf: Time Series Data Contribution via Influence Functions | Jul 21, 2024 | AttributeTime Series | CodeCode Available | 1 |
| A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Jul 19, 2024 | AttributeData Compression | CodeCode Available | 1 |
| Length-Aware Motion Synthesis via Latent Diffusion | Jul 16, 2024 | AttributeMotion Synthesis | CodeCode Available | 1 |
| Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen | Jul 16, 2024 | AttributeData Augmentation | CodeCode Available | 1 |
| CiteME: Can Language Models Accurately Cite Scientific Claims? | Jul 10, 2024 | Attribute | CodeCode Available | 1 |
| MARS: Paying more attention to visual attributes for text-based person search | Jul 5, 2024 | AttributePerson Re-Identification | CodeCode Available | 1 |
| Learning Action and Reasoning-Centric Image Editing from Videos and Simulations | Jul 3, 2024 | AttributeSpatial Reasoning | CodeCode Available | 1 |
| LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation | Jun 30, 2024 | AttributeImage Generation | CodeCode Available | 1 |
| Towards Learning Abductive Reasoning using VSA Distributed Representations | Jun 27, 2024 | AttributeTransfer Learning | CodeCode Available | 1 |
| TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings | Jun 21, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results | Jun 20, 2024 | AttributeEmotion Recognition | CodeCode Available | 1 |
| AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation | Jun 18, 2024 | AttributeFairness | CodeCode Available | 1 |
| RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding | Jun 18, 2024 | AttributeInstruction Following | CodeCode Available | 1 |
| Composing Object Relations and Attributes for Image-Text Matching | Jun 17, 2024 | AttributeGraph Attention | CodeCode Available | 1 |
| When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives | Jun 17, 2024 | Attribute | CodeCode Available | 1 |
| Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training | Jun 10, 2024 | AttributeDiversity | CodeCode Available | 1 |
| CMamba: Channel Correlation Enhanced State Space Models for Multivariate Time Series Forecasting | Jun 8, 2024 | AttributeMamba | CodeCode Available | 1 |
| Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning | Jun 6, 2024 | AttributeLanguage Modelling | CodeCode Available | 1 |