| Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning | Oct 21, 2024 | Attribute | CodeCode Available | 1 |
| Zero-shot Generalist Graph Anomaly Detection with Unified Neighborhood Prompts | Oct 18, 2024 | Anomaly DetectionAttribute | CodeCode Available | 1 |
| Tree of Attributes Prompt Learning for Vision-Language Models | Oct 15, 2024 | AttributeKnowledge Graphs | CodeCode Available | 1 |
| When Graph meets Multimodal: Benchmarking on Multimodal Attributed Graphs Learning | Oct 11, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| Entering Real Social World! Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective | Oct 8, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency | Oct 7, 2024 | Attribute | CodeCode Available | 1 |
| MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain | Oct 7, 2024 | AttributeMetric Learning | CodeCode Available | 1 |
| Image Watermarks are Removable Using Controllable Regeneration from Clean Noise | Oct 7, 2024 | AttributeDenoising | CodeCode Available | 1 |
| Towards Fairness and Privacy: A Novel Data Pre-processing Optimization Framework for Non-binary Protected Attributes | Oct 1, 2024 | AttributeCombinatorial Optimization | CodeCode Available | 1 |
| CliMB: An AI-enabled Partner for Clinical Predictive Modeling | Sep 30, 2024 | AttributeAutoML | CodeCode Available | 1 |
| Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function | Sep 30, 2024 | AttributeDisentanglement | CodeCode Available | 1 |
| Domain Consistency Representation Learning for Lifelong Person Re-Identification | Sep 30, 2024 | AttributeKnowledge Distillation | CodeCode Available | 1 |
| ComiCap: A VLMs pipeline for dense captioning of Comic Panels | Sep 24, 2024 | AttributeDense Captioning | CodeCode Available | 1 |
| Finetuning CLIP to Reason about Pairwise Differences | Sep 15, 2024 | AttributeContrastive Learning | CodeCode Available | 1 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | AttributeAutoML | CodeCode Available | 1 |
| MARS: Matching Attribute-aware Representations for Text-based Sequential Recommendation | Sep 1, 2024 | AttributeSequential Recommendation | CodeCode Available | 1 |
| UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios | Aug 30, 2024 | Attributegeo-localization | CodeCode Available | 1 |
| Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and VisualAnalysis Strategy | Aug 22, 2024 | AttributeCamouflaged Object Segmentation | CodeCode Available | 1 |
| Toward Enhancing Vehicle Color Recognition in Adverse Conditions: A Dataset and Benchmark | Aug 21, 2024 | AttributeFine-Grained Vehicle Classification | CodeCode Available | 1 |
| Navigating Spatio-Temporal Heterogeneity: A Graph Transformer Approach for Traffic Forecasting | Aug 20, 2024 | AttributeMixture-of-Experts | CodeCode Available | 1 |
| Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models | Aug 18, 2024 | AttributeHallucination | CodeCode Available | 1 |
| Layerwise Recurrent Router for Mixture-of-Experts | Aug 13, 2024 | AttributeMixture-of-Experts | CodeCode Available | 1 |
| What Ails Generative Structure-based Drug Design: Expressivity is Too Little or Too Much? | Aug 12, 2024 | AttributeDrug Design | CodeCode Available | 1 |
| Diffusion Guided Language Modeling | Aug 8, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis | Aug 7, 2024 | AttributeImage Generation | CodeCode Available | 1 |
| Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Aug 2, 2024 | AttributeHallucination | CodeCode Available | 1 |
| Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation | Aug 2, 2024 | AttributeAudio Generation | CodeCode Available | 1 |
| Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Aug 1, 2024 | AttributeOptical Character Recognition | CodeCode Available | 1 |
| LADDER: Language Driven Slice Discovery and Error Rectification | Jul 31, 2024 | AttributeClustering | CodeCode Available | 1 |
| Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection | Jul 30, 2024 | Attribute | CodeCode Available | 1 |
| Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing | Jul 26, 2024 | AttributeLanguage Modelling | CodeCode Available | 1 |
| MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMs | Jul 23, 2024 | Attribute | CodeCode Available | 1 |
| TimeInf: Time Series Data Contribution via Influence Functions | Jul 21, 2024 | AttributeTime Series | CodeCode Available | 1 |
| A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Jul 19, 2024 | AttributeData Compression | CodeCode Available | 1 |
| Length-Aware Motion Synthesis via Latent Diffusion | Jul 16, 2024 | AttributeMotion Synthesis | CodeCode Available | 1 |
| Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen | Jul 16, 2024 | AttributeData Augmentation | CodeCode Available | 1 |
| CiteME: Can Language Models Accurately Cite Scientific Claims? | Jul 10, 2024 | Attribute | CodeCode Available | 1 |
| MARS: Paying more attention to visual attributes for text-based person search | Jul 5, 2024 | AttributePerson Re-Identification | CodeCode Available | 1 |
| Learning Action and Reasoning-Centric Image Editing from Videos and Simulations | Jul 3, 2024 | AttributeSpatial Reasoning | CodeCode Available | 1 |
| LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation | Jun 30, 2024 | AttributeImage Generation | CodeCode Available | 1 |
| Towards Learning Abductive Reasoning using VSA Distributed Representations | Jun 27, 2024 | AttributeTransfer Learning | CodeCode Available | 1 |
| TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings | Jun 21, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results | Jun 20, 2024 | AttributeEmotion Recognition | CodeCode Available | 1 |
| AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation | Jun 18, 2024 | AttributeFairness | CodeCode Available | 1 |
| RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding | Jun 18, 2024 | AttributeInstruction Following | CodeCode Available | 1 |
| Composing Object Relations and Attributes for Image-Text Matching | Jun 17, 2024 | AttributeGraph Attention | CodeCode Available | 1 |
| When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives | Jun 17, 2024 | Attribute | CodeCode Available | 1 |
| Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training | Jun 10, 2024 | AttributeDiversity | CodeCode Available | 1 |
| CMamba: Channel Correlation Enhanced State Space Models for Multivariate Time Series Forecasting | Jun 8, 2024 | AttributeMamba | CodeCode Available | 1 |
| Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning | Jun 6, 2024 | AttributeLanguage Modelling | CodeCode Available | 1 |