| EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models | Nov 27, 2023 | AttributeQuestion Answering | CodeCode Available | 1 |
| Self-correcting LLM-controlled Diffusion Models | Nov 27, 2023 | AttributeImage Generation | CodeCode Available | 1 |
| Benchmarking Robustness of Text-Image Composed Retrieval | Nov 24, 2023 | AttributeBenchmarking | CodeCode Available | 1 |
| HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data | Nov 22, 2023 | Attributecounterfactual | CodeCode Available | 1 |
| LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions | Nov 20, 2023 | AttributeClassification | CodeCode Available | 1 |
| Exploring Variational Auto-Encoder Architectures, Configurations, and Datasets for Generative Music Explainable AI | Nov 14, 2023 | AttributeMusic Generation | CodeCode Available | 1 |
| AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Nov 13, 2023 | AttributeHallucination | CodeCode Available | 1 |
| SEMQA: Semi-Extractive Multi-Source Question Answering | Nov 8, 2023 | AttributeLong Form Question Answering | CodeCode Available | 1 |
| A Simple and Efficient Baseline for Data Attribution on Images | Nov 3, 2023 | AttributeSelf-Supervised Learning | CodeCode Available | 1 |
| Towards Machine Unlearning Benchmarks: Forgetting the Personal Identities in Facial Recognition Systems | Nov 3, 2023 | Age EstimationAttribute | CodeCode Available | 1 |
| HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception | Oct 31, 2023 | 2D Pose EstimationAttribute | CodeCode Available | 1 |
| GaitFormer: Learning Gait Representations with Noisy Multi-Task Learning | Oct 30, 2023 | AttributeMulti-Task Learning | CodeCode Available | 1 |
| Chain-of-Choice Hierarchical Policy Learning for Conversational Recommendation | Oct 27, 2023 | AttributeConversational Recommendation | CodeCode Available | 1 |
| Causality-Inspired Fair Representation Learning for Multimodal Recommendation | Oct 26, 2023 | AttributeCausal Inference | CodeCode Available | 1 |
| Salient Object Detection in RGB-D Videos | Oct 24, 2023 | AttributeObject | CodeCode Available | 1 |
| MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control | Oct 22, 2023 | AttributeChatbot | CodeCode Available | 1 |
| GraphMaker: Can Diffusion Models Generate Large Attributed Graphs? | Oct 20, 2023 | AttributeGraph Generation | CodeCode Available | 1 |
| Learning with Unmasked Tokens Drives Stronger Vision Learners | Oct 20, 2023 | AttributeDecoder | CodeCode Available | 1 |
| ExtractGPT: Exploring the Potential of Large Language Models for Product Attribute Value Extraction | Oct 19, 2023 | AttributeAttribute Value Extraction | CodeCode Available | 1 |
| Multi‑camera trajectory matching based on hierarchical clustering and constraints | Oct 19, 2023 | AttributeAutonomous Driving | CodeCode Available | 1 |
| ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face Swapping | Oct 19, 2023 | AttributeDecoder | CodeCode Available | 1 |
| Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity? | Oct 14, 2023 | AttributeOut-of-Distribution Generalization | CodeCode Available | 1 |
| Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model | Oct 14, 2023 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Multimodal Variational Auto-encoder based Audio-Visual Segmentation | Oct 12, 2023 | AttributeRepresentation Learning | CodeCode Available | 1 |
| Sentence-level Prompts Benefit Composed Image Retrieval | Oct 9, 2023 | AttributeComposed Image Retrieval (CoIR) | CodeCode Available | 1 |