| Diffusion Guided Language Modeling | Aug 8, 2024 | Attribute, Language Modeling | Code Available | 1 |
| Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It? | Aug 7, 2024 | Attribute, Text Generation | Code Available | 0 |
| ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling | Aug 7, 2024 | Attribute, Language Modeling | Unverified | 0 |
| Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis | Aug 7, 2024 | Attribute, Image Generation | Code Available | 1 |
| Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Aug 7, 2024 | Attribute, In-Context Learning | Code Available | 2 |
| Training LLMs to Recognize Hedges in Spontaneous Narratives | Aug 6, 2024 | Attribute | Code Available | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | Attribute, D4RL | Unverified | 0 |
| KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Aug 4, 2024 | 3D Object Detection, Attribute | Code Available | 0 |
| ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science | Aug 4, 2024 | Attribute, Diachronic Word Embeddings | Code Available | 0 |
| MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph | Aug 3, 2024 | Attribute, Contrastive Learning | Unverified | 0 |
| SAT3D: Image-driven Semantic Attribute Transfer in 3D | Aug 3, 2024 | Attribute, Reading Comprehension | Unverified | 0 |
| Regularized Contrastive Partial Multi-view Outlier Detection | Aug 2, 2024 | Attribute, Contrastive Learning | Unverified | 0 |
| DERA: Dense Entity Retrieval for Entity Alignment in Knowledge Graphs | Aug 2, 2024 | Attribute, Entity Alignment | Unverified | 0 |
| Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Aug 2, 2024 | Attribute, Hallucination | Code Available | 1 |
| Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation | Aug 2, 2024 | Attribute, Audio Generation | Code Available | 1 |
| Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval | Aug 1, 2024 | Attribute, Optical Character Recognition | Code Available | 1 |
| DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation | Aug 1, 2024 | Attribute, image-classification | Code Available | 0 |
| PrivateGaze: Preserving User Privacy in Black-box Mobile Gaze Tracking Services | Aug 1, 2024 | Attribute, Gaze Estimation | Code Available | 0 |
| Learned Compression of Point Cloud Geometry and Attributes in a Single Model through Multimodal Rate-Control | Aug 1, 2024 | Attribute, Decoder | Unverified | 0 |
| "Patriarchy Hurts Men Too." Does Your Model Agree? A Discussion on Fairness Assumptions | Aug 1, 2024 | Attribute, Binary Classification | Unverified | 0 |
| LADDER: Language Driven Slice Discovery and Error Rectification | Jul 31, 2024 | Attribute, Clustering | Code Available | 1 |
| Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection | Jul 30, 2024 | Attribute | Code Available | 1 |
| HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation | Jul 30, 2024 | 3D Hand Pose Estimation, Attribute | Code Available | 0 |
| VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Jul 28, 2024 | Attribute, Fairness | Code Available | 0 |
| Multi-Modal CLIP-Informed Protein Editing | Jul 27, 2024 | Attribute, Contrastive Learning | Unverified | 0 |
| Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing | Jul 26, 2024 | Attribute, Language Modelling | Code Available | 1 |
| Diffusion-driven lensless fiber endomicroscopic quantitative phase imaging towards digital pathology | Jul 26, 2024 | Attribute, Cell Segmentation | Unverified | 0 |
| Unveiling Privacy Vulnerabilities: Investigating the Role of Structure in Graph Data | Jul 26, 2024 | Attribute, Graph Sampling | Unverified | 0 |
| A Reference-Based 3D Semantic-Aware Framework for Accurate Local Facial Attribute Editing | Jul 25, 2024 | Attribute | Unverified | 0 |
| Learning mental states estimation through self-observation: a developmental synergy between intentions and beliefs representations in a deep-learning model of Theory of Mind | Jul 25, 2024 | Attribute | Unverified | 0 |
| Lifelong Graph Learning for Graph Summarization | Jul 25, 2024 | Attribute, Graph Learning | Code Available | 0 |
| PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control | Jul 24, 2024 | Attribute, continuous-control | Unverified | 0 |
| Hidden or Inferred: Fair Learning-To-Rank with Unknown Demographics | Jul 24, 2024 | Attribute, Fairness | Code Available | 0 |
| Quantifying the Role of Textual Predictability in Automatic Speech Recognition | Jul 23, 2024 | Attribute, Automatic Speech Recognition | Unverified | 0 |
| MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMs | Jul 23, 2024 | Attribute | Code Available | 1 |
| Unveiling and Mitigating Bias in Audio Visual Segmentation | Jul 23, 2024 | Attribute, Visual Grounding | Unverified | 0 |
| VisMin: Visual Minimal-Change Understanding | Jul 23, 2024 | Attribute | Unverified | 0 |
| AI-Enhanced 7-Point Checklist for Melanoma Detection Using Clinical Knowledge Graphs and Data-Driven Quantification | Jul 23, 2024 | Attribute, Clinical Knowledge | Code Available | 0 |
| Text2Place: Affordance-aware Text Guided Human Placement | Jul 22, 2024 | Attribute, Hallucination | Unverified | 0 |
| Regression under demographic parity constraints via unlabeled post-processing | Jul 22, 2024 | Attribute, Multi-class Classification | Unverified | 0 |
| TimeInf: Time Series Data Contribution via Influence Functions | Jul 21, 2024 | Attribute, Time Series | Code Available | 1 |
| AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement | Jul 20, 2024 | Attribute, Image Enhancement | Unverified | 0 |
| Out of spuriousity: Improving robustness to spurious correlations without group annotations | Jul 20, 2024 | Attribute | Unverified | 0 |
| An Explainable Fast Deep Neural Network for Emotion Recognition | Jul 20, 2024 | Attribute, Emotion Classification | Unverified | 0 |
| Img2CAD: Reverse Engineering 3D CAD Models from Images through VLM-Assisted Conditional Factorization | Jul 19, 2024 | Attribute | Unverified | 0 |
| Are handcrafted filters helpful for attributing AI-generated images? | Jul 19, 2024 | Attribute, Image Attribution | Unverified | 0 |
| PD-APE: A Parallel Decoding Framework with Adaptive Position Encoding for 3D Visual Grounding | Jul 19, 2024 | 3D visual grounding, Attribute | Unverified | 0 |
| T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation | Jul 19, 2024 | Attribute, Language Modeling | Code Available | 2 |
| A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Jul 19, 2024 | Attribute, Data Compression | Code Available | 1 |
| Learning Visual Grounding from Generative Vision and Language Model | Jul 18, 2024 | Attribute, Language Modeling | Unverified | 0 |