| Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration | Sep 28, 2024 | AllAttribute | CodeCode Available | 2 |
| Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Aug 7, 2024 | AttributeIn-Context Learning | CodeCode Available | 2 |
| T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation | Jul 19, 2024 | AttributeLanguage Modeling | CodeCode Available | 2 |
| ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement | Jul 9, 2024 | AttributeDisentanglement | CodeCode Available | 2 |
| UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models | Jun 27, 2024 | AttributeBenchmarking | CodeCode Available | 2 |
| RouteFinder: Towards Foundation Models for Vehicle Routing Problems | Jun 21, 2024 | AttributeMulti-Task Learning | CodeCode Available | 2 |
| Task Me Anything | Jun 17, 2024 | 2kAttribute | CodeCode Available | 2 |
| A Synthetic Dataset for Personal Attribute Inference | Jun 11, 2024 | AttributeAuthor Profiling | CodeCode Available | 2 |
| Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring | Jun 11, 2024 | AttributeDomain Generalization | CodeCode Available | 2 |
| MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Jun 10, 2024 | 3D GenerationAttribute | CodeCode Available | 2 |
| Binarized Diffusion Model for Image Super-Resolution | Jun 9, 2024 | AttributeBinarization | CodeCode Available | 2 |
| Non-destructive Degradation Pattern Decoupling for Ultra-early Battery Prototype Verification Using Physics-informed Machine Learning | Jun 1, 2024 | AttributePhysics-informed machine learning | CodeCode Available | 2 |
| DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution | May 25, 2024 | Attribute | CodeCode Available | 2 |
| LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation | Apr 30, 2024 | AttributeSemantic Segmentation | CodeCode Available | 2 |
| CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding | Apr 22, 2024 | Attribute | CodeCode Available | 2 |
| CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Apr 4, 2024 | AttributeImage Captioning | CodeCode Available | 2 |
| LLM Attributor: Interactive Visual Attribution for LLM Generation | Apr 1, 2024 | ArticlesAttribute | CodeCode Available | 2 |
| Measuring Style Similarity in Diffusion Models | Apr 1, 2024 | AttributeStyle Detection | CodeCode Available | 2 |
| SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects | Mar 29, 2024 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding | Mar 27, 2024 | AttributeDecision Making | CodeCode Available | 2 |
| Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions | Mar 25, 2024 | Attribute | CodeCode Available | 2 |
| Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt | Mar 18, 2024 | AttributeDecoder | CodeCode Available | 2 |
| Faceptor: A Generalist Model for Face Perception | Mar 14, 2024 | Age EstimationAttribute | CodeCode Available | 2 |
| Task Attribute Distance for Few-Shot Learning: Theoretical Analysis and Applications | Mar 6, 2024 | AttributeData Augmentation | CodeCode Available | 2 |
| RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations | Feb 27, 2024 | AttributeLanguage Modeling | CodeCode Available | 2 |