| LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution | May 15, 2025 | Robot ManipulationTask Planning | —Unverified | 0 |
| LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis | May 14, 2025 | cognitive diagnosisWorld Knowledge | CodeCode Available | 0 |
| Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration | May 13, 2025 | RAGRetrieval | —Unverified | 0 |
| Advancing and Benchmarking Personalized Tool Invocation for LLMs | May 7, 2025 | BenchmarkingWorld Knowledge | CodeCode Available | 0 |
| Evaluating Contrastive Feedback for Effective User Simulations | May 5, 2025 | Information RetrievalPrompt Engineering | CodeCode Available | 0 |
| WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation | May 2, 2025 | Image GenerationText to Image Generation | —Unverified | 0 |
| Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers | Apr 29, 2025 | Data AugmentationKnowledge Graphs | —Unverified | 0 |
| Towards Automated Scoping of AI for Social Good Projects | Apr 28, 2025 | World Knowledge | —Unverified | 0 |
| Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models | Apr 27, 2025 | Visual ReasoningWorld Knowledge | —Unverified | 0 |
| WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion | Apr 18, 2025 | Contrastive LearningDenoising | CodeCode Available | 1 |