| EventVAD: Training-Free Event-Aware Video Anomaly Detection | Apr 17, 2025 | Anomaly DetectionBoundary Detection | —Unverified | 0 |
| Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics | Apr 16, 2025 | Few-Shot LearningImage Manipulation | —Unverified | 0 |
| Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach | Apr 16, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Coding-Prior Guided Diffusion Network for Video Deblurring | Apr 16, 2025 | DeblurringVideo Deblurring | —Unverified | 0 |
| ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search | Apr 15, 2025 | RAGRetrieval-augmented Generation | —Unverified | 0 |
| Enhancing LLM-based Recommendation through Semantic-Aligned Collaborative Knowledge | Apr 14, 2025 | Collaborative FilteringTransfer Learning | —Unverified | 0 |
| Large Language Model Empowered Recommendation Meets All-domain Continual Pre-Training | Apr 11, 2025 | AllLanguage Modeling | —Unverified | 0 |
| ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models | Apr 10, 2025 | Knowledge Graph EmbeddingsKnowledge Graphs | —Unverified | 0 |
| DiffusionCom: Structure-Aware Multimodal Diffusion Model for Multimodal Knowledge Graph Completion | Apr 9, 2025 | Graph AttentionKnowledge Graph Completion | —Unverified | 0 |
| Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability | Apr 9, 2025 | Image Generationmultimodal generation | —Unverified | 0 |
| Memory-Modular Classification: Learning to Generalize with Memory Replacement | Apr 8, 2025 | Classificationimage-classification | CodeCode Available | 0 |
| Large Language Models Enhanced Hyperbolic Space Recommender Systems | Apr 8, 2025 | Contrastive LearningRecommendation Systems | —Unverified | 0 |
| User Feedback Alignment for LLM-powered Exploration in Large-scale Recommendation Systems | Apr 7, 2025 | DiversityRecommendation Systems | —Unverified | 0 |
| RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model | Apr 7, 2025 | Image Captioningimage-classification | —Unverified | 0 |
| Adaptive Elicitation of Latent Information Using Natural Language | Apr 5, 2025 | Uncertainty QuantificationWorld Knowledge | —Unverified | 0 |
| F-ViTA: Foundation Model Guided Visible to Thermal Translation | Apr 3, 2025 | Scene UnderstandingStyle Transfer | CodeCode Available | 1 |
| Knowledge Graph Completion with Mixed Geometry Tensor Factorization | Apr 3, 2025 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 0 |
| GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Apr 3, 2025 | Image GenerationWorld Knowledge | CodeCode Available | 3 |
| OnRL-RAG: Real-Time Personalized Mental Health Dialogue System | Apr 2, 2025 | RAGRetrieval | —Unverified | 0 |
| A Diffusion-Based Framework for Occluded Object Movement | Apr 2, 2025 | ObjectWorld Knowledge | —Unverified | 0 |
| Generative Retrieval and Alignment Model: A New Paradigm for E-commerce Retrieval | Apr 2, 2025 | General KnowledgeRetrieval | —Unverified | 0 |
| Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors | Mar 26, 2025 | Depth EstimationWorld Knowledge | CodeCode Available | 1 |
| LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused Evaluation | Mar 25, 2025 | counterfactualDecision Making | CodeCode Available | 0 |
| Test-Time Reasoning Through Visual Human Preferences with VLMs and Soft Rewards | Mar 25, 2025 | World Knowledge | —Unverified | 0 |
| Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics | Mar 24, 2025 | Human-Object Interaction DetectionLanguage Modeling | —Unverified | 0 |