| Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation | Feb 12, 2025 | cross-modal alignment, multimodal generation | Code Available | 3 |
| Safety at Scale: A Comprehensive Survey of Large Model Safety | Feb 2, 2025 | Autonomous Driving, Data Poisoning | Code Available | 3 |
| A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant Frameworks | Jan 17, 2025 | Survey | Code Available | 3 |
| Lifelong Learning of Large Language Model based Agents: A Roadmap | Jan 13, 2025 | Incremental Learning, Language Modeling | Code Available | 3 |
| Towards Visual Grounding: A Survey | Dec 28, 2024 | Phrase Grounding, Referring Expression | Code Available | 3 |
| A Survey on Inference Optimization Techniques for Mixture of Experts Models | Dec 18, 2024 | Computational Efficiency, Distributed Computing | Code Available | 3 |
| Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey | Dec 16, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey | Dec 9, 2024 | Speech Synthesis, Survey | Code Available | 3 |
| Reinforcement Learning Enhanced LLMs: A Survey | Dec 5, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 3 |
| Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey | Dec 3, 2024 | Change Detection, Descriptive | Code Available | 3 |
| Large Language Model-Brained GUI Agents: A Survey | Nov 27, 2024 | Code Generation, Language Modeling | Code Available | 3 |
| MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs | Nov 22, 2024 | image-classification, Image Classification | Code Available | 3 |
| Model Inversion Attacks: A Survey of Approaches and Countermeasures | Nov 15, 2024 | Survey | Code Available | 3 |
| WavChat: A Survey of Spoken Dialogue Models | Nov 15, 2024 | speech-recognition, Speech Recognition | Code Available | 3 |
| A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends | Oct 19, 2024 | All, Image Restoration | Code Available | 3 |
| Data Augmentation for Sequential Recommendation: A Survey | Sep 20, 2024 | Data Augmentation, Recommendation Systems | Code Available | 3 |
| Deep Graph Anomaly Detection: A Survey and New Perspectives | Sep 16, 2024 | Anomaly Detection, Graph Anomaly Detection | Code Available | 3 |
| Attention Heads of Large Language Models: A Survey | Sep 5, 2024 | Survey | Code Available | 3 |
| A Survey of Camouflaged Object Detection and Beyond | Aug 26, 2024 | Instance Segmentation, Object | Code Available | 3 |
| Foundation Models for Music: A Survey | Aug 26, 2024 | In-Context Learning, Representation Learning | Code Available | 3 |
| Recent Event Camera Innovations: A Survey | Aug 24, 2024 | Articles, Event-based vision | Code Available | 3 |
| Controllable Text Generation for Large Language Models: A Survey | Aug 22, 2024 | Attribute, Prompt Engineering | Code Available | 3 |
| A Survey of Embodied Learning for Object-Centric Robotic Manipulation | Aug 21, 2024 | Imitation Learning, Object | Code Available | 3 |
| Graph Retrieval-Augmented Generation: A Survey | Aug 15, 2024 | Hallucination, RAG | Code Available | 3 |
| 3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities | Jul 24, 2024 | 3DGS, Survey | Code Available | 3 |