| MusiScene: Leveraging MU-LLaMA for Scene Imagination and Enhanced Video Background Music Generation | Jul 8, 2025 | Language Modeling | Unverified | 0 |
| GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing | Jul 8, 2025 | Language Modeling | Unverified | 0 |
| PrefixAgent: An LLM-Powered Design Framework for Efficient Prefix Adder Optimization | Jul 8, 2025 | Language Modeling | Unverified | 0 |
| PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes | Jul 7, 2025 | Language Modeling | Unverified | 0 |
| Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model | Jul 7, 2025 | Image Retrieval, Language Modeling | Unverified | 0 |
| Inaugural MOASEI Competition at AAMAS'2025: A Technical Report | Jul 7, 2025 | Benchmarking, Decision Making | Unverified | 0 |
| Transforming Calabi-Yau Constructions: Generating New Calabi-Yau Manifolds with Transformers | Jul 4, 2025 | Language Modeling | Unverified | 0 |
| DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment | Jul 3, 2025 | cross-modal alignment, Instruction Following | Code Available | 2 |
| OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering | Jul 2, 2025 | Language Modeling | Code Available | 0 |
| Flexible Language Modeling in Continuous Space with Transformer-based Autoregressive Flows | Jul 1, 2025 | Language Modeling | Unverified | 0 |