| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TailorSQL: An NL2SQL System Tailored to Your Query Workload | May 29, 2025 | Large Language ModelTranslation | —Unverified | 0 |
| Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems | May 29, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 |
| Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration | May 29, 2025 | Large Language Model | —Unverified | 0 |
| 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | May 28, 2025 | ChatbotLanguage Modeling | —Unverified | 0 |
| Universal Visuo-Tactile Video Understanding for Embodied Interaction | May 28, 2025 | FrictionLarge Language Model | —Unverified | 0 |
| EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Speech as a Multimodal Digital Phenotype for Multi-Task LLM-based Mental Health Prediction | May 28, 2025 | Depression DetectionLanguage Modeling | —Unverified | 0 |
| ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Design and testing of an agent chatbot supporting decision making with public transport data | May 28, 2025 | ChatbotDecision Making | —Unverified | 0 |
| Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image Generation | May 28, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 0 |
| LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning | May 28, 2025 | Combinatorial OptimizationFairness | —Unverified | 0 |
| GateNLP at SemEval-2025 Task 10: Hierarchical Three-Step Prompting for Multilingual Narrative Classification | May 28, 2025 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition | May 28, 2025 | Large Language Model | —Unverified | 0 |
| Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Conversational Alignment with Artificial Intelligence in Context | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Tool for Generating Exceptional Behavior Tests With Large Language Models | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning | May 28, 2025 | CAD ReconstructionLarge Language Model | CodeCode Available | 2 |
| A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems | May 28, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| BugWhisperer: Fine-Tuning LLMs for SoC Hardware Vulnerability Detection | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems | May 28, 2025 | Large Language ModelQuestion Answering | —Unverified | 0 |
| Modeling and Optimizing User Preferences in AI Copilots: A Comprehensive Survey and Taxonomy | May 28, 2025 | Large Language ModelRecommendation Systems | —Unverified | 0 |
| Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems | May 28, 2025 | Graph LearningLarge Language Model | —Unverified | 0 |
| Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation | May 28, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Zero-Shot Vision Encoder Grafting via LLM Surrogates | May 28, 2025 | DecoderLanguage Modeling | CodeCode Available | 2 |
| The Multilingual Divide and Its Impact on Global AI Safety | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Code Researcher: Deep Research Agent for Large Systems Code and Commit History | May 27, 2025 | Large Language Model | —Unverified | 0 |
| Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools | May 27, 2025 | AI AgentLanguage Modeling | —Unverified | 0 |
| Think Before You Diffuse: LLMs-Guided Physics-Aware Video Generation | May 27, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms | May 27, 2025 | Bayesian OptimizationBenchmarking | CodeCode Available | 2 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8kAvg | CodeCode Available | 1 |
| OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions | May 27, 2025 | Audio-Visual SynchronizationConversational Response Generation | —Unverified | 0 |
| Creativity in LLM-based Multi-Agent Systems: A Survey | May 27, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| StreamLink: Large-Language-Model Driven Distributed Data Engineering System | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework | May 27, 2025 | DiagnosticKnowledge Graphs | —Unverified | 0 |
| Automated Privacy Information Annotation in Large Language Model Interactions | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LLM Web Dynamics: Tracing Model Collapse in a Network of LLMs | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HAMburger: Accelerating LLM Inference via Token Smashing | May 26, 2025 | Large Language Model | —Unverified | 0 |
| What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning? | May 26, 2025 | In-Context Learningknowledge editing | —Unverified | 0 |
| Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |