| SAND: Boosting LLM Agents with Self-Taught Action Deliberation | Jul 10, 2025 | Large Language ModelSand | —Unverified | 0 |
| A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding | Jul 9, 2025 | 3D visual groundingAutonomous Navigation | —Unverified | 0 |
| The Dark Side of LLMs Agent-based Attacks for Complete Computer Takeover | Jul 9, 2025 | Large Language ModelRAG | —Unverified | 0 |
| Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model | Jul 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection | Jul 9, 2025 | Human-Object Interaction DetectionLarge Language Model | CodeCode Available | 0 |
| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Jul 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning | Jul 9, 2025 | Caption GenerationClustering | —Unverified | 0 |
| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future predictionLarge Language Model | —Unverified | 0 |
| BlueLM-2.5-3B Technical Report | Jul 8, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval | Jul 8, 2025 | Image RetrievalLarge Language Model | —Unverified | 0 |
| LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving | Jul 8, 2025 | Autonomous DrivingImitation Learning | —Unverified | 0 |
| CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations | Jul 8, 2025 | Generative Adversarial NetworkLarge Language Model | CodeCode Available | 0 |
| TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation | Jul 8, 2025 | Large Language Model | —Unverified | 0 |
| PrefixAgent: An LLM-Powered Design Framework for Efficient Prefix Adder Optimization | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions | Jul 7, 2025 | Large Language ModelRAG | —Unverified | 0 |
| PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes | Jul 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Inaugural MOASEI Competition at AAMAS'2025: A Technical Report | Jul 7, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models | Jul 7, 2025 | ArticlesLarge Language Model | —Unverified | 0 |
| DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning | Jul 7, 2025 | HallucinationLarge Language Model | —Unverified | 0 |
| CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step | Jul 6, 2025 | DenoisingLarge Language Model | —Unverified | 0 |
| BiFair: A Fairness-aware Training Framework for LLM-enhanced Recommender Systems via Bi-level Optimization | Jul 6, 2025 | FairnessLarge Language Model | —Unverified | 0 |
| GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation | Jul 4, 2025 | Document Level Machine TranslationDocument Translation | —Unverified | 0 |
| Behaviour Space Analysis of LLM-driven Meta-heuristic Discovery | Jul 4, 2025 | Large Language Model | —Unverified | 0 |
| Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization | Jul 3, 2025 | DescriptiveDisentanglement | —Unverified | 0 |