| GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning | Jul 9, 2025 | Caption GenerationClustering | —Unverified | 0 |
| Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model | Jul 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations | Jul 8, 2025 | Generative Adversarial NetworkLarge Language Model | CodeCode Available | 0 |
| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future predictionLarge Language Model | —Unverified | 0 |
| BlueLM-2.5-3B Technical Report | Jul 8, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval | Jul 8, 2025 | Image RetrievalLarge Language Model | —Unverified | 0 |
| TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation | Jul 8, 2025 | Large Language Model | —Unverified | 0 |
| LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving | Jul 8, 2025 | Autonomous DrivingImitation Learning | —Unverified | 0 |
| PrefixAgent: An LLM-Powered Design Framework for Efficient Prefix Adder Optimization | Jul 8, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes | Jul 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning | Jul 7, 2025 | HallucinationLarge Language Model | —Unverified | 0 |
| Inaugural MOASEI Competition at AAMAS'2025: A Technical Report | Jul 7, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions | Jul 7, 2025 | Large Language ModelRAG | —Unverified | 0 |
| AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models | Jul 7, 2025 | ArticlesLarge Language Model | —Unverified | 0 |
| BiFair: A Fairness-aware Training Framework for LLM-enhanced Recommender Systems via Bi-level Optimization | Jul 6, 2025 | FairnessLarge Language Model | —Unverified | 0 |
| CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step | Jul 6, 2025 | DenoisingLarge Language Model | —Unverified | 0 |
| GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation | Jul 4, 2025 | Document Level Machine TranslationDocument Translation | —Unverified | 0 |
| Behaviour Space Analysis of LLM-driven Meta-heuristic Discovery | Jul 4, 2025 | Large Language Model | —Unverified | 0 |
| Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization | Jul 3, 2025 | DescriptiveDisentanglement | —Unverified | 0 |
| Early Signs of Steganographic Capabilities in Frontier LLMs | Jul 3, 2025 | Large Language Model | CodeCode Available | 0 |
| OpenTable-R1: A Reinforcement Learning Augmented Tool Agent for Open-Domain Table Question Answering | Jul 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent | Jun 30, 2025 | Interactive RecommendationLarge Language Model | CodeCode Available | 0 |
| Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning | Jun 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval | Jun 28, 2025 | Cross-Modal RetrievalImage Captioning | —Unverified | 0 |
| ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment | Jun 28, 2025 | Dynamic Time WarpingLarge Language Model | CodeCode Available | 0 |
| ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation | Jun 27, 2025 | Large Language ModelNatural Language Inference | —Unverified | 0 |
| A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis | Jun 27, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| AgentStealth: Reinforcing Large Language Model for Anonymizing User-generated Text | Jun 26, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |
| OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography | Jun 26, 2025 | DeciphermentLarge Language Model | CodeCode Available | 0 |
| Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test | Jun 26, 2025 | Code GenerationLarge Language Model | —Unverified | 0 |
| Can "consciousness" be observed from large language model (LLM) internal states? Dissecting LLM representations obtained from Theory of Mind test with Integrated Information Theory and Span Representation analysis | Jun 26, 2025 | Explainable Artificial Intelligence (XAI)Interpretable Machine Learning | —Unverified | 0 |
| LLM-guided Chemical Process Optimization with a Multi-Agent Approach | Jun 26, 2025 | Chemical ProcessComputational Efficiency | —Unverified | 0 |
| PsyLite Technical Report | Jun 26, 2025 | Large Language ModelLightweight Deployment | CodeCode Available | 0 |
| Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models | Jun 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MT2-CSD: A New Dataset and Multi-Semantic Knowledge Fusion Method for Conversational Stance Detection | Jun 26, 2025 | Large Language ModelOpinion Mining | —Unverified | 0 |
| mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale | Jun 26, 2025 | Anomaly DetectionBenchmarking | CodeCode Available | 0 |
| GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding | Jun 26, 2025 | 3D visual groundingLarge Language Model | —Unverified | 0 |
| Large Language Model Agent for Modular Task Execution in Drug Discovery | Jun 26, 2025 | Drug DiscoveryLanguage Modeling | —Unverified | 0 |
| Prompt-Guided Turn-Taking Prediction | Jun 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multimodal Prompt Alignment for Facial Expression Recognition | Jun 26, 2025 | Facial Expression RecognitionFacial Expression Recognition (FER) | —Unverified | 0 |
| MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification | Jun 26, 2025 | Image SegmentationLarge Language Model | —Unverified | 0 |
| Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios | Jun 25, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Predicting Readiness to Engage in Psychotherapy of People with Chronic Pain Based on their Pain-Related Narratives Saar | Jun 25, 2025 | Large Language ModelSensitivity | —Unverified | 0 |
| Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content | Jun 25, 2025 | ArticlesContinual Pretraining | —Unverified | 0 |
| AALC: Large Language Model Efficient Reasoning via Adaptive Accuracy-Length Control | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Irec: A Metacognitive Scaffolding for Self-Regulated Learning through Just-in-Time Insight Recall: A Conceptual Framework and System Prototype | Jun 25, 2025 | graph constructionLarge Language Model | —Unverified | 0 |
| An Agentic System for Rare Disease Diagnosis with Traceable Reasoning | Jun 25, 2025 | DiagnosticLarge Language Model | —Unverified | 0 |
| A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Community-Driven Agents for Machine Learning Engineering | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |