| Med-gte-hybrid: A contextual embedding transformer model for extracting actionable information from clinical texts | Feb 21, 2025 | Contrastive Learning, Decision Making | Unverified | 0 |
| Graph Attention Convolutional U-NET: A Semantic Segmentation Model for Identifying Flooded Areas | Feb 21, 2025 | Decision Making, Graph Attention | Unverified | 0 |
| Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions | Feb 21, 2025 | Decision Making | Unverified | 0 |
| A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier Models | Feb 21, 2025 | Decision Making, Knowledge Distillation | Code Available | 0 |
| Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective | Feb 20, 2025 | Decision Making, Philosophy | Unverified | 0 |
| An Interpretable Machine Learning Approach to Understanding the Relationships between Solar Flares and Source Active Regions | Feb 20, 2025 | Binary Classification, Decision Making | Unverified | 0 |
| Multi-Objective Causal Bayesian Optimization | Feb 20, 2025 | Bayesian Optimization, Decision Making | Code Available | 1 |
| SPRIG: Stackelberg Perception-Reinforcement Learning with Internal Game Dynamics | Feb 20, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC | Feb 20, 2025 | Decision Making | Code Available | 9 |
| Investigating the Impact of LLM Personality on Cognitive Bias Manifestation in Automated Decision-Making Tasks | Feb 20, 2025 | Decision Making, Fairness | Unverified | 0 |
| The Impact and Feasibility of Self-Confidence Shaping for AI-Assisted Decision-Making | Feb 20, 2025 | Decision Making | Unverified | 0 |
| Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems | Feb 20, 2025 | Benchmarking, Decision Making | Unverified | 0 |
| STeCa: Step-level Trajectory Calibration for LLM Agent Learning | Feb 20, 2025 | Decision Making, Language Modeling | Code Available | 1 |
| Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation | Feb 20, 2025 | Decision Making, Efficient Exploration | Unverified | 0 |
| How Far are LLMs from Being Our Digital Twins? A Benchmark for Persona-Based Behavior Chain Simulation | Feb 20, 2025 | Decision Making | Code Available | 1 |
| MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models | Feb 20, 2025 | Decision Making, Hallucination | Unverified | 0 |
| Human Misperception of Generative-AI Alignment: A Laboratory Experiment | Feb 20, 2025 | Decision Making | Unverified | 0 |
| Online detection of forecast model inadequacies using forecast errors | Feb 20, 2025 | Decision Making | Unverified | 0 |
| Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition | Feb 19, 2025 | Attribute, Decision Making | Unverified | 0 |
| Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks | Feb 19, 2025 | Board Games, Decision Making | Code Available | 0 |
| Human-Artificial Interaction in the Age of Agentic AI: A System-Theoretical Approach | Feb 19, 2025 | Decision Making | Unverified | 0 |
| LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | Feb 19, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Benchmarking LLMs for Political Science: A United Nations Perspective | Feb 19, 2025 | Benchmarking, Decision Making | Code Available | 1 |
| Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region | Feb 19, 2025 | Decision Making, Safety Alignment | Unverified | 0 |
| RobustX: Robust Counterfactual Explanations Made Easy | Feb 19, 2025 | counterfactual, Decision Making | Code Available | 1 |
| AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain Recommendations | Feb 19, 2025 | Decision Making, Language Modeling | Code Available | 0 |
| Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI | Feb 19, 2025 | counterfactual, Decision Making | Code Available | 0 |
| AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence | Feb 19, 2025 | Code Generation, Decision Making | Code Available | 1 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Feb 19, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering | Feb 19, 2025 | Decision Making, Knowledge Base Question Answering | Unverified | 0 |
| Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements | Feb 18, 2025 | Decision Making, Fraud Detection | Code Available | 1 |
| LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets | Feb 18, 2025 | Decision Making | Unverified | 0 |
| MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding | Feb 18, 2025 | Decision Making | Unverified | 0 |
| Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL | Feb 18, 2025 | counterfactual, Deception Detection | Unverified | 0 |
| AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks | Feb 18, 2025 | Decision Making | Unverified | 0 |
| Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger | Feb 18, 2025 | Decision Making | Unverified | 0 |
| Value Gradient Sampler: Sampling as Sequential Decision Making | Feb 18, 2025 | Anomaly Detection, Decision Making | Code Available | 0 |
| Capturing Human Cognitive Styles with Language: Towards an Experimental Evaluation Paradigm | Feb 18, 2025 | Decision Making | Unverified | 0 |
| Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks | Feb 18, 2025 | Adversarial Attack, Autonomous Vehicles | Unverified | 0 |
| Conditional Max-Sum for Asynchronous Multiagent Decision Making | Feb 18, 2025 | Decision Making | Unverified | 0 |
| Adjust for Trust: Mitigating Trust-Induced Inappropriate Reliance on AI Assistance | Feb 18, 2025 | Decision Making | Unverified | 0 |
| AI-Assisted Decision Making with Human Learning | Feb 18, 2025 | Decision Making, Diagnostic | Unverified | 0 |
| Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making | Feb 17, 2025 | Decision Making, Ethics | Unverified | 0 |
| One for All: A General Framework of LLMs-based Multi-Criteria Decision Making on Human Expert Level | Feb 17, 2025 | All, Decision Making | Unverified | 0 |
| Unveiling Privacy Risks in LLM Agent Memory | Feb 17, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| Human-centered explanation does not fit all: The interplay of sociotechnical, cognitive, and individual factors in the effect AI explanations in algorithmic decision-making | Feb 17, 2025 | All, Decision Making | Unverified | 0 |
| Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Feb 17, 2025 | Decision Making, Mathematical Problem-Solving | Unverified | 0 |
| ExaGPT: Example-Based Machine-Generated Text Detection for Human Interpretability | Feb 17, 2025 | Decision Making, Text Detection | Unverified | 0 |
| QoS based resource management for concurrent operation using MCTS | Feb 17, 2025 | Decision Making, Management | Unverified | 0 |