| MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering | Feb 19, 2025 | Decision MakingKnowledge Base Question Answering | —Unverified | 0 |
| Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements | Feb 18, 2025 | Decision MakingFraud Detection | CodeCode Available | 1 |
| LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets | Feb 18, 2025 | Decision Making | —Unverified | 0 |
| Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL | Feb 18, 2025 | counterfactualDeception Detection | —Unverified | 0 |
| MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding | Feb 18, 2025 | Decision Making | —Unverified | 0 |
| AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks | Feb 18, 2025 | Decision Making | —Unverified | 0 |
| Value Gradient Sampler: Sampling as Sequential Decision Making | Feb 18, 2025 | Anomaly DetectionDecision Making | CodeCode Available | 0 |
| Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger | Feb 18, 2025 | Decision Making | —Unverified | 0 |
| Capturing Human Cognitive Styles with Language: Towards an Experimental Evaluation Paradigm | Feb 18, 2025 | Decision Making | —Unverified | 0 |
| Adjust for Trust: Mitigating Trust-Induced Inappropriate Reliance on AI Assistance | Feb 18, 2025 | Decision Making | —Unverified | 0 |