Decision Making

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 12311 papers

Title	Date	Tasks	Status	Hype
AutoWebGLM: A Large Language Model-based Web Navigating Agent	Apr 4, 2024	Decision MakingLanguage Modeling	CodeCode Available	4
A Survey on Large Language Model-Based Game Agents	Apr 2, 2024	Decision MakingLanguage Modeling	CodeCode Available	4
Eureka: Human-Level Reward Design via Coding Large Language Models	Oct 19, 2023	Decision MakingIn-Context Learning	CodeCode Available	4
Cognitive Architectures for Language Agents	Sep 5, 2023	Decision Making	CodeCode Available	4
AgentBench: Evaluating LLMs as Agents	Aug 7, 2023	Decision MakingInstruction Following	CodeCode Available	4
TorchRL: A data-driven decision-making library for PyTorch	Jun 1, 2023	Computational EfficiencyDecision Making	CodeCode Available	4
pgmpy: A Python Toolkit for Bayesian Networks	Apr 17, 2023	Causal DiscoveryCausal Identification	CodeCode Available	4
Reflexion: Language Agents with Verbal Reinforcement Learning	Mar 20, 2023	Decision MakingHumanEval	CodeCode Available	4
Mastering Diverse Domains through World Models	Jan 10, 2023	Atari Games 100kDecision Making	CodeCode Available	4
Constitutional AI: Harmlessness from AI Feedback	Dec 15, 2022	Decision Making	CodeCode Available	4
ReAct: Synergizing Reasoning and Acting in Language Models	Oct 6, 2022	Decision MakingFact Verification	CodeCode Available	4
A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning	Jun 3, 2025	Decision MakingDiagnostic	CodeCode Available	3
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution	Apr 9, 2025	2kDecision Making	CodeCode Available	3
Playing Non-Embedded Card-Based Games with Reinforcement Learning	Apr 7, 2025	Board GamesDecision Making	CodeCode Available	3
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB	Apr 1, 2025	Decision MakingRAG	CodeCode Available	3
Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective	Mar 24, 2025	Decision Making	CodeCode Available	3
A Survey on the Optimization of Large Language Model-based Agents	Mar 16, 2025	Decision MakingLanguage Modeling	CodeCode Available	3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems	Mar 5, 2025	Decision MakingLanguage Modeling	CodeCode Available	3
Automated Hypothesis Validation with Agentic Sequential Falsifications	Feb 14, 2025	Decision MakingHallucination	CodeCode Available	3
Rethinking Early Stopping: Refine, Then Calibrate	Jan 31, 2025	Decision Making	CodeCode Available	3
MineStudio: A Streamlined Package for Minecraft AI Agent Development	Dec 24, 2024	AI AgentDecision Making	CodeCode Available	3
Embodied CoT Distillation From LLM To Off-the-shelf Agents	Dec 16, 2024	Decision MakingIn-Context Learning	CodeCode Available	3
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games	Dec 14, 2024	Decision Making	CodeCode Available	3
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models	Nov 29, 2024	Decision MakingRAG	CodeCode Available	3
Game-theoretic LLM: Agent Workflow for Negotiation Games	Nov 8, 2024	Decision Making	CodeCode Available	3

Show:10 25 50

← PrevPage 2 of 493Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified