SOTAVerified

Decision Making

Papers

Showing 501525 of 12311 papers

TitleStatusHype
Trainable Noise Model as an XAI evaluation method: application on Sobol for remote sensing image segmentationCode1
Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and BeyondCode1
Deep Neural Networks Tend To Extrapolate PredictablyCode1
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language ModelsCode1
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement LearningCode1
Motif: Intrinsic Motivation from Artificial Intelligence FeedbackCode1
AdaRefiner: Refining Decisions of Language Models with Adaptive FeedbackCode1
Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive NegotiationCode1
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4Code1
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of AgentsCode1
Are Human-generated Demonstrations Necessary for In-context Learning?Code1
Maximum diffusion reinforcement learningCode1
RL-I2IT: Image-to-Image Translation with Deep Reinforcement LearningCode1
A Study on Learning Social Robot Navigation with Multimodal PerceptionCode1
Hierarchical Multi-Agent Reinforcement Learning for Air Combat ManeuveringCode1
CFGPT: Chinese Financial Assistant with Large Language ModelCode1
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language FeedbackCode1
MMST-ViT: Climate Change-aware Crop Yield Prediction via Multi-Modal Spatial-Temporal Vision TransformerCode1
LASER: LLM Agent with State-Space Exploration for Web NavigationCode1
TrafficGPT: Viewing, Processing and Interacting with Traffic Foundation ModelsCode1
The Moral Machine Experiment on Large Language ModelsCode1
Long-term drought prediction using deep neural networks based on geospatial weather dataCode1
A Survey on Interpretable Cross-modal ReasoningCode1
DeViL: Decoding Vision features into LanguageCode1
Emergent Linear Representations in World Models of Self-Supervised Sequence ModelsCode1
Show:102550
← PrevPage 21 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified