SOTAVerified

Decision Making

Papers

Showing 501550 of 12311 papers

TitleStatusHype
Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and BeyondCode1
Towards Robust Fidelity for Evaluating Explainability of Graph Neural NetworksCode1
Deep Neural Networks Tend To Extrapolate PredictablyCode1
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language ModelsCode1
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement LearningCode1
Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive NegotiationCode1
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of AgentsCode1
Motif: Intrinsic Motivation from Artificial Intelligence FeedbackCode1
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4Code1
AdaRefiner: Refining Decisions of Language Models with Adaptive FeedbackCode1
Are Human-generated Demonstrations Necessary for In-context Learning?Code1
Maximum diffusion reinforcement learningCode1
RL-I2IT: Image-to-Image Translation with Deep Reinforcement LearningCode1
A Study on Learning Social Robot Navigation with Multimodal PerceptionCode1
Hierarchical Multi-Agent Reinforcement Learning for Air Combat ManeuveringCode1
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language FeedbackCode1
CFGPT: Chinese Financial Assistant with Large Language ModelCode1
MMST-ViT: Climate Change-aware Crop Yield Prediction via Multi-Modal Spatial-Temporal Vision TransformerCode1
LASER: LLM Agent with State-Space Exploration for Web NavigationCode1
TrafficGPT: Viewing, Processing and Interacting with Traffic Foundation ModelsCode1
The Moral Machine Experiment on Large Language ModelsCode1
Long-term drought prediction using deep neural networks based on geospatial weather dataCode1
A Survey on Interpretable Cross-modal ReasoningCode1
DeViL: Decoding Vision features into LanguageCode1
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and DutiesCode1
Emergent Linear Representations in World Models of Self-Supervised Sequence ModelsCode1
GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature FieldsCode1
survex: an R package for explaining machine learning survival modelsCode1
Two-Stage Violence Detection Using ViTPose and Classification Models at Smart AirportsCode1
Diagnosing Infeasible Optimization Problems Using Large Language ModelsCode1
Out of the Cage: How Stochastic Parrots Win in Cyber Security EnvironmentsCode1
Domain Generalization via Rationale InvarianceCode1
TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly DetectionCode1
"Guinea Pig Trials" Utilizing GPT: A Novel Smart Agent-Based Modeling Approach for Studying Firm Competition and CollusionCode1
Robust Uncertainty Quantification Using Conformalised Monte Carlo PredictionCode1
Equitable Restless Multi-Armed Bandits: A General Framework Inspired By Digital HealthCode1
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent ConversationCode1
Uncertainty Quantification for Image-based Traffic Prediction across CitiesCode1
A Comparative Visual Analytics Framework for Evaluating Evolutionary Processes in Multi-objective OptimizationCode1
WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search EngineCode1
SSL-SoilNet: A Hybrid Transformer-based Framework with Self-Supervised Learning for Large-scale Soil Organic Carbon PredictionCode1
Causal thinking for decision making on Electronic Health Records: why and howCode1
Explainable Deep Learning for Tumor Dynamic Modeling and Overall Survival Prediction using Neural-ODECode1
Synthesizing Event-centric Knowledge Graphs of Daily Activities Using Virtual SpaceCode1
ScribbleVC: Scribble-supervised Medical Image Segmentation with Vision-Class EmbeddingCode1
ProtoASNet: Dynamic Prototypes for Inherently Interpretable and Uncertainty-Aware Aortic Stenosis Classification in EchocardiographyCode1
Decision-Focused Learning: Foundations, State of the Art, Benchmark and Future OpportunitiesCode1
Continuation Path Learning for Homotopy OptimizationCode1
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop FeedbackCode1
Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working MemoryCode1
Show:102550
← PrevPage 11 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified