SOTAVerified

Decision Making

Papers

Showing 1051-1100 of 12311 papers

Title | Status | Hype
Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool | | 0
Explainability-Driven Quality Assessment for Rule-Based Systems | | 0
TeLL-Drive: Enhancing Autonomous Driving with Teacher LLM-Guided Deep Reinforcement Learning | | 0
Compact Rule-Based Classifier Learning via Gradient Descent | | 0
Meta-Prompt Optimization for LLM-Based Sequential Decision Making | | 0
RTBAgent: A LLM-based Agent System for Real-Time Bidding | Code | 1
Decision-informed Neural Networks with Large Language Model Integration for Portfolio Optimization | | 0
Compositional Concept-Based Neuron-Level Interpretability for Deep Reinforcement Learning | | 0
INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation | | 0
Stochastic Linear Bandits with Latent Heterogeneity | | 0
Doubly Robust Monte Carlo Tree Search | | 0
A Differentiated Reward Method for Reinforcement Learning based Multi-Vehicle Cooperative Decision-Making Algorithms | | 0
MarketSenseAI 2.0: Enhancing Stock Analysis through LLM Agents | | 0
A Comprehensive Review: Applicability of Deep Neural Networks in Business Decision Making and Market Prediction Investment | | 0
Evaluating Deep Human-in-the-Loop Optimization for Retinal Implants Using Sighted Participants | | 0
Frame-dependent Random Utility | | 0
CoSTI: Consistency Models for (a faster) Spatio-Temporal Imputation | Code | 0
Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Code | 0
Offline Learning for Combinatorial Multi-armed Bandits | | 0
Vintix: Action Model via In-Context Reinforcement Learning | Code | 1
Rethinking Early Stopping: Refine, Then Calibrate | Code | 3
Towards Adaptive Self-Improvement for Smarter Energy Systems | | 0
Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge | | 0
Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning | | 0
Deceptive Sequential Decision-Making via Regularized Policy Optimization | | 0
Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization | | 0
Using Computer Vision for Skin Disease Diagnosis in Bangladesh Enhancing Interpretability and Transparency in Deep Learning Models for Skin Cancer Classification | | 0
Normative Evaluation of Large Language Models with Everyday Moral Dilemmas | | 0
Bandits with Anytime Knapsacks | | 0
Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents | | 0
Statistical multi-metric evaluation and visualization of LLM system predictive performance | | 0
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding | | 0
Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation | | 0
Contextual Online Decision Making with Infinite-Dimensional Functional Regression | | 0
Interpretable Dual-Filter Fuzzy Neural Networks for Affective Brain-Computer Interfaces | | 0
LEKA:LLM-Enhanced Knowledge Augmentation | | 0
Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant | Code | 0
Decision-Theoretic Approaches in Learning-Augmented Algorithms | | 0
ASAP: Learning Generalizable Online Bin Packing via Adaptive Selection After Pruning | | 0
Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models | | 0
Open-Source Retrieval Augmented Generation Framework for Retrieving Accurate Medication Insights from Formularies for African Healthcare Workers | | 0
Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems | | 0
WASUP: Interpretable Classification with Weight-Input Alignment and Class-Discriminative SUPports Vectors | | 0
Engaging with AI: How Interface Design Shapes Human-AI Collaboration in High-Stakes Decision-Making | | 0
RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains | Code | 0
Towards Resource-Efficient Compound AI Systems | | 0
Impact and influence of modern AI in metadata management | | 0
Safe Gradient Flow for Bilevel Optimization | Code | 0
Integrating Probabilistic Trees and Causal Networks for Clinical and Epidemiological Data | | 0
Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified