SOTAVerified

Decision Making

Papers

Showing 34013450 of 12311 papers

TitleStatusHype
When to Accept Automated Predictions and When to Defer to Human Judgment?0
Optimal Decision Making Through Scenario Simulations Using Large Language Models0
Economic span selection of bridge based on deep reinforcement learningCode0
Solving General Natural-Language-Description Optimization Problems with Large Language Models0
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction0
MDP Geometry, Normalization and Reward Balancing Solvers0
Learning to Complement and to Defer to Multiple UsersCode0
Evaluating Human-AI Collaboration: A Review and Methodological Framework0
Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge0
neuralGAM: Explainable generalized additive neural networks with independent neural network trainingCode0
Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile RobotsCode0
Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning0
Simulation-based Benchmarking for Causal Structure Learning in Gene Perturbation ExperimentsCode0
Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation0
MapsTP: HD Map Images Based Multimodal Trajectory Prediction for Automated Vehicles0
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation0
Towards Reliable Neural Optimizers: A Permutation Equivariant Neural Approximation for Information Processing Applications0
CLIMB: A Benchmark of Clinical Bias in Large Language ModelsCode0
BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records0
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT InferenceCode0
Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs0
Automating Venture Capital: Founder assessment using LLM-powered segmentation, feature engineering and automated labeling techniques0
Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions0
Graph Reinforcement Learning for Power Grids: A Comprehensive Survey0
Fair Submodular Cover0
Improving ensemble extreme precipitation forecasts using generative artificial intelligence0
Leveraging Graph Structures to Detect Hallucinations in Large Language ModelsCode0
Nash epidemics0
Short-Long Policy Evaluation with Novel Actions0
Prediction-Free Coordinated Dispatch of Microgrid: A Data-Driven Online Optimization Approach0
Quantifying Prediction Consistency Under Fine-Tuning Multiplicity in Tabular LLMs0
On Evaluating Explanation Utility for Human-AI Decision Making in NLPCode0
Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks0
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values0
xApp Distillation: AI-based Conflict Mitigation in B5G O-RAN0
On Large Language Models in National Security Applications0
Impact of Financial Literacy on Investment Decisions and Stock Market Participation using Extreme Learning Machines0
Predictions and Decision Making for Resilient Intelligent Sustainable Energy Systems0
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents0
Automated Knowledge Graph Learning in Industrial Processes0
Revolutionising Role-Playing Games with ChatGPT0
Research on Autonomous Robots Navigation based on Reinforcement Learning0
Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving0
CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications0
Distributional Regression U-Nets for the Postprocessing of Precipitation Ensemble ForecastsCode0
An Efficient and Sybil Attack Resistant Voting Mechanism0
Improving Trip Mode Choice Modeling Using Ensemble Synthesizer (ENSY)0
Multifidelity Cross-validation0
Improve ROI with Causal Learning and Conformal Prediction0
DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models0
Show:102550
← PrevPage 69 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified