SOTAVerified

Decision Making

Papers

Showing 2501–2550 of 12311 papers

Title | Status | Hype
BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records | | 0
Nash epidemics | | 0
Fair Submodular Cover | | 0
Graph Reinforcement Learning for Power Grids: A Comprehensive Survey | | 0
Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions | | 0
Automating Venture Capital: Founder assessment using LLM-powered segmentation, feature engineering and automated labeling techniques | | 0
Improving ensemble extreme precipitation forecasts using generative artificial intelligence | | 0
Leveraging Graph Structures to Detect Hallucinations in Large Language Models | Code | 0
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents | Code | 2
Short-Long Policy Evaluation with Novel Actions | | 0
Quantifying Prediction Consistency Under Fine-Tuning Multiplicity in Tabular LLMs | | 0
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Code | 2
Prediction-Free Coordinated Dispatch of Microgrid: A Data-Driven Online Optimization Approach | | 0
Impact of Financial Literacy on Investment Decisions and Stock Market Participation using Extreme Learning Machines | | 0
Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks | | 0
On Large Language Models in National Security Applications | | 0
On Evaluating Explanation Utility for Human-AI Decision Making in NLP | Code | 0
Predictions and Decision Making for Resilient Intelligent Sustainable Energy Systems | | 0
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values | | 0
xApp Distillation: AI-based Conflict Mitigation in B5G O-RAN | | 0
Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving | | 0
Research on Autonomous Robots Navigation based on Reinforcement Learning | | 0
Language Model Alignment in Multilingual Trolley Problems | Code | 1
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents | | 0
Distributional Regression U-Nets for the Postprocessing of Precipitation Ensemble Forecasts | Code | 0
Automated Knowledge Graph Learning in Industrial Processes | | 0
Revolutionising Role-Playing Games with ChatGPT | | 0
CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications | | 0
An Efficient and Sybil Attack Resistant Voting Mechanism | | 0
Multifidelity Cross-validation | | 0
DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models | | 0
View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior | Code | 0
Let Hybrid A* Path Planner Obey Traffic Rules: A Deep Reinforcement Learning-Based Planning Framework | | 0
EconNLI: Evaluating Large Language Models on Economics Reasoning | Code | 0
Improve ROI with Causal Learning and Conformal Prediction | | 0
OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos | | 0
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion | Code | 9
Improving Trip Mode Choice Modeling Using Ensemble Synthesizer (ENSY) | | 0
MUSE-Net: Missingness-aware mUlti-branching Self-attention Encoder for Irregular Longitudinal Electronic Health Records | | 0
Enhancing Travel Decision-Making: A Contrastive Learning Approach for Personalized Review Rankings in Accommodations | | 0
Exploring a Physics-Informed Decision Transformer for Distribution System Restoration: Methodology and Performance Analysis | | 0
Deep Reinforcement Learning Strategies in Finance: Insights into Asset Holding, Trading Behavior, and Purchase Diversity | | 0
Unraveling the Versatility and Impact of Multi-Objective Optimization: Algorithms, Applications, and Trends for Solving Complex Real-World Problems | | 0
Balancing Forecast Accuracy and Switching Costs in Online Optimization of Energy Management Systems | Code | 0
Nonequilibrium dynamics and thermodynamics provide the underlying physical mechanism of the perceptual rivalry | | 0
A Rule-Based Behaviour Planner for Autonomous Driving | | 0
AI Age Discrepancy: A Novel Parameter for Frailty Assessment in Kidney Tumor Patients | | 0
PUZZLES: A Benchmark for Neural Algorithmic Reasoning | Code | 1
Tradeoffs When Considering Deep Reinforcement Learning for Contingency Management in Advanced Air Mobility | | 0
Evaluating Human Alignment and Model Faithfulness of LLM Rationale | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified