SOTAVerified

Decision Making

Papers

Showing 25012525 of 12311 papers

TitleStatusHype
BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records0
Nash epidemics0
Fair Submodular Cover0
Automating Venture Capital: Founder assessment using LLM-powered segmentation, feature engineering and automated labeling techniques0
Graph Reinforcement Learning for Power Grids: A Comprehensive Survey0
Leveraging Graph Structures to Detect Hallucinations in Large Language ModelsCode0
Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions0
Improving ensemble extreme precipitation forecasts using generative artificial intelligence0
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM AgentsCode2
Short-Long Policy Evaluation with Novel Actions0
Prediction-Free Coordinated Dispatch of Microgrid: A Data-Driven Online Optimization Approach0
Quantifying Prediction Consistency Under Fine-Tuning Multiplicity in Tabular LLMs0
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the WildCode2
Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks0
Impact of Financial Literacy on Investment Decisions and Stock Market Participation using Extreme Learning Machines0
On Large Language Models in National Security Applications0
On Evaluating Explanation Utility for Human-AI Decision Making in NLPCode0
Predictions and Decision Making for Resilient Intelligent Sustainable Energy Systems0
xApp Distillation: AI-based Conflict Mitigation in B5G O-RAN0
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values0
Cloud-Edge-Terminal Collaborative AIGC for Autonomous Driving0
Language Model Alignment in Multilingual Trolley ProblemsCode1
Research on Autonomous Robots Navigation based on Reinforcement Learning0
Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents0
Revolutionising Role-Playing Games with ChatGPT0
Show:102550
← PrevPage 101 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified