SOTAVerified

Decision Making

Papers

Showing 2551–2600 of 12311 papers

| Title | Status | Hype |
| --- | --- | --- |
| Enhancing Radiological Diagnosis: A Collaborative Approach Integrating AI and Human Expertise for Visual Miss Correction | | 0 |
| Multimodal Data Integration for Precision Oncology: Challenges and Future Directions | | 0 |
| Practical Power System Inertia Monitoring Based on Pumped Storage Hydropower Operation Signature | | 0 |
| Operator World Models for Reinforcement Learning | Code | 0 |
| Boosting cattle face recognition under uncontrolled scenes by embedding enhancement and optimization | Code | 0 |
| Instance Temperature Knowledge Distillation | Code | 0 |
| Fine-tuned network relies on generic representation to solve unseen cognitive task | | 0 |
| Evidential Concept Embedding Models: Towards Reliable Concept Explanations for Skin Disease Diagnosis | Code | 1 |
| The Rise of Artificial Intelligence in Educational Measurement: Opportunities and Ethical Challenges | | 0 |
| Sequential three-way group decision-making for double hierarchy hesitant fuzzy linguistic term set | | 0 |
| Estimating Long-term Heterogeneous Dose-response Curve: Generalization Bound Leveraging Optimal Transport Weights | | 0 |
| From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions | Code | 0 |
| CELLO: Causal Evaluation of Large Vision-Language Models | Code | 1 |
| The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning | | 0 |
| FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | | 0 |
| Prompting Whole Slide Image Based Genetic Biomarker Prediction | Code | 0 |
| Complexity Aversion | | 0 |
| On Calibration of Speech Classification Models: Insights from Energy-Based Model Investigations | | 0 |
| Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks | | 0 |
| Kolmogorov-Arnold Graph Neural Networks | | 0 |
| Mental Modeling of Reinforcement Learning Agents by Language Models | | 0 |
| Multi-step Inference over Unstructured Data | | 0 |
| Evaluating the Efficacy of Foundational Models: Advancing Benchmarking Practices to Enhance Fine-Tuning Decision-Making | | 0 |
| Enhancing Explainability of Knowledge Learning Paths: Causal Knowledge Networks | | 0 |
| The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game | | 0 |
| Unbiasing on the Fly: Explanation-Guided Human Oversight of Machine Learning System Decisions | | 0 |
| Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Code | 0 |
| Decentralized Task Offloading and Load-Balancing for Mobile Edge Computing in Dense Networks | | 0 |
| Towards a Science Exocortex | | 0 |
| Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making | Code | 1 |
| Integrating Generative AI with Network Digital Twins for Enhanced Network Operations | | 0 |
| Large Language Models Assume People are More Rational than We Really are | Code | 0 |
| Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors | | 0 |
| What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Gaussian-Noise-free Text-Image Corruption and Evaluation | Code | 1 |
| QuadrupedGPT: Towards a Versatile Quadruped Agent in Open-ended Worlds | | 0 |
| Differentiable Distributionally Robust Optimization Layers | Code | 0 |
| Hacking a surrogate model approach to XAI | | 0 |
| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | | 0 |
| Conditional Bayesian Quadrature | Code | 0 |
| GPT-4V Explorations: Mining Autonomous Driving | | 0 |
| CAV-AHDV-CAV: Mitigating Traffic Oscillations for CAVs through a Novel Car-Following Structure and Reinforcement Learning | | 0 |
| Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGA | Code | 0 |
| Imperfect-Recall Games: Equilibrium Concepts and Their Complexity | | 0 |
| Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | | 0 |
| Adaptive Digital Twin and Communication-Efficient Federated Learning Network Slicing for 5G-enabled Internet of Things | | 0 |
| Learning Abstract World Model for Value-preserving Planning with Options | | 0 |
| Privacy Implications of Explainable AI in Data-Driven Systems | | 0 |
| CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | | 0 |
| PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology Images | Code | 0 |
| Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |