SOTAVerified

Decision Making

Papers

Showing 17511775 of 12311 papers

TitleStatusHype
AdvAgent: Controllable Blackbox Red-teaming on Web Agents0
Dynamic Adaptive Rank Space Exploration for Efficient Sentiment Analysis with Large Language Models0
Dynamic graph neural networks for enhanced volatility prediction in financial markets0
Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning0
Literature Meets Data: A Synergistic Approach to Hypothesis GenerationCode2
Improving Causal Reasoning in Large Language Models: A SurveyCode2
Hierarchical Upper Confidence Bounds for Constrained Online Learning0
Contrasting Attitudes Towards Current and Future AI Applications for Computerised Interpretation of ECG: A Clinical Stakeholder Interview Study0
Resource-Efficient Sensor Fusion via System-Wide Dynamic Gated Neural Networks0
How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making?Code0
Finite-Sample and Distribution-Free Fair Classification: Optimal Trade-off Between Excess Risk and Fairness, and the Cost of Group-Blindness0
How Performance Pressure Influences AI-Assisted Decision Making0
Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency0
Learning-to-Defer for Extractive Question Answering0
High-Fidelity Transfer of Functional Priors for Wide Bayesian Neural Networks by Learning ActivationsCode0
Solving Sparse \& High-Dimensional-Output Regression via Compression0
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language ModelsCode2
How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?0
A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM0
Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization0
Reflection-Bench: probing AI intelligence with reflectionCode1
Fine-Tuning LLMs for Reliable Medical Question-Answering Services0
Learning-Augmented Algorithms for the Bahncard Problem0
SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction0
TAGExplainer: Narrating Graph Explanations for Text-Attributed Graph Learning Models0
Show:102550
← PrevPage 71 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified