SOTAVerified

Decision Making

Papers

Showing 35263550 of 12311 papers

TitleStatusHype
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM PipelinesCode0
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers0
Active Learning for Fair and Stable Online Allocations0
SituationalLLM: Proactive language models with scene awareness for dynamic, contextual task guidanceCode0
Research on fusing topological data analysis with convolutional neural network0
Analyzing Diversity in Healthcare LLM Research: A Scientometric Perspective0
FreqRISE: Explaining time series using frequency maskingCode0
Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?0
Reinforcing Pre-trained Models Using Counterfactual Images0
Combining Combined Forecasts: a Network Approach0
ARDuP: Active Region Video Diffusion for Universal Policies0
Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond0
Reasoning with trees: interpreting CNNs using hierarchiesCode0
Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation0
Solarcast-ML: Per Node GraphCast Extension for Solar Energy Production0
Utility Pole Fire Risk Inspection from 2D Street-Side Images0
UAV-based Intelligent Information Systems on Winter Road Safety for Autonomous Vehicles0
Investigating the Role of Explainability and AI Literacy in User Compliance0
MiSuRe is all you need to explain your image segmentation0
Hoping for the best while preparing for the worst in the face of uncertainty: a new type of incomplete preferences0
Optimal Transport-Assisted Risk-Sensitive Q-Learning0
Grade Score: Quantifying LLM Performance in Option SelectionCode0
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms0
Computing in the Life Sciences: From Early Algorithms to Modern AICode0
Efficient Sequential Decision Making with Large Language Models0
Show:102550
← PrevPage 142 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified