SOTAVerified

Decision Making

Papers

Showing 451500 of 12311 papers

TitleStatusHype
Developing Optimal Causal Cyber-Defence Agents via Cyber Security SimulationCode1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
Bias in Multimodal AI: Testbed for Fair Automatic RecruitmentCode1
Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in ChessCode1
DeViL: Decoding Vision features into LanguageCode1
Dense Uncertainty EstimationCode1
BIMCaP: BIM-based AI-supported LiDAR-Camera Pose RefinementCode1
Binocular Mutual Learning for Improving Few-shot ClassificationCode1
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing InducementsCode1
DEFN: Dual-Encoder Fourier Group Harmonics Network for Three-Dimensional Indistinct-Boundary Object SegmentationCode1
From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language ModelsCode1
Detect and Locate: Exposing Face Manipulation by Semantic- and Noise-level TelltalesCode1
CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy LearningCode1
Frustum-PointPillars: A Multi-Stage Approach for 3D Object Detection using RGB Camera and LiDARCode1
Future-conditioned Unsupervised Pretraining for Decision TransformerCode1
DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated ObjectsCode1
Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 ChallengeCode1
Deep Reinforcement Learning For Sequence to Sequence ModelsCode1
GATSBI: Generative Agent-centric Spatio-temporal Object InteractionCode1
Building a Scalable and Interpretable Bayesian Deep Learning Framework for Quality Control of Free Form SurfacesCode1
BuildingView: Constructing Urban Building Exteriors Databases with Street View Imagery and Multimodal Large Language ModeCode1
Bundle Recommendation with Graph Convolutional NetworksCode1
Generalized Linear Bandits with Local Differential PrivacyCode1
Deep Reinforcement Learning with Task-Adaptive Retrieval via HypernetworkCode1
Generating Hierarchical Explanations on Text Classification via Feature Interaction DetectionCode1
Genetic programming approaches to learning fair classifiersCode1
GisPy: A Tool for Measuring Gist Inference Score in TextCode1
Deep Reinforcement Learning for Entity AlignmentCode1
Adaptive Conformal Predictions for Time SeriesCode1
RL-I2IT: Image-to-Image Translation with Deep Reinforcement LearningCode1
CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement LearningCode1
Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven OptimizationCode1
DG-Trans: Dual-level Graph Transformer for Spatiotemporal Incident Impact Prediction on Traffic NetworksCode1
Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical DiagnosisCode1
Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic EnvironmentsCode1
Accuracy and Fairness Trade-offs in Machine Learning: A Stochastic Multi-Objective ApproachCode1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI GymCode1
Group-Aware Coordination Graph for Multi-Agent Reinforcement LearningCode1
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMsCode1
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language ModelsCode1
"Guinea Pig Trials" Utilizing GPT: A Novel Smart Agent-Based Modeling Approach for Studying Firm Competition and CollusionCode1
CLIMAT: Clinically-Inspired Multi-Agent Transformers for Knee Osteoarthritis Trajectory ForecastingCode1
Handcrafted Histological Transformer (H2T): Unsupervised Representation of Whole Slide ImagesCode1
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of ExplainabilityCode1
Hematoxylin and eosin stained oral squamous cell carcinoma histological images datasetCode1
An Introduction to Deep Reinforcement LearningCode1
Deep Policies for Online Bipartite Matching: A Reinforcement Learning ApproachCode1
Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and ActingCode1
Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions ModelingCode1
Show:102550
← PrevPage 10 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified