| EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries | Feb 25, 2024 | Decision MakingQuestion Answering | CodeCode Available | 1 |
| Reflect-RL: Two-Player Online RL Fine-Tuning for LMs | Feb 20, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 1 |
| XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques | Feb 20, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 1 |
| Dynamic planning in hierarchical active inference | Feb 18, 2024 | Decision Making | CodeCode Available | 1 |
| PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Feb 16, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Explaining generative diffusion models via visual analysis for interpretable decision-making process | Feb 16, 2024 | Decision MakingDenoising | CodeCode Available | 1 |
| Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution | Feb 13, 2024 | Decision MakingDeep Learning | CodeCode Available | 1 |
| TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection | Feb 12, 2024 | Decision MakingFake News Detection | CodeCode Available | 1 |
| Addressing cognitive bias in medical language models | Feb 12, 2024 | Decision Making | CodeCode Available | 1 |
| A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Feb 12, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Self-Calibrating Conformal Prediction | Feb 11, 2024 | Binary ClassificationConformal Prediction | CodeCode Available | 1 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code GenerationDecision Making | CodeCode Available | 1 |
| Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Feb 9, 2024 | Computational Efficiencycontinuous-control | CodeCode Available | 1 |
| Conformal Convolution and Monte Carlo Meta-learners for Predictive Inference of Individual Treatment Effects | Feb 7, 2024 | Decision MakingMarketing | CodeCode Available | 1 |
| Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Feb 7, 2024 | Decision Makingregression | CodeCode Available | 1 |
| Measuring Implicit Bias in Explicitly Unbiased Large Language Models | Feb 6, 2024 | Decision MakingDiagnostic | CodeCode Available | 1 |
| Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills | Feb 5, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Deep hybrid models: infer and plan in a dynamic world | Feb 1, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| LLM Voting: Human Choices and AI Collective Decision Making | Jan 31, 2024 | Decision MakingDiversity | CodeCode Available | 1 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 1 |
| Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data | Jan 25, 2024 | Decision MakingIn-Context Learning | CodeCode Available | 1 |
| Distributional Counterfactual Explanations With Optimal Transport | Jan 23, 2024 | counterfactualCounterfactual Explanation | CodeCode Available | 1 |
| HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments | Jan 23, 2024 | Common Sense ReasoningDecision Making | CodeCode Available | 1 |
| Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric | Jan 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| ORGANA: A Robotic Assistant for Automated Chemistry Experimentation and Characterization | Jan 13, 2024 | Decision MakingScheduling | CodeCode Available | 1 |
| PCB-Vision: A Multiscene RGB-Hyperspectral Benchmark Dataset of Printed Circuit Boards | Jan 12, 2024 | Decision Making | CodeCode Available | 1 |
| Uncertainty quantification for probabilistic machine learning in earth observation using conformal prediction | Jan 12, 2024 | Computational EfficiencyConformal Prediction | CodeCode Available | 1 |
| Uncertainty Quantification on Clinical Trial Outcome Prediction | Jan 7, 2024 | Decision MakingDrug Discovery | CodeCode Available | 1 |
| Escalation Risks from Language Models in Military and Diplomatic Decision-Making | Jan 7, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 1 |
| t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making | Jan 4, 2024 | Continual LearningDecision Making | CodeCode Available | 1 |
| Representation Learning of Multivariate Time Series using Attention and Adversarial Training | Jan 3, 2024 | counterfactualDecision Making | CodeCode Available | 1 |
| SwapTransformer: highway overtaking tactical planner model via imitation learning on OSHA dataset | Jan 2, 2024 | Decision MakingImitation Learning | CodeCode Available | 1 |
| IdentiFace : A VGG Based Multimodal Facial Biometric System | Jan 2, 2024 | Decision MakingEmotion Recognition | CodeCode Available | 1 |
| Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Dec 27, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning | Dec 26, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| LLM-SAP: Large Language Models Situational Awareness Based Planning | Dec 26, 2023 | Decision MakingLanguage Modelling | CodeCode Available | 1 |
| Multimodal Gen-AI for Fundamental Investment Research | Dec 24, 2023 | AI AgentDecision Making | CodeCode Available | 1 |
| DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared Knowledge | Dec 22, 2023 | Decision MakingTraffic Signal Control | CodeCode Available | 1 |
| Scalable Agent-Based Modeling for Complex Financial Market Simulations | Dec 22, 2023 | Decision MakingDistributed Computing | CodeCode Available | 1 |
| Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA | Dec 21, 2023 | Contrastive Learningcounterfactual | CodeCode Available | 1 |
| FiFAR: A Fraud Detection Dataset for Learning to Defer | Dec 20, 2023 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Parameterized Decision-making with Multi-modal Perception for Autonomous Driving | Dec 19, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Transformers in Unsupervised Structure-from-Motion | Dec 16, 2023 | Decision Makingimage-classification | CodeCode Available | 1 |
| auto-sktime: Automated Time Series Forecasting | Dec 13, 2023 | AutoMLBayesian Optimization | CodeCode Available | 1 |
| diff History for Neural Language Agents | Dec 12, 2023 | Decision MakingNetHack | CodeCode Available | 1 |
| Sequential Planning in Large Partially Observable Environments guided by LLMs | Dec 12, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 1 |
| Open Datasheets: Machine-readable Documentation for Open Datasets and Responsible AI Assessments | Dec 11, 2023 | Decision Making | CodeCode Available | 1 |
| BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving | Dec 11, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| DiffAIL: Diffusion Adversarial Imitation Learning | Dec 11, 2023 | Decision MakingImitation Learning | CodeCode Available | 1 |
| Using Large Language Models for Hyperparameter Optimization | Dec 7, 2023 | Bayesian OptimizationDecision Making | CodeCode Available | 1 |