| Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym | Dec 6, 2023 | Benchmarking, Decision Making | Code Available | 1 |
| Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving | Dec 6, 2023 | Autonomous Driving, Decision Making | Code Available | 1 |
| RiskBench: A Scenario-based Benchmark for Risk Identification | Dec 4, 2023 | Decision Making | Code Available | 1 |
| MEDPSeg: Hierarchical polymorphic multitask learning for the segmentation of ground-glass opacities, consolidation, and pulmonary structures on computed tomography | Dec 4, 2023 | Anatomy, Computed Tomography (CT) | Code Available | 1 |
| Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations | Nov 28, 2023 | Decision Making | Code Available | 1 |
| Utilizing Explainability Techniques for Reinforcement Learning Model Assurance | Nov 27, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models | Nov 27, 2023 | Decision Making, Question Answering | Code Available | 1 |
| VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG | Nov 24, 2023 | Action Recognition, Decision Making | Code Available | 1 |
| Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents | Nov 22, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| Physical Reasoning and Object Planning for Household Embodied Agents | Nov 22, 2023 | 2k, Decision Making | Code Available | 1 |
| Labeling Neural Representations with Inverse Recognition | Nov 22, 2023 | Decision Making, Segmentation | Code Available | 1 |
| From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models | Nov 21, 2023 | Decision Making | Code Available | 1 |
| DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation | Nov 16, 2023 | Decision Making, Instruction Following | Code Available | 1 |
| Inherently Interpretable Time Series Classification via Multiple Instance Learning | Nov 16, 2023 | Decision Making, Multiple Instance Learning | Code Available | 1 |
| ToolTalk: Evaluating Tool-Usage in a Conversational Setting | Nov 15, 2023 | Decision Making | Code Available | 1 |
| XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs | Nov 15, 2023 | Decision Making, Decoder | Code Available | 1 |
| A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering | Nov 13, 2023 | Decision Making, Explanation Generation | Code Available | 1 |
| Real-Time Machine-Learning-Based Optimization Using Input Convex Long Short-Term Memory Network | Nov 13, 2023 | Chemical Process, Computational Efficiency | Code Available | 1 |
| Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization Regime | Nov 13, 2023 | Benchmarking, Combinatorial Optimization | Code Available | 1 |
| MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty | Nov 10, 2023 | Autonomous Vehicles, Decision Making | Code Available | 1 |
| ADaPT: As-Needed Decomposition and Planning with Language Models | Nov 8, 2023 | Decision Making | Code Available | 1 |
| Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation | Nov 7, 2023 | Decision Making | Code Available | 1 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| Cal-DETR: Calibrated Detection Transformer | Nov 6, 2023 | Decision Making | Code Available | 1 |
| An algorithmic framework for synthetic cost-aware decision making in molecular design | Nov 3, 2023 | Decision Making, Property Prediction | Code Available | 1 |