| DEFN: Dual-Encoder Fourier Group Harmonics Network for Three-Dimensional Indistinct-Boundary Object Segmentation | Nov 1, 2023 | 3D ReconstructionData Augmentation | CodeCode Available | 1 |
| Advances in Embodied Navigation Using Large Language Models: A Survey | Nov 1, 2023 | Decision Making | CodeCode Available | 1 |
| Interpretable Prototype-based Graph Information Bottleneck | Oct 30, 2023 | Decision MakingPrediction | CodeCode Available | 1 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision MakingOffline RL | CodeCode Available | 1 |
| Hierarchical Framework for Interpretable and Probabilistic Model-Based Safe Reinforcement Learning | Oct 28, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images | Oct 28, 2023 | Decision MakingMedical Visual Question Answering | CodeCode Available | 1 |
| Tree Prompting: Efficient Task Adaptation without Fine-Tuning | Oct 21, 2023 | ClassificationDecision Making | CodeCode Available | 1 |
| EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities | Oct 16, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes | Oct 16, 2023 | Decision MakingMath | CodeCode Available | 1 |
| Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis | Oct 15, 2023 | AnatomyComputed Tomography (CT) | CodeCode Available | 1 |
| On Statistical Learning of Branch and Bound for Vehicle Routing Optimization | Oct 15, 2023 | Decision MakingGraph Attention | CodeCode Available | 1 |
| QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking | Oct 11, 2023 | Decision MakingFact Checking | CodeCode Available | 1 |
| Explainable Image Similarity: Integrating Siamese Networks and Grad-CAM | Oct 11, 2023 | counterfactualDecision Making | CodeCode Available | 1 |
| Linear Latent World Models in Simple Transformers: A Case Study on Othello-GPT | Oct 11, 2023 | Decision Making | CodeCode Available | 1 |
| What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models | Oct 10, 2023 | BenchmarkingCode Generation | CodeCode Available | 1 |
| Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models | Oct 8, 2023 | Claim VerificationDecision Making | CodeCode Available | 1 |
| AvalonBench: Evaluating LLMs Playing the Game of Avalon | Oct 8, 2023 | Decision Making | CodeCode Available | 1 |
| Deep Learning for Two-Stage Robust Integer Optimization | Oct 6, 2023 | Decision MakingDeep Learning | CodeCode Available | 1 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RLDecision Making | CodeCode Available | 1 |
| Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning | Oct 4, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use | Oct 4, 2023 | Decision Making | CodeCode Available | 1 |
| Trainable Noise Model as an XAI evaluation method: application on Sobol for remote sensing image segmentation | Oct 3, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks | Oct 3, 2023 | Decision Making | CodeCode Available | 1 |
| Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving | Oct 3, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI | Oct 3, 2023 | Decision Making | CodeCode Available | 1 |