Toward Self-learning End-to-End Task-Oriented Dialog Systems Jan 18, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Programmatic Policy Extraction by Iterative Local Search Jan 18, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning Jan 18, 2022 Data Augmentation Deep Reinforcement Learning
— Unverified 0Differentially Private Reinforcement Learning with Linear Function Approximation Jan 18, 2022 Privacy Preserving reinforcement-learning
— Unverified 0K-nearest Multi-agent Deep Reinforcement Learning for Collaborative Tasks with a Variable Number of Agents Jan 18, 2022 Deep Reinforcement Learning Management
— Unverified 0Designing realistic RL environment for power systems Jan 17, 2022 Reinforcement Learning (RL)
— Unverified 0Exploration by Random Network Distillation Jan 17, 2022 Atari Games Deep Reinforcement Learning
— Unverified 0An Understanding of Learning from Demonstrations for Neural Text Generation Jan 17, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Implementations that Matter in Cooperative Multi-Agent Reinforcement Learning Jan 17, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 011 Summaries of Papers on Explainable Reinforcement Learning With Some Commentary Jan 17, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0State of the Art of Reinforcement Learning Jan 17, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning Jan 17, 2022 Decoder Navigate
Code Code Available 1Towards deep observation: A systematic survey on artificial intelligence techniques to monitor fetus via Ultrasound Images Jan 17, 2022 Anatomy Miscellaneous
— Unverified 0Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction Jan 17, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Spatiotemporal Costmap Inference for MPC via Deep Inverse Reinforcement Learning Jan 17, 2022 Autonomous Driving reinforcement-learning
— Unverified 0Railway Operation Rescheduling System via Dynamic Simulation and Reinforcement Learning Jan 17, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Chaining Value Functions for Off-Policy Learning Jan 17, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0An Improved Reinforcement Learning Algorithm for Learning to Branch Jan 17, 2022 Combinatorial Optimization Imitation Learning
— Unverified 0Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture Jan 16, 2022 Language Modeling Language Modelling
— Unverified 0Reinforcement Learning with Large Action Spaces for Neural Machine Translation Jan 16, 2022 Machine Translation NMT
— Unverified 0MUST: A Framework for Training Task-oriented Dialogue Systems with Multiple User SimulaTors Jan 16, 2022 Reinforcement Learning (RL) Task-Oriented Dialogue Systems
— Unverified 0Revisiting the Roles of “Text” in Text Games Jan 16, 2022 Natural Language Understanding Passage Retrieval
— Unverified 0Unsupervised Reinforcement Adaptation for Class-Imbalanced TextClassification Jan 16, 2022 Domain Adaptation reinforcement-learning
— Unverified 0Parsing Natural Language into Propositional and First-Order Logic with Dual Reinforcement Learning Jan 16, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Long-Tail Classification for Distinctive Image Captioning: A Simple yet Effective Remedy for Side Effects of Reinforcement Learning Jan 16, 2022 Image Captioning Reinforcement Learning (RL)
— Unverified 0CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning Jan 16, 2022 Conversational Question Answering Passage Retrieval
— Unverified 0A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning Jan 16, 2022 Deep Reinforcement Learning Hierarchical Reinforcement Learning
— Unverified 0Inherently Explainable Reinforcement Learning in Natural Language Jan 16, 2022 Graph Attention reinforcement-learning
— Unverified 0Learning from Atypical Behavior: Temporary Interest Aware Recommendation Based on Reinforcement Learning Jan 16, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Interpretable and Effective Reinforcement Learning for Attacking against Graph-based Rumor Detection Jan 15, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Deep Reinforcement Learning for Shared Autonomous Vehicles (SAV) Fleet Management Jan 15, 2022 Autonomous Vehicles Deep Reinforcement Learning
— Unverified 0Block Policy Mirror Descent Jan 15, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Profitable Strategy Design by Using Deep Reinforcement Learning for Trades on Cryptocurrency Markets Jan 15, 2022 Deep Reinforcement Learning Imitation Learning
— Unverified 0Recursive Least Squares Advantage Actor-Critic Algorithms Jan 15, 2022 Computational Efficiency continuous-control
— Unverified 0Reinforcement Learning based Air Combat Maneuver Generation Jan 14, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning to Solve NP-hard Problems: an Application to the CVRP Jan 14, 2022 Combinatorial Optimization reinforcement-learning
— Unverified 0Demystifying Reinforcement Learning in Time-Varying Systems Jan 14, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning Jan 14, 2022 model MuJoCo
— Unverified 0Smart Magnetic Microrobots Learn to Swim with Deep Reinforcement Learning Jan 14, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Solving Dynamic Graph Problems with Multi-Attention Deep Reinforcement Learning Jan 13, 2022 Combinatorial Optimization Deep Reinforcement Learning
Code Code Available 1Weakly Supervised Scene Text Detection using Deep Reinforcement Learning Jan 13, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Direct Mutation and Crossover in Genetic Algorithms Applied to Reinforcement Learning Tasks Jan 13, 2022 OpenAI Gym reinforcement-learning
— Unverified 0Automated Reinforcement Learning: An Overview Jan 13, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning Jan 13, 2022 Q-Learning reinforcement-learning
— Unverified 0Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning Jan 12, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Dyna-T: Dyna-Q and Upper Confidence Bounds Applied to Trees Jan 12, 2022 Reinforcement Learning (RL)
— Unverified 0Toddler-Guidance Learning: Impacts of Critical Period on Multimodal AI Agents Jan 12, 2022 Reinforcement Learning (RL) Transfer Learning
— Unverified 0The Recurrent Reinforcement Learning Crypto Agent Jan 12, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning Jan 12, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Task Independent Capsule-Based Agents for Deep Q-Learning Jan 11, 2022 Deep Reinforcement Learning Object Recognition
— Unverified 0