| Enhancing Safety in Mixed Traffic: Learning-Based Modeling and Efficient Control of Autonomous and Human-Driven Vehicles | Apr 10, 2024 | Autonomous Vehicles, Model Predictive Control | Code Available | 1 |
| FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting | Mar 19, 2024 | Deep Reinforcement Learning, Navigate | Code Available | 1 |
| PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Mar 14, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces | Mar 12, 2024 | GPU, Image Generation | Code Available | 1 |
| Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos | Mar 5, 2024 | Logical Sequence, Navigate | Code Available | 1 |
| Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | Feb 29, 2024 | Navigate | Code Available | 1 |
| MemoNav: Working Memory Model for Visual Navigation | Feb 29, 2024 | Decision Making, Graph Attention | Code Available | 1 |
| Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation | Feb 25, 2024 | Navigate | Code Available | 1 |
| No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices | Feb 25, 2024 | Navigate | Code Available | 1 |
| Task-Oriented Dialogue with In-Context Learning | Feb 19, 2024 | In-Context Learning, Navigate | Code Available | 1 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision Making, Efficient Exploration | Code Available | 1 |
| PlasmoData.jl -- A Julia Framework for Modeling and Analyzing Complex Data as Graphs | Jan 21, 2024 | image-classification, Image Classification | Code Available | 1 |
| An Interactive Navigation Method with Effect-oriented Affordance | Jan 1, 2024 | Navigate, Visual Navigation | Code Available | 1 |
| WebVLN: Vision-and-Language Navigation on Websites | Dec 25, 2023 | Navigate, Vision and Language Navigation | Code Available | 1 |
| BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation | Dec 23, 2023 | Camera Localization, Cross-View Geo-Localisation | Code Available | 1 |
| A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks | Dec 20, 2023 | Model Selection, Navigate | Code Available | 1 |
| Sample-Efficient Learning to Solve a Real-World Labyrinth Game Using Data-Augmented Model-Based Reinforcement Learning | Dec 15, 2023 | Model-based Reinforcement Learning, Navigate | Code Available | 1 |
| NLLG Quarterly arXiv Report 09/23: What are the most influential current AI Papers? | Dec 9, 2023 | Navigate | Code Available | 1 |
| DBCopilot: Natural Language Querying over Massive Databases via Schema Routing | Dec 6, 2023 | Navigate, Question Generation | Code Available | 1 |
| StoryGPT-V: Large Language Models as Consistent Story Visualizers | Dec 4, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| Advances in 3D Neural Stylization: A Survey | Nov 30, 2023 | Navigate, Neural Stylization | Code Available | 1 |
| Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation | Nov 22, 2023 | Navigate, Test-time Adaptation | Code Available | 1 |
| Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Federated Object Detection | Oct 26, 2023 | Autonomous Driving, Federated Learning | Code Available | 1 |
| BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational Bioimaging | Oct 23, 2023 | Chatbot, Information Retrieval | Code Available | 1 |
| Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLA | Oct 23, 2023 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 1 |
| Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation | Oct 12, 2023 | Navigate, Object | Code Available | 1 |
| Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning | Oct 5, 2023 | Navigate, Spatial Reasoning | Code Available | 1 |
| Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View | Oct 3, 2023 | Navigate | Code Available | 1 |
| Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models | Sep 29, 2023 | Form, Navigate | Code Available | 1 |
| Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs | Sep 27, 2023 | Form, Navigate | Code Available | 1 |
| Semantic Map Learning of Traffic Light to Lane Assignment based on Motion Data | Sep 26, 2023 | Autonomous Vehicles, motion prediction | Code Available | 1 |
| A Study on Learning Social Robot Navigation with Multimodal Perception | Sep 22, 2023 | Decision Making, Navigate | Code Available | 1 |
| Towards Data-centric Graph Machine Learning: Review and Outlook | Sep 20, 2023 | Management, Navigate | Code Available | 1 |
| CFGPT: Chinese Financial Assistant with Large Language Model | Sep 19, 2023 | Decision Making, Financial Analysis | Code Available | 1 |
| Mask4D: End-to-End Mask-Based 4D Panoptic Segmentation for LiDAR Sequences | Sep 18, 2023 | 3D Panoptic Segmentation, 4D Panoptic Segmentation | Code Available | 1 |
| Reasoning about the Unseen for Efficient Outdoor Object Navigation | Sep 18, 2023 | Navigate, Object | Code Available | 1 |
| Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL | Sep 13, 2023 | Arithmetic Reasoning, Navigate | Code Available | 1 |
| SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments | Sep 8, 2023 | Common Sense Reasoning, Navigate | Code Available | 1 |
| 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation | Aug 31, 2023 | Navigate, Referring Expression | Code Available | 1 |
| Manipulating Embeddings of Stable Diffusion Prompts | Aug 23, 2023 | Image Generation, Navigate | Code Available | 1 |
| Bird's-Eye-View Scene Graph for Vision-Language Navigation | Aug 9, 2023 | Navigate, Vision-Language Navigation | Code Available | 1 |
| GridMM: Grid Memory Map for Vision-and-Language Navigation | Jul 24, 2023 | Navigate, Vision and Language Navigation | Code Available | 1 |
| Learning Vision-and-Language Navigation from YouTube Videos | Jul 22, 2023 | Navigate, Vision and Language Navigation | Code Available | 1 |
| LAMP: Leveraging Language Prompts for Multi-person Pose Estimation | Jul 21, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| TimeTuner: Diagnosing Time Representations for Time-Series Forecasting with Counterfactual Explanations | Jul 19, 2023 | counterfactual, Feature Engineering | Code Available | 1 |
| SentimentGPT: Exploiting GPT for Advanced Sentiment Analysis and its Departure from Current Machine Learning | Jul 16, 2023 | Navigate, Prompt Engineering | Code Available | 1 |
| Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments | Jul 15, 2023 | Decoder, Grounded Situation Recognition | Code Available | 1 |
| PowerBEV: A Powerful Yet Lightweight Framework for Instance Prediction in Bird's-Eye View | Jun 19, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| Decentralized Social Navigation with Non-Cooperative Robots via Bi-Level Optimization | Jun 15, 2023 | Collision Avoidance, Multi-agent Reinforcement Learning | Code Available | 1 |
| Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement Learning | Jun 11, 2023 | Navigate, reinforcement-learning | Code Available | 1 |