| A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments | Dec 19, 2023 | Reinforcement Learning (RL)Zero-shot Generalization | —Unverified | 0 |
| Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning | Dec 19, 2023 | DiversityInstruction Following | —Unverified | 0 |
| Towards the Unification of Generative and Discriminative Visual Foundation Model: A Survey | Dec 15, 2023 | Image GenerationImage Segmentation | —Unverified | 0 |
| MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning | Dec 14, 2023 | DecoderLanguage Modelling | —Unverified | 0 |
| Adaptive Human Trajectory Prediction via Latent Corridors | Dec 11, 2023 | PredictionTrajectory Prediction | —Unverified | 0 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | Dec 7, 2023 | Hard AttentionImage Generation | —Unverified | 0 |
| MASP: Scalable GNN-based Planning for Multi-Agent Navigation | Dec 5, 2023 | Reinforcement Learning (RL)Zero-shot Generalization | —Unverified | 0 |
| I-PHYRE: Interactive Physical Reasoning | Dec 4, 2023 | Zero-shot Generalization | —Unverified | 0 |
| Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent | Nov 30, 2023 | Autonomous VehiclesCommon Sense Reasoning | —Unverified | 0 |
| Large Model Based Referring Camouflaged Object Detection | Nov 28, 2023 | modelObject | —Unverified | 0 |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | Nov 28, 2023 | BenchmarkingInformation Retrieval | —Unverified | 0 |
| C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing | Nov 27, 2023 | Language ModellingPrompt Learning | —Unverified | 0 |
| A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs | Nov 21, 2023 | object-detectionObject Detection | —Unverified | 0 |
| Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual Speech | Nov 15, 2023 | Contrastive LearningCross-corpus | CodeCode Available | 0 |
| Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts | Nov 15, 2023 | Question AnsweringSentence | CodeCode Available | 0 |
| Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels | Nov 12, 2023 | PathfinderVisual Reasoning | —Unverified | 0 |
| A Simple yet Efficient Ensemble Approach for AI-generated Text Detection | Nov 6, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE | Nov 5, 2023 | DecoderMixture-of-Experts | CodeCode Available | 0 |
| Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization | Nov 2, 2023 | Domain GeneralizationPrompt Learning | —Unverified | 0 |
| Neural Field Dynamics Model for Granular Object Piles Manipulation | Nov 1, 2023 | ObjectZero-shot Generalization | —Unverified | 0 |
| ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters | Oct 25, 2023 | Cross-Lingual TransferLanguage Modelling | CodeCode Available | 0 |
| Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models | Oct 23, 2023 | Skill GeneralizationZero-shot Generalization | —Unverified | 0 |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | Oct 11, 2023 | 4kDecoder | —Unverified | 0 |
| What Matters to You? Towards Visual Representation Alignment for Robot Learning | Oct 11, 2023 | Zero-shot Generalization | —Unverified | 0 |
| From Supervised to Generative: A Novel Paradigm for Tabular Deep Learning with Large Language Models | Oct 11, 2023 | In-Context LearningInstruction Following | CodeCode Available | 0 |