| Large Model Based Referring Camouflaged Object Detection | Nov 28, 2023 | modelObject | —Unverified | 0 |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | Nov 28, 2023 | BenchmarkingInformation Retrieval | —Unverified | 0 |
| C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing | Nov 27, 2023 | Language ModellingPrompt Learning | —Unverified | 0 |
| VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning | Nov 25, 2023 | DecoderModel Optimization | CodeCode Available | 1 |
| A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with Dynamic Obstacle Trajectory Prediction and Its Application with LLMs | Nov 21, 2023 | object-detectionObject Detection | —Unverified | 0 |
| Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders | Nov 16, 2023 | Data AugmentationDomain Generalization | CodeCode Available | 1 |
| Neural-Logic Human-Object Interaction Detection | Nov 16, 2023 | DecoderHuman-Object Interaction Detection | CodeCode Available | 1 |
| Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts | Nov 15, 2023 | Question AnsweringSentence | CodeCode Available | 0 |
| Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual Speech | Nov 15, 2023 | Contrastive LearningCross-corpus | CodeCode Available | 0 |
| Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels | Nov 12, 2023 | PathfinderVisual Reasoning | —Unverified | 0 |