| Neural-Logic Human-Object Interaction Detection | Nov 16, 2023 | DecoderHuman-Object Interaction Detection | CodeCode Available | 1 |
| Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders | Nov 16, 2023 | Data AugmentationDomain Generalization | CodeCode Available | 1 |
| What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning | Nov 2, 2023 | MMEVisual Reasoning | CodeCode Available | 1 |
| Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions | Nov 1, 2023 | Few-Shot NLIInstruction Following | CodeCode Available | 1 |
| Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks | Nov 1, 2023 | InformativenessOut-of-Distribution Generalization | CodeCode Available | 1 |
| Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning | Sep 30, 2023 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 1 |
| MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for Echocardiography | Sep 24, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| DePT: Decoupled Prompt Tuning | Sep 14, 2023 | Prompt EngineeringZero-shot Generalization | CodeCode Available | 1 |
| SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation | Sep 1, 2023 | Autonomous DrivingComputational Efficiency | CodeCode Available | 1 |
| TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-Shot | Aug 12, 2023 | DiagnosticInteractive Segmentation | CodeCode Available | 1 |
| Kick Back & Relax: Learning to Reconstruct the World by Watching SlowTV | Jul 20, 2023 | Depth EstimationDiversity | CodeCode Available | 1 |
| Improving Zero-Shot Generalization for CLIP with Synthesized Prompts | Jul 14, 2023 | Generalized Zero-Shot LearningTransfer Learning | CodeCode Available | 1 |
| SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation | Jul 3, 2023 | Domain AdaptationTransfer Learning | CodeCode Available | 1 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement Learning | Jun 11, 2023 | Navigatereinforcement-learning | CodeCode Available | 1 |
| Improving day-ahead Solar Irradiance Time Series Forecasting by Leveraging Spatio-Temporal Context | Jun 1, 2023 | Solar Irradiance ForecastingTime Series | CodeCode Available | 1 |
| The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only | Jun 1, 2023 | Zero-shot Generalization | CodeCode Available | 1 |
| Subequivariant Graph Reinforcement Learning in 3D Environments | May 30, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Deeply Coupled Cross-Modal Prompt Learning | May 29, 2023 | Domain AdaptationFew-Shot Learning | CodeCode Available | 1 |
| Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models | May 29, 2023 | Image CaptioningImage Classification | CodeCode Available | 1 |
| Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In | May 27, 2023 | MMLURetrieval | CodeCode Available | 1 |
| Label Agnostic Pre-training for Zero-shot Text Classification | May 25, 2023 | Classificationtext-classification | CodeCode Available | 1 |
| Prompting Language-Informed Distribution for Compositional Zero-Shot Learning | May 23, 2023 | Compositional Zero-Shot LearningInformativeness | CodeCode Available | 1 |
| Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers | May 21, 2023 | MMLUZero-shot Generalization | CodeCode Available | 1 |
| Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model | May 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| One-Prompt to Segment All Medical Images | May 17, 2023 | AllImage Segmentation | CodeCode Available | 1 |
| From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning | Apr 17, 2023 | MMLUZero-shot Generalization | CodeCode Available | 1 |
| Improving Diffusion Models for Scene Text Editing with Dual Encoders | Apr 12, 2023 | Scene Text EditingStyle Transfer | CodeCode Available | 1 |
| Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting | Apr 6, 2023 | Action RecognitionPrompt Learning | CodeCode Available | 1 |
| Towards Open-Vocabulary Video Instance Segmentation | Apr 4, 2023 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following | Feb 28, 2023 | Instruction FollowingZero-shot Generalization | CodeCode Available | 1 |
| Zero-Shot Anomaly Detection via Batch Normalization | Feb 15, 2023 | Anomaly DetectionUnsupervised Anomaly Detection | CodeCode Available | 1 |
| Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction | Jan 21, 2023 | DiversityMinecraft | CodeCode Available | 1 |
| PartDistillation: Learning Parts From Instance Segmentation | Jan 1, 2023 | Instance SegmentationObject | CodeCode Available | 1 |
| Improving Zero-shot Generalization and Robustness of Multi-modal Models | Dec 4, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| A Universal Discriminator for Zero-Shot Generalization | Nov 15, 2022 | Zero-shot Generalization | CodeCode Available | 1 |
| Learning Modular Simulations for Homogeneous Systems | Oct 28, 2022 | Zero-shot Generalization | CodeCode Available | 1 |
| MAgNet: Mesh Agnostic Neural PDE Solver | Oct 11, 2022 | Zero-shot Generalization | CodeCode Available | 1 |
| Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks | Oct 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PART: Pre-trained Authorship Representation Transformer | Sep 30, 2022 | Zero-shot Generalization | CodeCode Available | 1 |
| A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking | Jul 2, 2022 | Dialogue State TrackingIntent Classification | CodeCode Available | 1 |
| Multimodal Knowledge Alignment with Reinforcement Learning | May 25, 2022 | Audio captioningLanguage Modeling | CodeCode Available | 1 |
| What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? | Apr 12, 2022 | DecoderLanguage Modeling | CodeCode Available | 1 |
| MetaMorph: Learning Universal Controllers with Transformers | Mar 22, 2022 | Zero-shot Generalization | CodeCode Available | 1 |
| Contextualize Me -- The Case for Context in Reinforcement Learning | Feb 9, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data | Dec 15, 2021 | Audio Source SeparationAudio Tagging | CodeCode Available | 1 |
| Neural Disparity Refinement for Arbitrary Resolution Stereo | Oct 28, 2021 | Zero-shot Generalization | CodeCode Available | 1 |
| CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation | Oct 6, 2021 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| Single-dataset Experts for Multi-dataset Question Answering | Sep 28, 2021 | Question AnsweringReading Comprehension | CodeCode Available | 1 |
| RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering | Sep 17, 2021 | Entity LinkingKnowledge Base Question Answering | CodeCode Available | 1 |