| COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection | Nov 28, 2024 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning | Jan 19, 2021 | reinforcement-learningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Improving Diffusion Models for Scene Text Editing with Dual Encoders | Apr 12, 2023 | Scene Text EditingStyle Transfer | CodeCode Available | 1 | 5 |
| Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In | May 27, 2023 | MMLURetrieval | CodeCode Available | 1 | 5 |
| Equivariant Image Modeling | Mar 24, 2025 | Image GenerationZero-shot Generalization | CodeCode Available | 1 | 5 |
| Improving Zero-shot Generalization and Robustness of Multi-modal Models | Dec 4, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| A Universal Discriminator for Zero-Shot Generalization | Nov 15, 2022 | Zero-shot Generalization | CodeCode Available | 1 | 5 |
| FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation | Mar 12, 2024 | DiagnosticImage Segmentation | CodeCode Available | 1 | 5 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation | Dec 12, 2023 | Anomaly DetectionAutonomous Driving | CodeCode Available | 1 | 5 |