| Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval | Mar 22, 2023 | Image-text matching, Language Modeling | Code Available | 2 |
| Implicit Neural Representation for Cooperative Low-light Image Enhancement | Mar 21, 2023 | Image Enhancement, Language Modeling | Code Available | 2 |
| Large Language Model Instruction Following: A Survey of Progresses and Challenges | Mar 18, 2023 | Instruction Following, Language Modeling | Code Available | 2 |
| Stabilizing Transformer Training by Preventing Attention Entropy Collapse | Mar 11, 2023 | Automatic Speech Recognition, image-classification | Code Available | 2 |
| PaLM-E: An Embodied Multimodal Language Model | Mar 6, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| OpenICL: An Open-Source Framework for In-context Learning | Mar 6, 2023 | In-Context Learning, Language Modeling | Code Available | 2 |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Feb 27, 2023 | Dense Video Captioning, Language Modeling | Code Available | 2 |
| SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks | Feb 27, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Language Model Crossover: Variation through Few-Shot Prompting | Feb 23, 2023 | In-Context Learning, Language Modeling | Code Available | 2 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Feb 21, 2023 | 2k, 8k | Code Available | 2 |