| AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers | Nov 17, 2024 | In-Context Learning, Meta-Learning | Code Available | 2 |
| KV Shifting Attention Enhances Language Modeling | Nov 29, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| Just read twice: closing the recall gap for recurrent language models | Jul 7, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution | Jun 27, 2023 | 4k, In-Context Learning | Code Available | 2 |
| Adapting Language Models to Compress Contexts | May 24, 2023 | In-Context Learning, Language Modeling | Code Available | 2 |
| Improving CLIP Training with Language Rewrites | May 31, 2023 | In-Context Learning, Sentence | Code Available | 2 |
| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Feb 5, 2024 | In-Context Learning, Metric Learning | Code Available | 2 |
| Black-Box Tuning for Language-Model-as-a-Service | Jan 10, 2022 | In-Context Learning, Language Modeling | Code Available | 2 |
| Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback | May 17, 2023 | In-Context Learning, Language Modeling | Code Available | 2 |
| Graph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via Prompt Dataset Augmented by ChatGPT | Apr 10, 2023 | Community Detection, Graph Classification | Code Available | 2 |