| Zephyr: Direct Distillation of LM Alignment | Oct 25, 2023 | 2D Cyclist DetectionFew-Shot Learning | CodeCode Available | 5 |
| CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | Oct 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Ferret: Refer and Ground Anything Anywhere at Any Granularity | Oct 11, 2023 | HallucinationLanguage Modeling | CodeCode Available | 5 |
| Efficient Streaming Language Models with Attention Sinks | Sep 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention | Sep 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| The Rise and Potential of Large Language Model Based Agents: A Survey | Sep 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond | Aug 24, 2023 | Chart Question AnsweringFS-MEVQA | CodeCode Available | 5 |
| Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Jun 28, 2023 | HallucinationKnowledge Graphs | CodeCode Available | 5 |
| CodeGen2: Lessons for Training LLMs on Programming and Natural Languages | May 3, 2023 | Causal Language ModelingDecoder | CodeCode Available | 5 |
| Assessing Language Model Deployment with Risk Cards | Mar 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 5 |