| LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression | Oct 10, 2023 | Code Completion, Few-Shot Learning | Code Available | 5 |
| LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models | Oct 9, 2023 | GSM8K, In-Context Learning | Code Available | 5 |
| EasyPhoto: Your Smart AI Photo Generator | Oct 7, 2023 | | Code Available | 5 |
| ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models | Oct 6, 2023 | | Code Available | 5 |
| Efficient Streaming Language Models with Attention Sinks | Sep 29, 2023 | Language Modeling, Language Modelling | Code Available | 5 |
| YOLOR-Based Multi-Task Learning | Sep 29, 2023 | Image Captioning, Instance Segmentation | Code Available | 5 |
| QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models | Sep 26, 2023 | Quantization | Code Available | 5 |
| DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention | Sep 25, 2023 | Language Modeling, Language Modelling | Code Available | 5 |
| ChatGPT MT: Competitive for High- (but not Low-) Resource Languages | Sep 14, 2023 | Machine Translation | Code Available | 5 |
| The Rise and Potential of Large Language Model Based Agents: A Survey | Sep 14, 2023 | Language Modeling, Language Modelling | Code Available | 5 |
| Agents: An Open-source Framework for Autonomous Language Agents | Sep 14, 2023 | | Code Available | 5 |
| ImageBind-LLM: Multi-modality Instruction Tuning | Sep 7, 2023 | Instruction Following, Text Generation | Code Available | 5 |
| ProPainter: Improving Propagation and Transformer for Video Inpainting | Sep 7, 2023 | Optical Flow Estimation, Video Inpainting | Code Available | 5 |
| Data-Juicer: A One-Stop Data Processing System for Large Language Models | Sep 5, 2023 | Distributed Computing | Code Available | 5 |
| Nougat: Neural Optical Understanding for Academic Documents | Aug 25, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 5 |
| Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond | Aug 24, 2023 | Chart Question Answering, FS-MEVQA | Code Available | 5 |
| WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct | Aug 18, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 5 |
| IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models | Aug 13, 2023 | Diffusion Personalization Tuning Free, Image Generation | Code Available | 5 |
| ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs | Jul 31, 2023 | Trajectory Planning, Zero-shot Generalization | Code Available | 5 |
| MMBench: Is Your Multi-modal Model an All-around Player? | Jul 12, 2023 | All, Instruction Following | Code Available | 5 |
| ReLoRA: High-Rank Training Through Low-Rank Updates | Jul 11, 2023 | GPU | Code Available | 5 |
| Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Jun 28, 2023 | Hallucination, Knowledge Graphs | Code Available | 5 |
| Faster Segment Anything: Towards Lightweight SAM for Mobile Applications | Jun 25, 2023 | CPU, Decoder | Code Available | 5 |
| LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models | Jun 21, 2023 | | Code Available | 5 |
| Infinite Photorealistic Worlds using Procedural Generation | Jun 15, 2023 | 3D Reconstruction, object-detection | Code Available | 5 |
| WizardCoder: Empowering Code Large Language Models with Evol-Instruct | Jun 14, 2023 | Code Generation, HumanEval | Code Available | 5 |
| StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models | Jun 13, 2023 | Speech Synthesis, text-to-speech | Code Available | 5 |
| Image Vectorization: a Review | Jun 10, 2023 | Image Generation, Vector Graphics | Code Available | 5 |
| XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech | May 31, 2023 | text-to-speech, Text to Speech | Code Available | 5 |
| Voyager: An Open-Ended Embodied Agent with Large Language Models | May 25, 2023 | Lifelong learning, Minecraft | Code Available | 5 |
| Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training | May 23, 2023 | Contrastive Learning, Self-Supervised Learning | Code Available | 5 |
| Tree of Thoughts: Deliberate Problem Solving with Large Language Models | May 17, 2023 | Arithmetic Reasoning, Decision Making | Code Available | 5 |
| ImageBind: One Embedding Space To Bind Them All | May 9, 2023 | All, Cross-Modal Retrieval | Code Available | 5 |
| StarCoder: may the source be with you! | May 9, 2023 | 8k, Code Generation | Code Available | 5 |
| CodeGen2: Lessons for Training LLMs on Programming and Natural Languages | May 3, 2023 | Causal Language Modeling, Decoder | Code Available | 5 |
| LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model | Apr 28, 2023 | Instruction Following, model | Code Available | 5 |
| WizardLM: Empowering Large Language Models to Follow Complex Instructions | Apr 24, 2023 | Instruction Following | Code Available | 5 |
| Track Anything: Segment Anything Meets Videos | Apr 24, 2023 | Image Segmentation, Object Tracking | Code Available | 5 |
| Long-term Forecasting with TiDE: Time-series Dense Encoder | Apr 17, 2023 | Anomaly Detection, Decoder | Code Available | 5 |
| Tool Learning with Foundation Models | Apr 17, 2023 | | Code Available | 5 |
| Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation | Apr 16, 2023 | Instruction Following | Code Available | 5 |
| Inpaint Anything: Segment Anything Meets Image Inpainting | Apr 13, 2023 | Image Inpainting | Code Available | 5 |
| RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment | Apr 13, 2023 | Ethics | Code Available | 5 |
| Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond | Apr 11, 2023 | Text to 3D | Code Available | 5 |
| How to Design Translation Prompts for ChatGPT: An Empirical Study | Apr 5, 2023 | Machine Translation, Natural Language Understanding | Code Available | 5 |
| Segment Anything | Apr 5, 2023 | Event-based Object Segmentation, Image Segmentation | Code Available | 5 |
| Assessing Language Model Deployment with Risk Cards | Mar 31, 2023 | Language Modeling, Language Modelling | Code Available | 5 |
| CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X | Mar 30, 2023 | Benchmarking, Code Generation | Code Available | 5 |
| LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention | Mar 28, 2023 | Instruction Following, Language Modelling | Code Available | 5 |
| Does 'Deep Learning on a Data Diet' reproduce? Overall yes, but GraNd at Initialization does not | Mar 26, 2023 | | Code Available | 5 |