| Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding | Nov 14, 2023 | Image-based Generative Performance BenchmarkingLanguage Modeling | CodeCode Available | 2 |
| Tamil-Llama: A New Tamil Language Model Based on Llama 2 | Nov 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Nov 9, 2023 | Autonomous DrivingCommon Sense Reasoning | CodeCode Available | 2 |
| BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings | Nov 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Large Trajectory Models are Scalable Motion Predictors and Planners | Oct 30, 2023 | Autonomous DrivingLanguage Modeling | CodeCode Available | 2 |
| Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution | Oct 25, 2023 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning | Oct 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain | Oct 22, 2023 | Dialogue GenerationDialogue Understanding | CodeCode Available | 2 |
| Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture | Oct 18, 2023 | 4kimage-classification | CodeCode Available | 2 |
| BitNet: Scaling 1-bit Transformers for Large Language Models | Oct 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLark: A Multimodal Instruction-Following Language Model for Music | Oct 11, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Making Large Language Models Perform Better in Knowledge Graph Completion | Oct 10, 2023 | In-Context LearningKnowledge Graph Completion | CodeCode Available | 2 |
| OptiMUS: Optimization Modeling Using MIP Solvers and large language models | Oct 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Oct 5, 2023 | Event Argument ExtractionEvent Extraction | CodeCode Available | 2 |
| Ring Attention with Blockwise Transformers for Near-Infinite Context | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| GPT-Driver: Learning to Drive with GPT | Oct 2, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| RLLTE: Long-Term Evolution Project of Reinforcement Learning | Sep 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Effective Long-Context Scaling of Foundation Models | Sep 27, 2023 | Continual PretrainingLanguage Modeling | CodeCode Available | 2 |
| MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models | Sep 21, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 |
| LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Sep 21, 2023 | 3D visual groundingLanguage Modeling | CodeCode Available | 2 |
| OWL: A Large Language Model for IT Operations | Sep 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Sep 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning | Sep 14, 2023 | HallucinationIn-Context Learning | CodeCode Available | 2 |
| Unified Human-Scene Interaction via Prompted Chain-of-Contacts | Sep 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications | Sep 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Automated Bioinformatics Analysis via AutoBA | Sep 6, 2023 | AI AgentLanguage Modeling | CodeCode Available | 2 |
| GPT Can Solve Mathematical Problems Without a Calculator | Sep 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning | Sep 5, 2023 | DecoderImage Generation | CodeCode Available | 2 |
| Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following | Sep 1, 2023 | 3D Generation3D Question Answering (3D-QA) | CodeCode Available | 2 |
| SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models | Aug 31, 2023 | DecoderLanguage Modeling | CodeCode Available | 2 |
| LLaSM: Large Language and Speech Model | Aug 30, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| DTrOCR: Decoder-only Transformer for Optical Character Recognition | Aug 30, 2023 | DecoderHandwritten Text Recognition | CodeCode Available | 2 |
| SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding | Aug 21, 2023 | Entity TypingEvent Extraction | CodeCode Available | 2 |
| Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes | Aug 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Language is All a Graph Needs | Aug 14, 2023 | AllGraph Learning | CodeCode Available | 2 |
| SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool | Aug 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Shepherd: A Critic for Language Model Generation | Aug 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Aug 8, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 2 |
| Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue | Aug 7, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Spanish Pre-trained BERT Model and Evaluation Data | Aug 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education | Aug 5, 2023 | ChatbotLanguage Modeling | CodeCode Available | 2 |
| LP-MusicCaps: LLM-Based Pseudo Music Captioning | Jul 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation | Jul 27, 2023 | 3D geometryFew-Shot Learning | CodeCode Available | 2 |
| TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer | Jul 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models | Jul 24, 2023 | Image GenerationImage-text matching | CodeCode Available | 2 |
| FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets | Jul 20, 2023 | Instruction FollowingLanguage Model Evaluation | CodeCode Available | 2 |
| DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI | Jul 19, 2023 | Conversational RecommendationDiversity | CodeCode Available | 2 |
| Planting a SEED of Vision in Large Language Model | Jul 16, 2023 | Image GenerationImage to text | CodeCode Available | 2 |