| GuardT2I: Defending Text-to-Image Models from Adversarial Prompts | Mar 3, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 3 |
| IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Mar 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| OpenGraph: Towards Open Graph Foundation Models | Mar 2, 2024 | Data AugmentationGraph Learning | CodeCode Available | 3 |
| RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Feb 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Diffusion Language Models Are Versatile Protein Learners | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition | Feb 27, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 3 |
| ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Feb 27, 2024 | 3D geometry3D Object Captioning | CodeCode Available | 3 |
| Cleaner Pretraining Corpus Curation with Neural Web Scraping | Feb 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Towards Building Multilingual Language Model for Medicine | Feb 21, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 3 |
| Query-Based Adversarial Prompt Generation | Feb 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Feb 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search | Feb 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Feb 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey | Feb 8, 2024 | ArticlesEntity Alignment | CodeCode Available | 3 |
| Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Feb 8, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 |
| AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Feb 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Feb 6, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 3 |
| BlackMamba: Mixture of Experts for State-Space Models | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Evaluating Language Model Agency through Negotiations | Jan 9, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model | Jan 4, 2024 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 3 |
| LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Jan 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones | Dec 28, 2023 | Computational EfficiencyImage Captioning | CodeCode Available | 3 |
| MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Dec 28, 2023 | AutoMLCPU | CodeCode Available | 3 |
| SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling | Dec 23, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 3 |
| Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Dec 1, 2023 | Contrastive LearningFew-Shot Learning | CodeCode Available | 3 |
| Taiwan LLM: Bridging the Linguistic Divide with a Culturally Aligned Language Model | Nov 29, 2023 | DiversityLanguage Modeling | CodeCode Available | 3 |
| Language Model Inversion | Nov 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Large Language Model based Long-tail Query Rewriting in Taobao Search | Nov 7, 2023 | Contrastive LearningLanguage Modeling | CodeCode Available | 3 |
| Skywork: A More Open Bilingual Foundation Model | Oct 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SkyMath: Technical Report | Oct 25, 2023 | GSM8KLanguage Modeling | CodeCode Available | 3 |
| Llemma: An Open Language Model For Mathematics | Oct 16, 2023 | Arithmetic ReasoningAutomated Theorem Proving | CodeCode Available | 3 |
| OceanGPT: A Large Language Model for Ocean Science Tasks | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Data Filtering Networks | Sep 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model | Sep 20, 2023 | 8kLanguage Modeling | CodeCode Available | 3 |
| Retentive Network: A Successor to Transformer for Large Language Models | Jul 17, 2023 | GPULanguage Modeling | CodeCode Available | 3 |
| MotionGPT: Human Motion as a Foreign Language | Jun 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration | Jun 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| HuatuoGPT, towards Taming Language Model to Be a Doctor | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Hierarchical Prompting Assists Large Language Model on Web Navigation | May 23, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 3 |
| WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia | May 23, 2023 | ChatbotHallucination | CodeCode Available | 3 |
| Self-QA: Unsupervised Knowledge Guided Language Model Alignment | May 19, 2023 | DiversityLanguage Modeling | CodeCode Available | 3 |
| SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities | May 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification | May 16, 2023 | DecoderLanguage Modeling | CodeCode Available | 3 |
| MultiModal-GPT: A Vision and Language Model for Dialogue with Humans | May 8, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 3 |
| REPLUG: Retrieval-Augmented Black-Box Language Models | Jan 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| ThoughtSource: A central hub for large language model reasoning data | Jan 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Cramming: Training a Language Model on a Single GPU in One Day | Dec 28, 2022 | GPULanguage Modeling | CodeCode Available | 3 |