| BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation | Oct 8, 2024 | Language Modeling | Code Available | 2 |
| Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM | Jun 18, 2024 | Anomaly Detection, Anomaly Localization | Code Available | 2 |
| AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model | Aug 2, 2022 | Causal Language Modeling, Common Sense Reasoning | Code Available | 2 |
| GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | May 11, 2025 | Language Modeling | Code Available | 2 |
| Grounding Language Models to Images for Multimodal Inputs and Outputs | Jan 31, 2023 | Image Retrieval, In-Context Learning | Code Available | 2 |
| Causal Agent based on Large Language Model | Aug 13, 2024 | Language Modeling | Code Available | 2 |
| Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning | Sep 19, 2024 | Language Modeling | Code Available | 2 |
| Adapting a Language Model While Preserving its General Knowledge | Jan 21, 2023 | Continual Learning, General Knowledge | Code Available | 2 |
| Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications | Sep 11, 2023 | Language Modeling | Code Available | 2 |
| VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark | Mar 12, 2024 | Knowledge Editing, Language Modeling | Code Available | 2 |
| Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model | Mar 6, 2025 | General Knowledge, Image Captioning | Code Available | 2 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Mar 13, 2025 | Diversity, Language Modeling | Code Available | 2 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Mar 29, 2024 | Hallucination, Image Captioning | Code Available | 2 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| A Length-Extrapolatable Transformer | Dec 20, 2022 | Language Modeling | Code Available | 2 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question Answering, Instruction Following | Code Available | 2 |
| Grounded 3D-LLM with Referent Tokens | May 16, 2024 | Dense Captioning, Diversity | Code Available | 2 |
| How to Index Item IDs for Recommendation Foundation Models | May 11, 2023 | Language Modeling | Code Available | 2 |
| In-Context Retrieval-Augmented Language Models | Jan 31, 2023 | Language Modeling | Code Available | 2 |
| LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models | Mar 4, 2022 | Decoder, GPU | Code Available | 2 |
| GPT Can Solve Mathematical Problems Without a Calculator | Sep 6, 2023 | Language Modeling | Code Available | 2 |
| Black-Box Tuning for Language-Model-as-a-Service | Jan 10, 2022 | In-Context Learning, Language Modeling | Code Available | 2 |
| Characterization of Large Language Model Development in the Datacenter | Mar 12, 2024 | GPU, Language Modeling | Code Available | 2 |
| Block-Recurrent Transformers | Mar 11, 2022 | Language Modeling | Code Available | 2 |
| GPT-Driver: Learning to Drive with GPT | Oct 2, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |