| Block-Recurrent Transformers | Mar 11, 2022 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Huatuo-26M, a Large-scale Chinese Medical QA Dataset | May 2, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Ignore Previous Prompt: Attack Techniques For Language Models | Nov 17, 2022 | Adversarial Attack, Adversarial Text | Code Available | 2 | 5 |
| GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | May 11, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Grounding Language Models to Images for Multimodal Inputs and Outputs | Jan 31, 2023 | Image Retrieval, In-Context Learning | Code Available | 2 | 5 |
| OpenICL: An Open-Source Framework for In-context Learning | Mar 6, 2023 | In-Context Learning, Language Modeling | Code Available | 2 | 5 |
| SOLO: A Single Transformer for Scalable Vision-Language Modeling | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Mar 13, 2025 | Diversity, Language Modeling | Code Available | 2 | 5 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Mar 29, 2024 | Hallucination, Image Captioning | Code Available | 2 | 5 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction Following, Language Modeling | Code Available | 2 | 5 |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Oct 5, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Graph Language Models | Jan 13, 2024 | Knowledge Graphs, Language Modeling | Code Available | 2 | 5 |
| AdaFisher: Adaptive Second Order Optimization via Fisher Information | May 26, 2024 | Computational Efficiency, image-classification | Code Available | 2 | 5 |
| Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer | Jan 23, 2017 | Computational Efficiency, GPU | Code Available | 2 | 5 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question Answering, Instruction Following | Code Available | 2 | 5 |
| SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector | May 8, 2024 | Change Detection, Language Modeling | Code Available | 2 | 5 |
| DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings | Apr 21, 2022 | Contrastive Learning, Language Modeling | Code Available | 2 | 5 |
| Grounded 3D-LLM with Referent Tokens | May 16, 2024 | Dense Captioning, Diversity | Code Available | 2 | 5 |
| Differential Transformer | Oct 7, 2024 | Hallucination, In-Context Learning | Code Available | 2 | 5 |
| PALO: A Polyglot Large Multimodal Model for 5B People | Feb 22, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GPT or BERT: why not both? | Oct 31, 2024 | Causal Language Modeling, Language Modeling | Code Available | 2 | 5 |
| GPT Understands, Too | Mar 18, 2021 | Knowledge Probing, Language Modeling | Code Available | 2 | 5 |
| GPT Can Solve Mathematical Problems Without a Calculator | Sep 6, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Oct 15, 2020 | image-classification, Image Classification | Code Available | 2 | 5 |
| GPT-Driver: Learning to Drive with GPT | Oct 2, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 | 5 |