| HGRN2: Gated Linear RNNs with State Expansion | Apr 11, 2024 | Image Classification, Language Modeling | Code Available | 2 | 5 |
| Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Feb 16, 2025 | Decision Making, Language Modeling | Code Available | 2 | 5 |
| Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Sep 15, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM | Jun 18, 2024 | Anomaly Detection, Anomaly Localization | Code Available | 2 | 5 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPU, Language Modeling | Code Available | 2 | 5 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Mar 29, 2024 | Hallucination, Image Captioning | Code Available | 2 | 5 |
| How to Index Item IDs for Recommendation Foundation Models | May 11, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Grounding Language Models to Images for Multimodal Inputs and Outputs | Jan 31, 2023 | Image Retrieval, In-Context Learning | Code Available | 2 | 5 |
| Drive Like a Human: Rethinking Autonomous Driving with Large Language Models | Jul 14, 2023 | Autonomous Driving, Common Sense Reasoning | Code Available | 2 | 5 |
| DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Jun 5, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 | 5 |
| Grounded 3D-LLM with Referent Tokens | May 16, 2024 | Dense Captioning, Diversity | Code Available | 2 | 5 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Mar 13, 2025 | Diversity, Language Modeling | Code Available | 2 | 5 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | Decoder, Information Retrieval | Code Available | 2 | 5 |
| Causal Agent based on Large Language Model | Aug 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Cedille: A large autoregressive French language model | Feb 7, 2022 | Few-Shot Learning, Language Modeling | Code Available | 2 | 5 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction Following, Language Modeling | Code Available | 2 | 5 |
| Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers | Jan 4, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Granite Guardian | Dec 10, 2024 | Hallucination, Language Modeling | Code Available | 2 | 5 |
| Ring Attention with Blockwise Transformers for Near-Infinite Context | Oct 3, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Graph Language Models | Jan 13, 2024 | Knowledge Graphs, Language Modeling | Code Available | 2 | 5 |
| RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Jun 20, 2023 | Cross-Modal Retrieval, Image Retrieval | Code Available | 2 | 5 |
| GPT or BERT: why not both? | Oct 31, 2024 | Causal Language Modeling, Language Modeling | Code Available | 2 | 5 |
| SOLO: A Single Transformer for Scalable Vision-Language Modeling | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GPT Understands, Too | Mar 18, 2021 | Knowledge Probing, Language Modeling | Code Available | 2 | 5 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question Answering, Instruction Following | Code Available | 2 | 5 |
| GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | May 11, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Huatuo-26M, a Large-scale Chinese Medical QA Dataset | May 2, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Implicit Neural Representation for Cooperative Low-light Image Enhancement | Mar 21, 2023 | Image Enhancement, Language Modeling | Code Available | 2 | 5 |
| Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Oct 5, 2023 | Event Argument Extraction, Event Extraction | Code Available | 2 | 5 |
| GOFA: A Generative One-For-All Model for Joint Graph Language Modeling | Jul 12, 2024 | All, Language Modeling | Code Available | 2 | 5 |
| ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis | Aug 16, 2024 | Contrastive Learning, Diagnostic | Code Available | 2 | 5 |
| Compression Represents Intelligence Linearly | Apr 15, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting | Jun 22, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GODEL: Large-Scale Pre-Training for Goal-Directed Dialog | Jun 22, 2022 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 | 5 |
| EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education | Aug 5, 2023 | Chatbot, Language Modeling | Code Available | 2 | 5 |
| DiffArtist: Towards Structure and Appearance Controllable Image Stylization | Jul 22, 2024 | Disentanglement, Image Stylization | Code Available | 2 | 5 |
| Scaling Transformer to 1M tokens and beyond with RMT | Apr 19, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Empirical Asset Pricing with Large Language Model Agents | Sep 25, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Nov 21, 2024 | Decision Making, Language Modeling | Code Available | 2 | 5 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | Attribute, Common Sense Reasoning | Code Available | 2 | 5 |
| Scene Text Recognition with Permuted Autoregressive Sequence Models | Jul 14, 2022 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Composed Image Retrieval for Remote Sensing | May 24, 2024 | Composed Image Retrieval (CoIR), Descriptive | Code Available | 2 | 5 |
| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | Decoder, Image Captioning | Code Available | 2 | 5 |
| Characterization of Large Language Model Development in the Datacenter | Mar 12, 2024 | GPU, Language Modeling | Code Available | 2 | 5 |
| GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Feb 4, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | May 30, 2025 | Classification, Disaster Response | Code Available | 2 | 5 |
| GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Nov 24, 2023 | Instruction Following, Language Modeling | Code Available | 2 | 5 |