| Android in the Zoo: Chain-of-Action-Thought for GUI Agents | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | May 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Mar 29, 2024 | HallucinationImage Captioning | CodeCode Available | 2 | 5 |
| Composed Image Retrieval for Remote Sensing | May 24, 2024 | Composed Image Retrieval (CoIR)Descriptive | CodeCode Available | 2 | 5 |
| AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Oct 15, 2020 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Grounded 3D-LLM with Referent Tokens | May 16, 2024 | Dense CaptioningDiversity | CodeCode Available | 2 | 5 |
| Grounding Language Models to Images for Multimodal Inputs and Outputs | Jan 31, 2023 | Image RetrievalIn-Context Learning | CodeCode Available | 2 | 5 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question AnsweringInstruction Following | CodeCode Available | 2 | 5 |
| Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Jun 22, 2025 | DecoderImage Segmentation | CodeCode Available | 2 | 5 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 | 5 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Mar 13, 2025 | DiversityLanguage Modeling | CodeCode Available | 2 | 5 |
| An Egocentric Vision-Language Model based Portable Real-time Smart Assistant | Mar 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation | May 20, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 | 5 |
| GPT or BERT: why not both? | Oct 31, 2024 | Causal Language ModelingLanguage Modeling | CodeCode Available | 2 | 5 |
| PromptPex: Automatic Test Generation for Language Model Prompts | Mar 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench | Oct 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| GPT-Driver: Learning to Drive with GPT | Oct 2, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 | 5 |
| GPT Understands, Too | Mar 18, 2021 | Knowledge ProbingLanguage Modeling | CodeCode Available | 2 | 5 |
| Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models | Jun 5, 2024 | DiversityLanguage Modeling | CodeCode Available | 2 | 5 |
| P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks | Oct 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Oct 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model | Jun 13, 2024 | DiagnosticImage Retrieval | CodeCode Available | 2 | 5 |
| Granite Guardian | Dec 10, 2024 | HallucinationLanguage Modeling | CodeCode Available | 2 | 5 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | AttributeCommon Sense Reasoning | CodeCode Available | 2 | 5 |
| GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction | May 30, 2023 | Image GenerationInstruction Following | CodeCode Available | 2 | 5 |