| Hallucinations in Large Multilingual Translation Models | Mar 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Oct 22, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 |
| Housekeep: Tidying Virtual Households using Commonsense Reasoning | May 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Grounding Language Models for Visual Entity Recognition | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation | Feb 18, 2025 | Collaborative FilteringExplainable Recommendation | CodeCode Available | 1 |
| Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos | Aug 26, 2024 | FormLanguage Modelling | CodeCode Available | 1 |
| MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation | Dec 28, 2023 | GSM8KLanguage Model Evaluation | CodeCode Available | 1 |
| MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Jun 23, 2025 | DiagnosticLarge Language Model | CodeCode Available | 1 |
| GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Oct 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Large Language Model Agents Balance Energy Systems? | Feb 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enabling LLM Knowledge Analysis via Extensive Materialization | Nov 7, 2024 | Knowledge Base ConstructionLarge Language Model | CodeCode Available | 1 |
| MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR Prediction | Oct 25, 2022 | AllClick-Through Rate Prediction | CodeCode Available | 1 |
| AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein Engineering | Nov 7, 2024 | AutoMLHyperparameter Optimization | CodeCode Available | 1 |
| GraphLLM: Boosting Graph Reasoning Ability of Large Language Model | Oct 9, 2023 | Graph LearningLanguage Modeling | CodeCode Available | 1 |
| Hallucination Augmented Contrastive Learning for Multimodal Large Language Model | Dec 12, 2023 | Contrastive LearningHallucination | CodeCode Available | 1 |
| Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | Mar 18, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint | Nov 27, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Jan 20, 2025 | CPULanguage Modeling | CodeCode Available | 1 |
| GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text | Aug 14, 2023 | Drug DiscoveryImage Captioning | CodeCode Available | 1 |
| Autonomous Microscopy Experiments through Large Language Model Agents | Dec 18, 2024 | BenchmarkingExperimental Design | CodeCode Available | 1 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Jul 12, 2024 | Collaborative InferenceLanguage Modelling | CodeCode Available | 1 |
| GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | May 27, 2025 | 8kAvg | CodeCode Available | 1 |
| GeoGalactica: A Scientific Large Language Model in Geoscience | Dec 31, 2023 | Document ClassificationGeneral Knowledge | CodeCode Available | 1 |