| Robotouille: An Asynchronous Planning Benchmark for LLM Agents | Feb 6, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications | Feb 5, 2025 | In-Context Learning, Language Modeling | Code Available | 1 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Intent Representation Learning with Large Language Model for Recommendation | Feb 5, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Feb 4, 2025 | Collaborative Inference, Language Modeling | Code Available | 1 |
| Learning to Generate Unit Tests for Automated Debugging | Feb 3, 2025 | HumanEval, Large Language Model | Code Available | 1 |
| Simulating Rumor Spreading in Social Networks using LLM Agents | Feb 3, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Speculative Ensemble: Fast Large Language Model Ensemble via Speculation | Feb 1, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings | Jan 28, 2025 | Denoising, Domain Generalization | Code Available | 1 |
| PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures | Jan 25, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Jan 24, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Jan 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Jan 20, 2025 | CPU, Language Modeling | Code Available | 1 |
| MedFILIP: Medical Fine-grained Language-Image Pre-training | Jan 18, 2025 | Contrastive Learning, Diagnostic | Code Available | 1 |
| When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis | Jan 17, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Jan 16, 2025 | AudioCaps, Audio captioning | Code Available | 1 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Gandalf the Red: Adaptive Security for LLMs | Jan 14, 2025 | Blocking, Language Modeling | Code Available | 1 |
| Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping | Jan 11, 2025 | GPU, Large Language Model | Code Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band Gap, Language Modeling | Code Available | 1 |
| CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models | Jan 2, 2025 | Benchmarking, Computer Security | Code Available | 1 |
| SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis | Jan 1, 2025 | Large Language Model | Code Available | 1 |
| Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering | Jan 1, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Dec 31, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense | Dec 30, 2024 | Cloud Computing, Code Generation | Code Available | 1 |