| Robotouille: An Asynchronous Planning Benchmark for LLM Agents | Feb 6, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Intent Representation Learning with Large Language Model for Recommendation | Feb 5, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications | Feb 5, 2025 | In-Context Learning, Language Modeling | Code Available | 1 |
| CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing | Feb 4, 2025 | Collaborative Inference, Language Modeling | Code Available | 1 |
| Simulating Rumor Spreading in Social Networks using LLM Agents | Feb 3, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Learning to Generate Unit Tests for Automated Debugging | Feb 3, 2025 | HumanEval, Large Language Model | Code Available | 1 |
| Speculative Ensemble: Fast Large Language Model Ensemble via Speculation | Feb 1, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings | Jan 28, 2025 | Denoising, Domain Generalization | Code Available | 1 |
| PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures | Jan 25, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Jan 24, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Glinthawk: A Two-Tiered Architecture for Offline LLM Inference | Jan 20, 2025 | CPU, Language Modeling | Code Available | 1 |
| EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Jan 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| MedFILIP: Medical Fine-grained Language-Image Pre-training | Jan 18, 2025 | Contrastive Learning, Diagnostic | Code Available | 1 |
| When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis | Jan 17, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Jan 16, 2025 | AudioCaps, Audio captioning | Code Available | 1 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Gandalf the Red: Adaptive Security for LLMs | Jan 14, 2025 | Blocking, Language Modeling | Code Available | 1 |
| Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping | Jan 11, 2025 | GPU, Large Language Model | Code Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band Gap, Language Modeling | Code Available | 1 |
| CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models | Jan 2, 2025 | Benchmarking, Computer Security | Code Available | 1 |
| SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis | Jan 1, 2025 | Large Language Model | Code Available | 1 |
| Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering | Jan 1, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Dec 31, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Facilitating large language model Russian adaptation with Learned Embedding Propagation | Dec 30, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense | Dec 30, 2024 | Cloud Computing, Code Generation | Code Available | 1 |
| An Engorgio Prompt Makes Large Language Model Babble on | Dec 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Dec 24, 2024 | Autonomous Driving, Dataset Generation | Code Available | 1 |
| Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Dec 23, 2024 | ArabicMMLU, Dialect Identification | Code Available | 1 |
| Brain-to-Text Benchmark '24: Lessons Learned | Dec 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model | Dec 22, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection | Dec 20, 2024 | Cancer Classification, Chatbot | Code Available | 1 |
| Autonomous Microscopy Experiments through Large Language Model Agents | Dec 18, 2024 | Benchmarking, Experimental Design | Code Available | 1 |
| SnakModel: Lessons Learned from Training an Open Danish Large Language Model | Dec 17, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| IDEA-Bench: How Far are Generative Models from Professional Designing? | Dec 16, 2024 | Large Language Model, Multimodal Large Language Model | Code Available | 1 |
| LLMs Can Simulate Standardized Patients via Agent Coevolution | Dec 16, 2024 | Diagnostic, Language Modeling | Code Available | 1 |
| Large Language Models as Realistic Microservice Trace Generators | Dec 16, 2024 | Graph Generation, Language Modeling | Code Available | 1 |
| Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning | Dec 15, 2024 | Decision Making, Large Language Model | Code Available | 1 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations | Dec 12, 2024 | Fairness, Language Modeling | Code Available | 1 |
| Concept Bottleneck Large Language Models | Dec 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis | Dec 11, 2024 | Continual Pretraining, Language Modeling | Code Available | 1 |
| Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Dec 10, 2024 | Bayesian Optimization, Language Modeling | Code Available | 1 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| MISR: Measuring Instrumental Self-Reasoning in Frontier Models | Dec 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning | Dec 5, 2024 | Large Language Model, Meta Reinforcement Learning | Code Available | 1 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| Free and Customizable Code Documentation with LLMs: A Fine-Tuning Approach | Dec 1, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Multi-label Sequential Sentence Classification via Large Language Model | Nov 23, 2024 | Contrastive Learning, Extractive Summarization | Code Available | 1 |
| Planning-Driven Programming: A Large Language Model Programming Workflow | Nov 21, 2024 | Code Generation, HumanEval | Code Available | 1 |