| Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language | Apr 28, 2025 | Continual PretrainingGPU | —Unverified | 0 |
| PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Apr 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CodeBC: A More Secure Large Language Model for Smart Contract Code Generation in Blockchain | Apr 28, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination | Apr 28, 2025 | Code GenerationHallucination | —Unverified | 0 |
| Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing | Apr 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GenTorrent: Scaling Large Language Model Serving with An Overley Network | Apr 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Practical Second-Order Optimizers in Deep Learning: Insights from Fisher Information Analysis | Apr 26, 2025 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Improving Language Model Personas via Rationalization with Psychological Scaffolds | Apr 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fast-Slow Thinking for Large Vision-Language Model Reasoning | Apr 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method | Apr 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| The Big Send-off: High Performance Collectives on GPU-based Supercomputers | Apr 25, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Exploring a Large Language Model for Transforming Taxonomic Data into OWL: Lessons Learned and Implications for Ontology Development | Apr 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SMARTFinRAG: Interactive Modularized Financial RAG Benchmark | Apr 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation | Apr 24, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Does Knowledge Distillation Matter for Large Language Model based Bundle Generation? | Apr 24, 2025 | In-Context LearningKnowledge Distillation | —Unverified | 0 |
| TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | Apr 24, 2025 | Caption GenerationDense Video Captioning | —Unverified | 0 |
| Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code | Apr 24, 2025 | Code SearchLanguage Modeling | —Unverified | 0 |
| Automatically Generating Rules of Malicious Software Packages via Large Language Model | Apr 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FashionM3: Multimodal, Multitask, and Multiround Fashion Assistant based on Unified Vision-Language Model | Apr 24, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion | Apr 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Monte Carlo Planning with Large Language Model for Text-Based Game Agents | Apr 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ParamΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost | Apr 23, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Planning with Diffusion Models for Target-Oriented Dialogue Systems | Apr 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SplitReason: Learning To Offload Reasoning | Apr 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| In-Context Learning can distort the relationship between sequence likelihoods and biological fitness | Apr 23, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |