| Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management | Apr 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation | Apr 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory Accelerator | Apr 19, 2025 | Large Language Model | CodeCode Available | 0 |
| FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference | Apr 19, 2025 | Large Language ModelQuantization | —Unverified | 0 |
| Manipulating Multimodal Agents via Cross-Modal Prompt Injection | Apr 19, 2025 | Large Language Model | —Unverified | 0 |
| Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations | Apr 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model Enhanced Particle Swarm Optimization for Hyperparameter Tuning for Deep Learning Models | Apr 19, 2025 | Deep LearningLanguage Modeling | —Unverified | 0 |
| High-Throughput LLM inference on Heterogeneous Clusters | Apr 18, 2025 | Large Language ModelScheduling | —Unverified | 0 |
| Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods | Apr 18, 2025 | Large Language Model | —Unverified | 0 |
| Large Language Bayes | Apr 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |