| Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data Processing | Nov 22, 2024 | Computational EfficiencyCPU | CodeCode Available | 3 |
| Deep operator network models for predicting post-burn contraction | Nov 21, 2024 | CPUGPU | —Unverified | 0 |
| Generative AI on the Edge: Architecture and Performance Evaluation | Nov 18, 2024 | CPURaspberry Pi 5 | —Unverified | 0 |
| Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations | Nov 18, 2024 | CPU | —Unverified | 0 |
| MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs | Nov 18, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Towards Accurate and Efficient Sub-8-Bit Integer Training | Nov 17, 2024 | CPUGPU | —Unverified | 0 |
| Pie: Pooling CPU Memory for LLM Inference | Nov 14, 2024 | CPUGPU | —Unverified | 0 |
| Offline Adaptation of Quadruped Locomotion using Diffusion Models | Nov 13, 2024 | CPU | CodeCode Available | 0 |
| Input-Based Ensemble-Learning Method for Dynamic Memory Configuration of Serverless Computing Functions | Nov 12, 2024 | CPUEnsemble Learning | —Unverified | 0 |
| TinyML Security: Exploring Vulnerabilities in Resource-Constrained Machine Learning Systems | Nov 11, 2024 | CPUEdge-computing | —Unverified | 0 |