| Look Every Frame All at Once: Video-Ma^2mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing | Nov 29, 2024 | AllForm | —Unverified | 0 |
| A Simple Sparse Matrix Vector Multiplication Approach to Padded Convolution | Nov 29, 2024 | CPUGPU | CodeCode Available | 0 |
| PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers | Nov 28, 2024 | GPU | —Unverified | 0 |
| An Integrated Artificial Intelligence Operating System for Advanced Low-Altitude Aviation Applications | Nov 28, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Differentiable Topology Estimating from Curvatures for 3D Shapes | Nov 28, 2024 | GPUTopological Data Analysis | —Unverified | 0 |
| Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach | Nov 28, 2024 | GPU | —Unverified | 0 |
| Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads | Nov 28, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | Nov 28, 2024 | GPUKnowledge Distillation | —Unverified | 0 |
| A Runtime-Adaptive Transformer Neural Network Accelerator on FPGAs | Nov 27, 2024 | Computational EfficiencyCPU | CodeCode Available | 0 |
| Towards Chunk-Wise Generation for Long Videos | Nov 27, 2024 | DenoisingGPU | —Unverified | 0 |