| Edge-Enabled Real-time Railway Track Segmentation | Jan 21, 2024 | GPUQuantization | —Unverified | 0 |
| immrax: A Parallelizable and Differentiable Toolbox for Interval Analysis and Mixed Monotone Reachability in JAX | Jan 21, 2024 | Computational EfficiencyGPU | CodeCode Available | 1 |
| A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN | Jan 19, 2024 | GPUIntrusion Detection | —Unverified | 0 |
| Enhancing Scalability in Recommender Systems through Lottery Ticket Hypothesis and Knowledge Distillation-based Neural Network Pruning | Jan 19, 2024 | GPUKnowledge Distillation | —Unverified | 0 |
| Exact analytical algorithm for solvent accessible surface area and derivatives in implicit solvent molecular simulations on GPUs | Jan 19, 2024 | CPUGPU | —Unverified | 0 |
| Towards providing reliable job completion time predictions using PCS | Jan 18, 2024 | FairnessGPU | CodeCode Available | 0 |
| Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices | Jan 17, 2024 | Dynamic neural networksGPU | CodeCode Available | 1 |
| PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map Consistency | Jan 17, 2024 | GPUIncremental Learning | CodeCode Available | 4 |
| Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Jan 17, 2024 | GPUImage Classification | CodeCode Available | 2 |
| LoMA: Lossless Compressed Memory Attention | Jan 16, 2024 | GPU | —Unverified | 0 |
| Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference | Jan 16, 2024 | GPUMixture-of-Experts | CodeCode Available | 1 |
| Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models | Jan 16, 2024 | GPUQuantization | CodeCode Available | 3 |
| TP-Aware Dequantization | Jan 15, 2024 | GPUQuantization | —Unverified | 0 |
| Efficient approximation of Earth Mover's Distance Based on Nearest Neighbor Search | Jan 14, 2024 | GPUimage-classification | CodeCode Available | 0 |
| Beyond Traditional Approaches: Multi-Task Network for Breast Ultrasound Diagnosis | Jan 14, 2024 | Anomaly ClassificationCancer Classification | CodeCode Available | 0 |
| Parameter-Efficient Detoxification with Contrastive Decoding | Jan 13, 2024 | AttributeGPU | —Unverified | 0 |
| E^2-LLM: Efficient and Extreme Length Extension of Large Language Models | Jan 13, 2024 | 4kGPU | —Unverified | 0 |
| Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting | Jan 12, 2024 | 4kGPU | —Unverified | 0 |
| Efficient Parallel Data Optimization for Homogeneous Diffusion Inpainting of 4K Images | Jan 12, 2024 | 4kGPU | —Unverified | 0 |
| Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction | Jan 12, 2024 | Bandwidth ExtensionCPU | CodeCode Available | 2 |
| Extreme Compression of Large Language Models via Additive Quantization | Jan 11, 2024 | CPUGPU | CodeCode Available | 5 |
| PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU | Jan 11, 2024 | ClusteringGPU | —Unverified | 0 |
| MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring | Jan 11, 2024 | Data CompressionGPU | —Unverified | 0 |
| Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning | Jan 10, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models | Jan 10, 2024 | GPUImage Generation | CodeCode Available | 7 |