| Merino: Entropy-driven Design for Generative Language Models on IoT Devices | Feb 28, 2024 | CPULanguage Modeling | —Unverified | 0 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | BenchmarkingCPU | CodeCode Available | 1 |
| PyGim: An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures | Feb 26, 2024 | CPUGPU | CodeCode Available | 1 |
| l1-norm regularized l1-norm best-fit lines | Feb 26, 2024 | CPU | —Unverified | 0 |
| GPTVQ: The Blessing of Dimensionality for LLM Quantization | Feb 23, 2024 | CPUQuantization | —Unverified | 0 |
| Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning | Feb 22, 2024 | CPUGPU | —Unverified | 0 |
| Green AI: A Preliminary Empirical Study on Energy Consumption in DL Models Across Different Runtime Infrastructures | Feb 21, 2024 | CPUGPU | —Unverified | 0 |
| Neuromorphic Synergy for Video Binarization | Feb 20, 2024 | BinarizationCamera Calibration | CodeCode Available | 1 |
| Accelerating local laplacian filters on FPGAs | Feb 18, 2024 | CPUinverse tone mapping | —Unverified | 0 |
| Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics | Feb 18, 2024 | CPUDeep Reinforcement Learning | —Unverified | 0 |
| Predicting User Experience on Laptops from Hardware Specifications | Feb 14, 2024 | CPU | —Unverified | 0 |
| Graph Feature Preprocessor: Real-time Subgraph-based Feature Extraction for Financial Crime Detection | Feb 13, 2024 | CPUGPU | —Unverified | 0 |
| Utilizing Low-Dimensional Molecular Embeddings for Rapid Chemical Similarity Search | Feb 12, 2024 | CPUDrug Discovery | —Unverified | 0 |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Feb 12, 2024 | Computational EfficiencyCPU | CodeCode Available | 0 |
| Spectral Efficiency Maximization for Active RIS-aided Cell-Free Massive MIMO Systems with Imperfect CSI | Feb 11, 2024 | CPU | —Unverified | 0 |
| Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Feb 10, 2024 | CPUGPU | CodeCode Available | 3 |
| Anatomizing Deep Learning Inference in Web Browsers | Feb 8, 2024 | CPUDeep Learning | —Unverified | 0 |
| A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction | Feb 7, 2024 | AvgCPU | CodeCode Available | 1 |
| ServeFlow: A Fast-Slow Model Architecture for Network Traffic Analysis | Feb 6, 2024 | CPU | —Unverified | 0 |
| ProactivePIM: Accelerating Weight-Sharing Embedding Layer with PIM for Scalable Recommendation System | Feb 6, 2024 | CPURecommendation Systems | —Unverified | 0 |
| Design and Implementation of an Automated Disaster-recovery System for a Kubernetes Cluster Using LSTM | Feb 5, 2024 | CPUManagement | —Unverified | 0 |
| Spin: An Efficient Secure Computation Framework with GPU Acceleration | Feb 4, 2024 | CPUGPU | —Unverified | 0 |
| Root Cause Analysis In Microservice Using Neural Granger Causal Discovery | Feb 2, 2024 | Causal DiscoveryContrastive Learning | CodeCode Available | 1 |
| Asynchronous Distributed Genetic Algorithms with Javascript and JSON | Jan 30, 2024 | CPU | —Unverified | 0 |
| SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Jan 29, 2024 | CPUGPU | CodeCode Available | 2 |