| Towards Dynamic Resource Allocation and Client Scheduling in Hierarchical Federated Learning: A Two-Phase Deep Reinforcement Learning Approach | Jun 21, 2024 | CPU, Deep Reinforcement Learning | Unverified | 0 |
| Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Jun 20, 2024 | Autonomous Driving, CPU | Code Available | 1 |
| UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture | Jun 20, 2024 | CPU, GPU | Unverified | 0 |
| GPU-Accelerated DCOPF using Gradient-Based Optimization | Jun 19, 2024 | CPU, GPU | Code Available | 0 |
| Sparse High Rank Adapters | Jun 19, 2024 | CPU, GPU | Unverified | 0 |
| Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | Jun 17, 2024 | CPU, Data Augmentation | Code Available | 0 |
| Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | Jun 17, 2024 | CPU, GPU | Unverified | 0 |
| PixRO: Pixel-Distributed Rotational Odometry with Gaussian Belief Propagation | Jun 14, 2024 | CPU, GPU | Unverified | 0 |
| Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential Heuristics | Jun 14, 2024 | Combinatorial Optimization, CPU | Code Available | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | Jun 14, 2024 | CPU, Language Modeling | Unverified | 0 |
| Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectors | Jun 14, 2024 | CPU, GPU | Code Available | 0 |
| ProTrain: Efficient LLM Training via Memory-Aware Techniques | Jun 12, 2024 | CPU, GPU | Unverified | 0 |
| PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | Jun 11, 2024 | CPU, GPU | Unverified | 0 |
| fSEAD: a Composable FPGA-based Streaming Ensemble Anomaly Detection Library | Jun 10, 2024 | Anomaly Detection, CPU | Code Available | 0 |
| PowerInfer-2: Fast Large Language Model Inference on a Smartphone | Jun 10, 2024 | CPU, Language Modeling | Code Available | 9 |
| Investigating Memory Failure Prediction Across CPU Architectures | Jun 8, 2024 | CPU, Prediction | Unverified | 0 |
| Ensemble Method for System Failure Detection Using Large-Scale Telemetry Data | Jun 7, 2024 | CPU | Unverified | 0 |
| MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jun 7, 2024 | CPU, GPU | Code Available | 1 |
| Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control | Jun 4, 2024 | Bandwidth Extension, CPU | Code Available | 2 |
| Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS | May 28, 2024 | CPU, Position | Unverified | 0 |
| Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains | May 28, 2024 | CPU | Unverified | 0 |
| LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking | May 27, 2024 | CPU, Knowledge Distillation | Code Available | 1 |
| MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | May 25, 2024 | CPU, Defect Detection | Code Available | 1 |
| Aerial Inspection of High-Voltage Power Lines Using YOLOv8 Real-Time Object Detector | May 24, 2024 | CPU, Defect Detection | Code Available | 0 |
| Improving Simulation Regression Efficiency using a Machine Learning-based Method in Design Verification | May 24, 2024 | CPU | Unverified | 0 |