| LSM-GNN: Large-scale Storage-based Multi-GPU GNN Training by Optimizing Data Transfer Scheme | Jul 21, 2024 | CPU, Fraud Detection | Unverified | 0 |
| Regression prediction algorithm for energy consumption regression in cloud computing based on horned lizard algorithm optimised convolutional neural network-bidirectional gated recurrent unit | Jul 19, 2024 | Cloud Computing, CPU | Unverified | 0 |
| OCTolyzer: Fully automatic toolkit for segmentation and feature extracting in optical coherence tomography and scanning laser ophthalmoscopy data | Jul 19, 2024 | CPU | Code Available | 1 |
| Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | Jul 19, 2024 | CPU, GPU | Unverified | 0 |
| Mixed-precision Neural Networks on RISC-V Cores: ISA extensions for Multi-Pumped Soft SIMD Operations | Jul 19, 2024 | CPU, Quantization | Code Available | 1 |
| Attention in SRAM on Tenstorrent Grayskull | Jul 18, 2024 | CPU, GPU | Code Available | 1 |
| RISC-V RVV efficiency for ANN algorithms | Jul 18, 2024 | CPU | Unverified | 0 |
| FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Jul 17, 2024 | CPU, Domain Adaptation | Code Available | 1 |
| ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | Jul 17, 2024 | CPU, GPU | Unverified | 0 |
| MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | Jul 16, 2024 | CPU, GPU | Unverified | 0 |
| PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Jul 16, 2024 | CPU | Unverified | 0 |
| A Bag of Tricks for Scaling CPU-based Deep FFMs to more than 300m Predictions per Second | Jul 14, 2024 | Click-Through Rate Prediction, CPU | Unverified | 0 |
| TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing | Jul 12, 2024 | CPU, Language Modelling | Unverified | 0 |
| Analyzing Machine Learning Performance in a Hybrid Quantum Computing and HPC Environment | Jul 10, 2024 | CPU, GPU | Unverified | 0 |
| Inference Performance Optimization for Large Language Models on CPUs | Jul 10, 2024 | CPU, GPU | Code Available | 3 |
| Fast On-device LLM Inference with NPUs | Jul 8, 2024 | CPU, GPU | Code Available | 5 |
| Accelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network | Jul 7, 2024 | CPU, Diagnostic | Unverified | 0 |
| Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms | Jul 3, 2024 | Benchmarking, CPU | Unverified | 0 |
| Supporting Cross-language Cross-project Bug Localization Using Pre-trained Language Models | Jul 3, 2024 | Contrastive Learning, CPU | Unverified | 0 |
| Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization | Jul 3, 2024 | Anomaly Detection, CPU | Unverified | 0 |
| FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis | Jun 30, 2024 | CPU, Decoder | Unverified | 0 |
| Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM) | Jun 26, 2024 | CPU, GPU | Unverified | 0 |
| T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge | Jun 25, 2024 | Computational Efficiency, CPU | Code Available | 4 |
| Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Jun 24, 2024 | CPU, GPU | Code Available | 7 |
| SLOctolyzer: Fully automatic analysis toolkit for segmentation and feature extracting in scanning laser ophthalmoscopy images | Jun 24, 2024 | Anatomy, CPU | Code Available | 1 |
| Towards Dynamic Resource Allocation and Client Scheduling in Hierarchical Federated Learning: A Two-Phase Deep Reinforcement Learning Approach | Jun 21, 2024 | CPU, Deep Reinforcement Learning | Unverified | 0 |
| Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Jun 20, 2024 | Autonomous Driving, CPU | Code Available | 1 |
| UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture | Jun 20, 2024 | CPU, GPU | Unverified | 0 |
| GPU-Accelerated DCOPF using Gradient-Based Optimization | Jun 19, 2024 | CPU, GPU | Code Available | 0 |
| Sparse High Rank Adapters | Jun 19, 2024 | CPU, GPU | Unverified | 0 |
| Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | Jun 17, 2024 | CPU, Data Augmentation | Code Available | 0 |
| Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | Jun 17, 2024 | CPU, GPU | Unverified | 0 |
| PixRO: Pixel-Distributed Rotational Odometry with Gaussian Belief Propagation | Jun 14, 2024 | CPU, GPU | Unverified | 0 |
| Deep Symbolic Optimization for Combinatorial Optimization: Accelerating Node Selection by Discovering Potential Heuristics | Jun 14, 2024 | Combinatorial Optimization, CPU | Code Available | 0 |
| GEB-1.3B: Open Lightweight Large Language Model | Jun 14, 2024 | CPU, Language Modeling | Unverified | 0 |
| Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectors | Jun 14, 2024 | CPU, GPU | Code Available | 0 |
| ProTrain: Efficient LLM Training via Memory-Aware Techniques | Jun 12, 2024 | CPU, GPU | Unverified | 0 |
| PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | Jun 11, 2024 | CPU, GPU | Unverified | 0 |
| fSEAD: a Composable FPGA-based Streaming Ensemble Anomaly Detection Library | Jun 10, 2024 | Anomaly Detection, CPU | Code Available | 0 |
| PowerInfer-2: Fast Large Language Model Inference on a Smartphone | Jun 10, 2024 | CPU, Language Modeling | Code Available | 9 |
| Investigating Memory Failure Prediction Across CPU Architectures | Jun 8, 2024 | CPU, Prediction | Unverified | 0 |
| Ensemble Method for System Failure Detection Using Large-Scale Telemetry Data | Jun 7, 2024 | CPU | Unverified | 0 |
| MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jun 7, 2024 | CPU, GPU | Code Available | 1 |
| Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control | Jun 4, 2024 | Bandwidth Extension, CPU | Code Available | 2 |
| Position Paper: Think Globally, React Locally -- Bringing Real-time Reference-based Website Phishing Detection on macOS | May 28, 2024 | CPU, Position | Unverified | 0 |
| Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains | May 28, 2024 | CPU | Unverified | 0 |
| LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking | May 27, 2024 | CPU, Knowledge Distillation | Code Available | 1 |
| MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | May 25, 2024 | CPU, Defect Detection | Code Available | 1 |
| Aerial Inspection of High-Voltage Power Lines Using YOLOv8 Real-Time Object Detector | May 24, 2024 | CPU, Defect Detection | Code Available | 0 |
| Improving Simulation Regression Efficiency using a Machine Learning-based Method in Design Verification | May 24, 2024 | CPU | Unverified | 0 |