| LSM-GNN: Large-scale Storage-based Multi-GPU GNN Training by Optimizing Data Transfer Scheme | Jul 21, 2024 | CPU, Fraud Detection | Unverified | 0 |
| Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | Jul 19, 2024 | CPU, GPU | Unverified | 0 |
| OCTolyzer: Fully automatic toolkit for segmentation and feature extracting in optical coherence tomography and scanning laser ophthalmoscopy data | Jul 19, 2024 | CPU | Code Available | 1 |
| Mixed-precision Neural Networks on RISC-V Cores: ISA extensions for Multi-Pumped Soft SIMD Operations | Jul 19, 2024 | CPU, Quantization | Code Available | 1 |
| Regression prediction algorithm for energy consumption regression in cloud computing based on horned lizard algorithm optimised convolutional neural network-bidirectional gated recurrent unit | Jul 19, 2024 | Cloud Computing, CPU | Unverified | 0 |
| Attention in SRAM on Tenstorrent Grayskull | Jul 18, 2024 | CPU, GPU | Code Available | 1 |
| RISC-V RVV efficiency for ANN algorithms | Jul 18, 2024 | CPU | Unverified | 0 |
| FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Jul 17, 2024 | CPU, Domain Adaptation | Code Available | 1 |
| ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | Jul 17, 2024 | CPU, GPU | Unverified | 0 |
| MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training | Jul 16, 2024 | CPU, GPU | Unverified | 0 |
| PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation | Jul 16, 2024 | CPU | Unverified | 0 |
| A Bag of Tricks for Scaling CPU-based Deep FFMs to more than 300m Predictions per Second | Jul 14, 2024 | Click-Through Rate Prediction, CPU | Unverified | 0 |
| TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing | Jul 12, 2024 | CPU, Language Modelling | Unverified | 0 |
| Analyzing Machine Learning Performance in a Hybrid Quantum Computing and HPC Environment | Jul 10, 2024 | CPU, GPU | Unverified | 0 |
| Inference Performance Optimization for Large Language Models on CPUs | Jul 10, 2024 | CPU, GPU | Code Available | 3 |
| Fast On-device LLM Inference with NPUs | Jul 8, 2024 | CPU, GPU | Code Available | 5 |
| Accelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network | Jul 7, 2024 | CPU, Diagnostic | Unverified | 0 |
| Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms | Jul 3, 2024 | Benchmarking, CPU | Unverified | 0 |
| Supporting Cross-language Cross-project Bug Localization Using Pre-trained Language Models | Jul 3, 2024 | Contrastive Learning, CPU | Unverified | 0 |
| Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization | Jul 3, 2024 | Anomaly Detection, CPU | Unverified | 0 |
| FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis | Jun 30, 2024 | CPU, Decoder | Unverified | 0 |
| Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM) | Jun 26, 2024 | CPU, GPU | Unverified | 0 |
| T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge | Jun 25, 2024 | Computational Efficiency, CPU | Code Available | 4 |
| Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Jun 24, 2024 | CPU, GPU | Code Available | 7 |
| SLOctolyzer: Fully automatic analysis toolkit for segmentation and feature extracting in scanning laser ophthalmoscopy images | Jun 24, 2024 | Anatomy, CPU | Code Available | 1 |