| Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference | Aug 23, 2023 | CPUGPU | CodeCode Available | 1 |
| Towards Lightweight Data Integration using Multi-workflow Provenance and Data Observability | Aug 17, 2023 | CPUData Integration | CodeCode Available | 1 |
| When Monte-Carlo Dropout Meets Multi-Exit: Optimizing Bayesian Neural Networks on FPGA | Aug 13, 2023 | Autonomous DrivingCPU | CodeCode Available | 1 |
| High-performance Data Management for Whole Slide Image Analysis in Digital Pathology | Aug 10, 2023 | CPUGPU | CodeCode Available | 1 |
| QUANT: A Minimalist Interval Method for Time Series Classification | Aug 2, 2023 | ClassificationCPU | CodeCode Available | 1 |
| BearingPGA-Net: A Lightweight and Deployable Bearing Fault Diagnosis Network via Decoupled Knowledge Distillation and FPGA Acceleration | Jul 31, 2023 | CPUFault Diagnosis | CodeCode Available | 1 |
| Mitigating Communications Threats in Decentralized Federated Learning through Moving Target Defense | Jul 21, 2023 | CPUFederated Learning | CodeCode Available | 1 |
| Implementation of a perception system for autonomous vehicles using a detection-segmentation network in SoC FPGA | Jul 17, 2023 | Autonomous VehiclesCPU | CodeCode Available | 1 |
| Fast model inference and training on-board of Satellites | Jul 17, 2023 | CPUDecision Making | CodeCode Available | 1 |
| QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models | Jul 7, 2023 | Code GenerationCPU | CodeCode Available | 1 |
| An open-source deep learning algorithm for efficient and fully-automatic analysis of the choroid in optical coherence tomography | Jul 3, 2023 | CPUSegmentation | CodeCode Available | 1 |
| SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores | Jun 29, 2023 | CPUreinforcement-learning | CodeCode Available | 1 |
| Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses | Jun 28, 2023 | CPUGPU | CodeCode Available | 1 |
| Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference | Jun 26, 2023 | CPUModel Compression | CodeCode Available | 1 |
| Implementing contextual biasing in GPU decoder for online ASR | Jun 23, 2023 | CPUDecoder | CodeCode Available | 1 |
| Dynamic Perceiver for Efficient Visual Recognition | Jun 20, 2023 | Action RecognitionClassification | CodeCode Available | 1 |
| Co-design Hardware and Algorithm for Vector Search | Jun 19, 2023 | CPUInformation Retrieval | CodeCode Available | 1 |
| ExoMDN: Rapid characterization of exoplanet interior structures with Mixture Density Networks | Jun 15, 2023 | CPU | CodeCode Available | 1 |
| Audio Tagging on an Embedded Hardware Platform | Jun 15, 2023 | Audio ClassificationAudio Tagging | CodeCode Available | 1 |
| EfficientBioAI: Making Bioimaging AI Models Efficient in Energy, Latency and Representation | Jun 9, 2023 | CPUGPU | CodeCode Available | 1 |
| The Information Retrieval Experiment Platform | May 30, 2023 | CPUInformation Retrieval | CodeCode Available | 1 |
| Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | May 30, 2023 | CPUGPU | CodeCode Available | 1 |
| Search-Based Regular Expression Inference on a GPU | May 29, 2023 | CPUGPU | CodeCode Available | 1 |
| EfficientSpeech: An On-Device Text to Speech Model | May 23, 2023 | CPUmodel | CodeCode Available | 1 |
| Fast and Attributed Change Detection on Dynamic Graphs with Density of States | May 15, 2023 | Change DetectionChange Point Detection | CodeCode Available | 1 |
| Dynamic Sparse Training with Structured Sparsity | May 3, 2023 | CPUGPU | CodeCode Available | 1 |
| Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures | Apr 25, 2023 | CPU | CodeCode Available | 1 |
| FindVehicle and VehicleFinder: A NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval system | Apr 21, 2023 | CPUGPU | CodeCode Available | 1 |
| Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value | Apr 16, 2023 | CPUData Valuation | CodeCode Available | 1 |
| DGNN-Booster: A Generic FPGA Accelerator Framework For Dynamic Graph Neural Network Inference | Apr 13, 2023 | CPUGPU | CodeCode Available | 1 |
| InterFormer: Real-time Interactive Image Segmentation | Apr 6, 2023 | Computational EfficiencyCPU | CodeCode Available | 1 |
| Real-Time Dense 3D Mapping of Underwater Environments | Apr 5, 2023 | 3D ReconstructionCPU | CodeCode Available | 1 |
| TransPimLib: A Library for Efficient Transcendental Functions on Processing-in-Memory Systems | Apr 3, 2023 | CPUGPU | CodeCode Available | 1 |
| GNNBuilder: An Automated Framework for Generic Graph Neural Network Accelerator Generation, Simulation, and Optimization | Mar 29, 2023 | Code GenerationCPU | CodeCode Available | 1 |
| FAStEN: An Efficient Adaptive Method for Feature Selection and Estimation in High-Dimensional Functional Regressions | Mar 26, 2023 | CPUfeature selection | CodeCode Available | 1 |
| Practically Solving LPN in High Noise Regimes Faster Using Neural Networks | Mar 14, 2023 | CPUVocal Bursts Intensity Prediction | CodeCode Available | 1 |
| Fourier-MIONet: Fourier-enhanced multiple-input neural operators for multiphase modeling of geological carbon sequestration | Mar 8, 2023 | CPUGPU | CodeCode Available | 1 |
| Efficient subtyping of ovarian cancer histopathology whole slide images using active sampling in multiple instance learning | Feb 17, 2023 | ClassificationCPU | CodeCode Available | 1 |
| GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Jan 26, 2023 | CPUGPU | CodeCode Available | 1 |
| FemtoDet: An Object Detection Baseline for Energy Versus Performance Tradeoffs | Jan 17, 2023 | CPUobject-detection | CodeCode Available | 1 |
| Distributed Deep Neural-Network-Based Middleware for Cyber-Attacks Detection in Smart IoT Ecosystem: A Novel Framework and Performance Evaluation Approach | Jan 6, 2023 | CPU | CodeCode Available | 1 |
| Autothrottle: A Practical Bi-Level Approach to Resource Management for SLO-Targeted Microservices | Dec 23, 2022 | CPUManagement | CodeCode Available | 1 |
| GPU-accelerated Guided Source Separation for Meeting Transcription | Dec 10, 2022 | blind source separationCPU | CodeCode Available | 1 |
| A Practical Stereo Depth System for Smart Glasses | Nov 19, 2022 | CPUDepth Estimation | CodeCode Available | 1 |
| ParticleGrid: Enabling Deep Learning using 3D Representation of Materials | Nov 15, 2022 | CPUDeep Learning | CodeCode Available | 1 |
| WindowSHAP: An Efficient Framework for Explaining Time-series Classifiers based on Shapley Values | Nov 11, 2022 | CPUTime Series | CodeCode Available | 1 |
| TLP: A Deep Learning-based Cost Model for Tensor Program Tuning | Nov 7, 2022 | CPUGPU | CodeCode Available | 1 |
| SLOPT: Bandit Optimization Framework for Mutation-Based Fuzzing | Nov 7, 2022 | CPU | CodeCode Available | 1 |
| Frequency Cam: Imaging Periodic Signals in Real-Time | Nov 1, 2022 | CPU | CodeCode Available | 1 |
| AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation | Oct 14, 2022 | CPUMachine Translation | CodeCode Available | 1 |