| A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models | Oct 17, 2022 | CPU, GPU | Code Available | 3 |
| Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Mar 27, 2022 | CPU, Multi-Object Tracking | Code Available | 3 |
| Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates | Sep 27, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| SoundStream: An End-to-End Neural Audio Codec | Jul 7, 2021 | CPU, Decoder | Code Available | 3 |
| Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes | Aug 29, 2017 | BIG-bench Machine Learning, CPU | Code Available | 3 |
| MathOptAI.jl: Embed trained machine learning predictors into JuMP models | Jul 3, 2025 | CPU, Gaussian Processes | Code Available | 2 |
| QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | May 22, 2025 | CPU, GPU | Code Available | 2 |
| HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Apr 8, 2025 | CPU, GPU | Code Available | 2 |
| HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Feb 18, 2025 | Computational Efficiency, CPU | Code Available | 2 |
| TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response Scenarios | Jan 10, 2025 | Aerial Scene Classification, CPU | Code Available | 2 |
| Very fast Bayesian Additive Regression Trees on GPU | Oct 30, 2024 | CPU, GPU | Code Available | 2 |
| Rawsamble: Overlapping and Assembling Raw Nanopore Signals using a Hash-based Seeding Mechanism | Oct 23, 2024 | CPU | Code Available | 2 |
| RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval | Sep 16, 2024 | CPU, GPU | Code Available | 2 |
| Super Monotonic Alignment Search | Sep 12, 2024 | CPU, GPU | Code Available | 2 |
| Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications | Sep 2, 2024 | CPU, Federated Learning | Code Available | 2 |
| NAVIX: Scaling MiniGrid Environments with JAX | Jul 28, 2024 | CPU, Deep Reinforcement Learning | Code Available | 2 |
| Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control | Jun 4, 2024 | Bandwidth Extension, CPU | Code Available | 2 |
| The future of cosmological likelihood-based inference: accelerated high-dimensional parameter estimation and model comparison | May 21, 2024 | Bayesian Inference, CPU | Code Available | 2 |
| AdFlush: A Real-World Deployable Machine Learning Solution for Effective Advertisement and Web Tracker Prevention | May 13, 2024 | Blocking, CPU | Code Available | 2 |
| HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Apr 29, 2024 | CPU, Edge-computing | Code Available | 2 |
| An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Apr 18, 2024 | Contrastive Learning, CPU | Code Available | 2 |
| SFSORT: Scene Features-based Simple Online Real-Time Tracker | Apr 11, 2024 | CPU, Multi-Object Tracking | Code Available | 2 |
| Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures | Apr 3, 2024 | CPU, GPU | Code Available | 2 |
| FireANTs: Adaptive Riemannian Optimization for Multi-Scale Diffeomorphic Matching | Apr 1, 2024 | CPU, Image Registration | Code Available | 2 |
| Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object Tracking | Mar 20, 2024 | 3D Multi-Object Tracking, CPU | Code Available | 2 |
| JaxUED: A simple and useable UED library in Jax | Mar 19, 2024 | CPU | Code Available | 2 |
| SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Jan 29, 2024 | CPU, GPU | Code Available | 2 |
| SCNet: Sparse Compression Network for Music Source Separation | Jan 24, 2024 | CPU, Music Source Separation | Code Available | 2 |
| Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction | Jan 12, 2024 | Bandwidth Extension, CPU | Code Available | 2 |
| Real-time and Continuous Turn-taking Prediction Using Voice Activity Projection | Jan 10, 2024 | CPU | Code Available | 2 |
| XLB: A differentiable massively parallel lattice Boltzmann library in Python | Nov 27, 2023 | CPU, GPU | Code Available | 2 |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPU, GPU | Code Available | 2 |
| Exponentially Faster Language Modelling | Nov 15, 2023 | Benchmarking, CPU | Code Available | 2 |
| Low-latency Real-time Voice Conversion on CPU | Nov 1, 2023 | CPU, Knowledge Distillation | Code Available | 2 |
| Breaking of brightness consistency in optical flow with a lightweight CNN network | Oct 24, 2023 | CPU, Optical Flow Estimation | Code Available | 2 |
| CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs | Aug 29, 2023 | CPU, GPU | Code Available | 2 |
| tsdownsample: high-performance time series downsampling for scalable visualization | Jul 5, 2023 | CPU, Time Series | Code Available | 2 |
| Datasets and Benchmarks for Offline Safe Reinforcement Learning | Jun 15, 2023 | Autonomous Driving, Benchmarking | Code Available | 2 |
| QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control | Jun 15, 2023 | CPU, Deep Reinforcement Learning | Code Available | 2 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPU, GPU | Code Available | 2 |
| MixFormerV2: Efficient Fully Transformer Tracking | May 25, 2023 | CPU, GPU | Code Available | 2 |
| Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense Reasoning, CPU | Code Available | 2 |
| Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness | May 18, 2023 | CPU, GPU | Code Available | 2 |
| Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks | Mar 7, 2023 | CPU, GPU | Code Available | 2 |
| Cross-domain Neural Pitch and Periodicity Estimation | Jan 28, 2023 | CPU, GPU | Code Available | 2 |
| ImMesh: An Immediate LiDAR Localization and Meshing Framework | Jan 12, 2023 | CPU, Dimensionality Reduction | Code Available | 2 |
| evosax: JAX-based Evolution Strategies | Dec 8, 2022 | CPU, Scheduling | Code Available | 2 |
| TorchOpt: An Efficient Library for Differentiable Optimization | Nov 13, 2022 | CPU, GPU | Code Available | 2 |
| Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform | Oct 28, 2022 | CPU, Knowledge Distillation | Code Available | 2 |
| Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering | Oct 25, 2022 | Clustering, CPU | Code Available | 2 |