| Salus: Fine-Grained GPU Sharing Primitives for Deep Learning Applications | Feb 12, 2019 | CPU, Deep Learning | Code Available | 0 |
| Development of Fast Refinement Detectors on AI Edge Platforms | Sep 24, 2019 | GPU, Object | Code Available | 0 |
| Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels | May 31, 2019 | CPU, GPU | Code Available | 0 |
| TensorLy: Tensor Learning in Python | Oct 29, 2016 | CPU, GPU | Code Available | 0 |
| Fast Algorithms for Spiking Neural Network Simulation with FPGAs | May 3, 2024 | GPU, High-Level Synthesis | Code Available | 0 |
| SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Nov 13, 2024 | Decision Making, GPU | Code Available | 0 |
| Tensor Monte Carlo: particle methods for the GPU era | Jun 22, 2018 | GPU | Code Available | 0 |
| Comparative Analysis of FPGA and GPU Performance for Machine Learning-Based Track Reconstruction at LHCb | Feb 4, 2025 | GPU, Graph Neural Network | Code Available | 0 |
| TensorNetwork for Machine Learning | Jun 7, 2019 | BIG-bench Machine Learning, CPU | Code Available | 0 |
| TensorNetwork on TensorFlow: A Spin Chain Application Using Tree Tensor Networks | May 3, 2019 | GPU, Tensor Networks | Code Available | 0 |
| FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNs | Dec 27, 2023 | Benchmarking, GPU | Code Available | 0 |
| A GPU-based Hydrodynamic Simulator with Boid Interactions | Nov 25, 2023 | GPU, Surface Reconstruction | Code Available | 0 |
| A unified framework for 21cm tomography sample generation and parameter inference with Progressively Growing GANs | Feb 19, 2020 | CPU, Generative Adversarial Network | Code Available | 0 |
| A Generative Appearance Model for End-to-end Video Object Segmentation | Nov 28, 2018 | GPU, One-shot visual object segmentation | Code Available | 0 |
| Faith: An Efficient Framework for Transformer Verification on GPUs | Sep 23, 2022 | GPU, Sentence | Code Available | 0 |
| AttriReBoost: A Gradient-Free Propagation Optimization Method for Cold Start Mitigation in Attribute Missing Graphs | Jan 1, 2025 | Attribute, Computational Efficiency | Code Available | 0 |
| Factored Latent-Dynamic Conditional Random Fields for Single and Multi-label Sequence Modeling | Nov 9, 2019 | GPU, Model Selection | Code Available | 0 |
| Scalable Data Assimilation with Message Passing | Apr 19, 2024 | Bayesian Inference, GPU | Code Available | 0 |
| TernaryNet: Faster Deep Model Inference without GPUs for Medical 3D Segmentation using Sparse and Binary Convolutions | Jan 29, 2018 | 3D Medical Imaging Segmentation, Diagnostic | Code Available | 0 |
| Attention on Attention: Architectures for Visual Question Answering (VQA) | Mar 21, 2018 | GPU, Question Answering | Code Available | 0 |
| UberNet: Training a 'Universal' Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory | Sep 7, 2016 | Boundary Detection, GPU | Code Available | 0 |
| Compact Convolutional Neural Network Cascade for Face Detection | Aug 6, 2015 | 4k, Computational Efficiency | Code Available | 0 |
| Scalable Graph Networks for Particle Simulations | Oct 14, 2020 | GPU | Code Available | 0 |
| Face-NMS: A Core-set Selection Approach for Efficient Face Recognition | Sep 10, 2021 | Face Recognition, GPU | Code Available | 0 |
| Scalable K-FAC Training for Deep Neural Networks with Distributed Preconditioning | Jun 30, 2022 | GPU | Code Available | 0 |
| Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy | Jul 4, 2024 | GPU | Code Available | 0 |
| FaceBoxes: A CPU Real-time Face Detector with High Accuracy | Aug 17, 2017 | CPU, Face Detection | Code Available | 0 |
| Scalable Multitask Learning Using Gradient-based Estimation of Task Affinity | Sep 9, 2024 | GPU, Multi-Label Classification | Code Available | 0 |
| Accelerating the Training of Video Super-Resolution Models | May 10, 2022 | GPU, Super-Resolution | Code Available | 0 |
| Enhanced Recurrent Neural Tangent Kernels for Non-Time-Series Data | Dec 9, 2020 | GPU, Time Series | Code Available | 0 |
| XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Jun 13, 2024 | GPU, In-Context Learning | Code Available | 0 |
| ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning | Jun 20, 2024 | GPU, Video Generation | Code Available | 0 |
| Extensions and Limitations of the Neural GPU | Nov 2, 2016 | GPU | Code Available | 0 |
| A Frequency-aware Software Cache for Large Recommendation System Embeddings | Aug 8, 2022 | CPU, GPU | Code Available | 0 |
| Expressive Higher-Order Link Prediction through Hypergraph Symmetry Breaking | Feb 17, 2024 | GPU, Link Prediction | Code Available | 0 |
| reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Aug 9, 2024 | Contrastive Learning, Data Augmentation | Code Available | 0 |
| ScaleFreeCTR: MixCache-based Distributed Training System for CTR Models with Huge Embedding Table | Apr 17, 2021 | CPU, GPU | Code Available | 0 |
| A Truncated Newton Method for Optimal Transport | Apr 2, 2025 | GPU | Code Available | 0 |
| Scaling Attention to Very Long Sequences in Linear Time with Wavelet-Enhanced Random Spectral Attention (WERSA) | Jul 11, 2025 | GPU | Code Available | 0 |
| Ultra-High-Definition Image Deblurring via Multi-scale Cubic-Mixer | Jun 8, 2022 | Deblurring, GPU | Code Available | 0 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPU, Language Modeling | Code Available | 0 |
| Explore as a Storm, Exploit as a Raindrop: On the Benefit of Fine-Tuning Kernel Schedulers with Coordinate Descent | Jun 28, 2024 | GPU, Scheduling | Code Available | 0 |
| Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs | Feb 10, 2025 | GPU | Code Available | 0 |
| Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Oct 14, 2024 | Autonomous Driving, GPU | Code Available | 0 |
| Exact Gaussian Processes on a Million Data Points | Mar 19, 2019 | Gaussian Processes, GPU | Code Available | 0 |
| Accelerating Simulation-based Inference with Emerging AI Hardware | Dec 12, 2020 | Epidemiology, GPU | Code Available | 0 |
| Evolving Neural Architecture Using One Shot Model | Dec 23, 2020 | GPU, model | Code Available | 0 |
| Evolutionary NAS with Gene Expression Programming of Cellular Encoding | May 27, 2020 | General Classification, GPU | Code Available | 0 |
| Communication-Efficient Graph Neural Networks with Probabilistic Neighborhood Expansion Analysis and Caching | May 4, 2023 | GPU, Recommendation Systems | Code Available | 0 |
| Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language Benchmarks | Oct 18, 2024 | Code Generation, GPU | Code Available | 0 |