| Asymmetric Non-local Neural Networks for Semantic Segmentation | Aug 21, 2019 | GPU, Segmentation | Code Available | 2 |
| Habitat: A Platform for Embodied AI Research | Apr 2, 2019 | Benchmarking, GPU | Code Available | 2 |
| AutoFocus: Efficient Multi-Scale Inference | Dec 4, 2018 | GPU | Code Available | 2 |
| ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware | Dec 2, 2018 | GPU, Image Classification | Code Available | 2 |
| SNIPER: Efficient Multi-Scale Training | May 23, 2018 | GPU, image-classification | Code Available | 2 |
| geomstats: a Python Package for Riemannian Geometry in Machine Learning | May 21, 2018 | BIG-bench Machine Learning, GPU | Code Available | 2 |
| Efficient Neural Audio Synthesis | Feb 23, 2018 | Audio Synthesis, CPU | Code Available | 2 |
| AMC: AutoML for Model Compression and Acceleration on Mobile Devices | Feb 10, 2018 | AutoML, GPU | Code Available | 2 |
| Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer | Jan 23, 2017 | Computational Efficiency, GPU | Code Available | 2 |
| Feature Pyramid Networks for Object Detection | Dec 9, 2016 | GPU, Object | Code Available | 2 |
| GPflow: A Gaussian process library using TensorFlow | Oct 27, 2016 | Gaussian Processes, GPU | Code Available | 2 |
| Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations | Sep 22, 2016 | GPU | Code Available | 2 |
| Fast Algorithms for Convolutional Neural Networks | Sep 30, 2015 | GPU, Pedestrian Detection | Code Available | 2 |
| Relative Entropy Pathwise Policy Optimization | Jul 15, 2025 | GPU | Code Available | 1 |
| LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models | Jul 5, 2025 | Benchmarking, GPU | Code Available | 1 |
| FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation | Jun 30, 2025 | Computational Efficiency, Dataset Distillation | Code Available | 1 |
| Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch | Jun 25, 2025 | Computational Efficiency, GPR | Code Available | 1 |
| Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking | Jun 25, 2025 | GPU, Visual Tracking | Code Available | 1 |
| DIP: Unsupervised Dense In-Context Post-training of Visual Representations | Jun 23, 2025 | GPU, Meta-Learning | Code Available | 1 |
| CommVQ: Commutative Vector Quantization for KV Cache Compression | Jun 23, 2025 | GPU, GSM8K | Code Available | 1 |
| ConsumerBench: Benchmarking Generative AI Applications on End-User Devices | Jun 21, 2025 | Benchmarking, CPU | Code Available | 1 |
| Farseer: A Refined Scaling Law in Large Language Models | Jun 12, 2025 | GPU | Code Available | 1 |
| Mutual-Supervised Learning for Sequential-to-Parallel Code Translation | Jun 11, 2025 | Code Translation, GPU | Code Available | 1 |
| Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts | Jun 5, 2025 | GPU, Scheduling | Code Available | 1 |
| Accelerating AllReduce with a Persistent Straggler | May 29, 2025 | GPU | Code Available | 1 |