| Deep Architectures for Neural Machine Translation | Jul 24, 2017 | Decoder, GPU | Code Available | 1 | 5 |
| Deep CNNs Meet Global Covariance Pooling: Better Representation and Generalization | Apr 15, 2019 | Fine-Grained Visual Recognition, General Classification | Code Available | 1 | 5 |
| EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention Fusion | Mar 23, 2020 | GPU, Instance Segmentation | Code Available | 1 | 5 |
| CPU- and GPU-based Distributed Sampling in Dirichlet Process Mixtures for Large-scale Analysis | Apr 19, 2022 | CPU, GPU | Code Available | 1 | 5 |
| A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking | Jun 15, 2024 | Benchmarking, GPU | Code Available | 1 | 5 |
| Effective Batching for Recurrent Neural Network Grammars | May 31, 2021 | GPU, Language Modeling | Code Available | 1 | 5 |
| Evaluation and Optimization of Gradient Compression for Distributed Deep Learning | Jun 15, 2023 | Deep Learning, GPU | Code Available | 1 | 5 |
| CPM-2: Large-scale Cost-effective Pre-trained Language Models | Jun 20, 2021 | Decoder, GPU | Code Available | 1 | 5 |
| AutoDNNchip: An Automated DNN Chip Predictor and Builder for Both FPGAs and ASICs | Jan 6, 2020 | GPU | Code Available | 1 | 5 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| Aerial Single-View Depth Completion with Image-Guided Uncertainty Estimation | Jan 17, 2020 | Depth Completion, GPU | Code Available | 1 | 5 |
| Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | May 30, 2023 | CPU, GPU | Code Available | 1 | 5 |
| LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models | Jul 5, 2025 | Benchmarking, GPU | Code Available | 1 | 5 |
| LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution | Sep 5, 2024 | GPU, Image Super-Resolution | Code Available | 1 | 5 |
| CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU | Apr 13, 2022 | Click-Through Rate Prediction, GPU | Code Available | 1 | 5 |
| Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes | Jan 15, 2021 | All-day Semantic Segmentation, Autonomous Vehicles | Code Available | 1 | 5 |
| LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services | Oct 3, 2024 | Benchmarking, GPU | Code Available | 1 | 5 |
| LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization | Mar 2, 2024 | GPU, Quantization | Code Available | 1 | 5 |
| Microscopy Image Restoration using Deep Learning on W2S | Apr 22, 2020 | CPU, Deep Learning | Code Available | 1 | 5 |
| COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Jun 13, 2024 | Denoising, GPU | Code Available | 1 | 5 |
| Accelerating Large Scale Real-Time GNN Inference using Channel Pruning | May 10, 2021 | CPU, GPU | Code Available | 1 | 5 |
| A Streaming Approach For Efficient Batched Beam Search | Oct 5, 2020 | GPU, Machine Translation | Code Available | 1 | 5 |
| Counterfactual Generative Networks | Jan 15, 2021 | Classification, counterfactual | Code Available | 1 | 5 |
| eWaSR -- an embedded-compute-ready maritime obstacle detection network | Apr 21, 2023 | GPU | Code Available | 1 | 5 |
| Auto Learning Attention | Dec 1, 2020 | GPU, image-classification | Code Available | 1 | 5 |
| Easy and Efficient Transformer : Scalable Inference Solution For large NLP model | Apr 26, 2021 | Decoder, GPU | Code Available | 1 | 5 |
| Deep learning approach to left ventricular non-compaction measurement | Nov 30, 2020 | CPU, Deep Learning | Code Available | 1 | 5 |
| EXODUS: Stable and Efficient Training of Spiking Neural Networks | May 20, 2022 | GPU | Code Available | 1 | 5 |
| AE-OT: A NEW GENERATIVE MODEL BASED ON EXTENDED SEMI-DISCRETE OPTIMAL TRANSPORT | May 1, 2020 | Decoder, GPU | Code Available | 1 | 5 |
| Edge and Identity Preserving Network for Face Super-Resolution | Aug 27, 2020 | GPU, Super-Resolution | Code Available | 1 | 5 |
| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPU, GPU | Code Available | 1 | 5 |
| Fast and accurate learned multiresolution dynamical downscaling for precipitation | Jan 18, 2021 | CPU, Generative Adversarial Network | Code Available | 1 | 5 |
| FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation | Jun 30, 2025 | Computational Efficiency, Dataset Distillation | Code Available | 1 | 5 |
| EZ-CLIP: Efficient Zeroshot Video Action Recognition | Dec 13, 2023 | Action Recognition, GPU | Code Available | 1 | 5 |
| Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Oct 2, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| Dynamic Structure Pruning for Compressing CNNs | Mar 17, 2023 | GPU | Code Available | 1 | 5 |
| LL-GNN: Low Latency Graph Neural Networks on FPGAs for High Energy Physics | Sep 28, 2022 | GPU, Graph Neural Network | Code Available | 1 | 5 |
| Dynamic Sparse Training with Structured Sparsity | May 3, 2023 | CPU, GPU | Code Available | 1 | 5 |
| LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models | Sep 25, 2023 | GPU, Mixture-of-Experts | Code Available | 1 | 5 |
| Farseer: A Refined Scaling Law in Large Language Models | Jun 12, 2025 | GPU | Code Available | 1 | 5 |
| Dynamic Pooling Improves Nanopore Base Calling Accuracy | May 16, 2021 | GPU | Code Available | 1 | 5 |
| Dynamic Perceiver for Efficient Visual Recognition | Jun 20, 2023 | Action Recognition, Classification | Code Available | 1 | 5 |
| LiVOS: Light Video Object Segmentation with Gated Linear Matching | Nov 5, 2024 | GPU, Semantic Segmentation | Code Available | 1 | 5 |
| CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Apr 29, 2024 | Data Visualization, Decision Making | Code Available | 1 | 5 |
| Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity | Aug 29, 2020 | GPU, Network Pruning | Code Available | 1 | 5 |
| Fast and Accurate Neural CRF Constituency Parsing | Aug 9, 2020 | Constituency Parsing, Dependency Parsing | Code Available | 1 | 5 |
| Fast and Accurate Retrieval of Methane Concentration from Imaging Spectrometer Data Using Sparsity Prior | Mar 6, 2020 | GPU, Retrieval | Code Available | 1 | 5 |
| Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers | Nov 27, 2020 | GPU | Code Available | 1 | 5 |
| Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms | May 8, 2021 | CPU, GPU | Code Available | 1 | 5 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | Classification, GPU | Code Available | 1 | 5 |