| BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache | Mar 24, 2025 | Computational EfficiencyGPU | CodeCode Available | 2 |
| Boundary-Aware Segmentation Network for Mobile and Web Applications | Jan 12, 2021 | Camouflaged Object SegmentationDecoder | CodeCode Available | 2 |
| Birbal: An efficient 7B instruct-model fine-tuned with curated datasets | Mar 4, 2024 | GPU | CodeCode Available | 2 |
| Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity | Sep 19, 2023 | GPU | CodeCode Available | 2 |
| Positive-Unlabeled Compression on the Cloud | Sep 21, 2019 | GPUKnowledge Distillation | CodeCode Available | 2 |
| FRA-RIR: Fast Random Approximation of the Image-source Method | Aug 8, 2022 | DenoisingGPU | CodeCode Available | 2 |
| FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance | Dec 13, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 2 |
| GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models | Oct 12, 2023 | GPUText to 3D | CodeCode Available | 2 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 |