| Atom: Low-bit Quantization for Efficient and Accurate LLM Serving | Oct 29, 2023 | GPUQuantization | CodeCode Available | 2 |
| FP8-LM: Training FP8 Large Language Models | Oct 27, 2023 | GPU | CodeCode Available | 2 |
| QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models | Oct 25, 2023 | GPUMixture-of-Experts | CodeCode Available | 2 |
| LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation | Oct 16, 2023 | GPUImage Animation | CodeCode Available | 2 |
| ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Oct 16, 2023 | General Reinforcement LearningGPU | CodeCode Available | 2 |
| GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models | Oct 12, 2023 | GPUText to 3D | CodeCode Available | 2 |
| Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes | Oct 12, 2023 | GPUNovel View Synthesis | CodeCode Available | 2 |
| DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training | Oct 5, 2023 | GPU | CodeCode Available | 2 |
| MEM: Multi-Modal Elevation Mapping for Robotics and Learning | Sep 28, 2023 | ColorizationGPU | CodeCode Available | 2 |
| ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers | Sep 28, 2023 | GPUInstruction Following | CodeCode Available | 2 |
| OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control | Sep 22, 2023 | GPUreinforcement-learning | CodeCode Available | 2 |
| Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity | Sep 19, 2023 | GPU | CodeCode Available | 2 |
| CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra | Sep 6, 2023 | CoLAGaussian Processes | CodeCode Available | 2 |
| CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs | Aug 29, 2023 | CPUGPU | CodeCode Available | 2 |
| OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models | Aug 25, 2023 | Common Sense ReasoningComputational Efficiency | CodeCode Available | 2 |
| FastSurfer-HypVINN: Automated sub-segmentation of the hypothalamus and adjacent structures on high-resolutional brain MRI | Aug 24, 2023 | GPUSegmentation | CodeCode Available | 2 |
| Platypus: Quick, Cheap, and Powerful Refinement of LLMs | Aug 14, 2023 | GPU | CodeCode Available | 2 |
| Machine-learned molecular mechanics force field for the simulation of protein-ligand systems and beyond | Jul 13, 2023 | Drug DesignDrug Discovery | CodeCode Available | 2 |
| Differentiable Forward Projector for X-ray Computed Tomography | Jul 11, 2023 | CT ReconstructionDeep Learning | CodeCode Available | 2 |
| InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Jul 10, 2023 | GPUInformation Retrieval | CodeCode Available | 2 |
| cuSLINK: Single-linkage Agglomerative Clustering on the GPU | Jun 28, 2023 | ClusteringGPU | CodeCode Available | 2 |
| LeanDojo: Theorem Proving with Retrieval-Augmented Language Models | Jun 27, 2023 | Automated Theorem ProvingGPU | CodeCode Available | 2 |
| DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome | Jun 26, 2023 | Computational EfficiencyCore Promoter Detection | CodeCode Available | 2 |
| H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Jun 24, 2023 | GPU | CodeCode Available | 2 |
| RoMe: Towards Large Scale Road Surface Reconstruction via Mesh Representation | Jun 20, 2023 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| Full Parameter Fine-tuning for Large Language Models with Limited Resources | Jun 16, 2023 | GPUparameter-efficient fine-tuning | CodeCode Available | 2 |
| Datasets and Benchmarks for Offline Safe Reinforcement Learning | Jun 15, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| Efficient 3D Semantic Segmentation with Superpoint Transformer | Jun 13, 2023 | 3D Semantic SegmentationGPU | CodeCode Available | 2 |
| StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views | Jun 8, 2023 | Autonomous DrivingGPU | CodeCode Available | 2 |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Jun 5, 2023 | GPULanguage Modelling | CodeCode Available | 2 |
| Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models | Jun 1, 2023 | GPUImage Compression | CodeCode Available | 2 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPUGPU | CodeCode Available | 2 |
| MixFormerV2: Efficient Fully Transformer Tracking | May 25, 2023 | CPUGPU | CodeCode Available | 2 |
| Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense ReasoningCPU | CodeCode Available | 2 |
| Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach | May 23, 2023 | GPUImage Generation | CodeCode Available | 2 |
| Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness | May 18, 2023 | CPUGPU | CodeCode Available | 2 |
| CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model | May 11, 2023 | DenoisingGPU | CodeCode Available | 2 |
| OctFormer: Octree-based Transformers for 3D Point Clouds | May 4, 2023 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| VPGTrans: Transfer Visual Prompt Generator across LLMs | May 2, 2023 | GPUTransfer Learning | CodeCode Available | 2 |
| Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Apr 26, 2023 | Domain AdaptationDomain Generalization | CodeCode Available | 2 |
| Scaling the leading accuracy of deep equivariant models to biomolecular simulations of realistic size | Apr 20, 2023 | GPU | CodeCode Available | 2 |
| Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning | Mar 29, 2023 | GPUreinforcement-learning | CodeCode Available | 2 |
| SimpleNet: A Simple Network for Image Anomaly Detection and Localization | Mar 27, 2023 | Anomaly ClassificationAnomaly Detection | CodeCode Available | 2 |
| EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies | Mar 25, 2023 | Anomaly DetectionComputational Efficiency | CodeCode Available | 2 |
| BiFormer: Vision Transformer with Bi-Level Routing Attention | Mar 15, 2023 | Computational EfficiencyGPU | CodeCode Available | 2 |
| 3DGen: Triplane Latent Diffusion for Textured Mesh Generation | Mar 9, 2023 | DiversityGPU | CodeCode Available | 2 |
| Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks | Mar 7, 2023 | CPUGPU | CodeCode Available | 2 |
| FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation | Mar 4, 2023 | BenchmarkingGPU | CodeCode Available | 2 |
| POPGym: Benchmarking Partially Observable Reinforcement Learning | Mar 3, 2023 | BenchmarkingGPU | CodeCode Available | 2 |