| Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans | Mar 22, 2024 | GPUImage Segmentation | CodeCode Available | 1 |
| DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics | Mar 21, 2024 | GPU | CodeCode Available | 1 |
| On Pretraining Data Diversity for Self-Supervised Learning | Mar 20, 2024 | DiversityGPU | CodeCode Available | 1 |
| Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Mar 18, 2024 | Anomaly DetectionDenoising | CodeCode Available | 1 |
| JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning | Mar 17, 2024 | GPUManagement | CodeCode Available | 1 |
| FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images | Mar 14, 2024 | 3D Medical Imaging SegmentationGPU | CodeCode Available | 1 |
| Optimistic Verifiable Training by Controlling Hardware Nondeterminism | Mar 14, 2024 | Data PoisoningGPU | CodeCode Available | 1 |
| SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model | Mar 13, 2024 | Depth EstimationGPU | CodeCode Available | 1 |
| Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Mar 12, 2024 | GPUPoint Tracking | CodeCode Available | 1 |
| LookupFFN: Making Transformers Compute-lite for CPU inference | Mar 12, 2024 | CPUGPU | CodeCode Available | 1 |
| SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces | Mar 12, 2024 | GPUImage Generation | CodeCode Available | 1 |
| UniSparse: An Intermediate Language for General Sparse Format Customization | Mar 9, 2024 | AttributeCode Generation | CodeCode Available | 1 |
| LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization | Mar 2, 2024 | GPUQuantization | CodeCode Available | 1 |
| Efficient Lifelong Model Evaluation in an Era of Rapid Progress | Feb 29, 2024 | BenchmarkingGPU | CodeCode Available | 1 |
| Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control | Feb 27, 2024 | GPUImage Retrieval | CodeCode Available | 1 |
| DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation | Feb 27, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| PyGim: An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures | Feb 26, 2024 | CPUGPU | CodeCode Available | 1 |
| Mechanistic Neural Networks for Scientific Machine Learning | Feb 20, 2024 | Equation DiscoveryGPU | CodeCode Available | 1 |
| BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation | Feb 18, 2024 | GPUQuestion Answering | CodeCode Available | 1 |
| Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Feb 15, 2024 | GPUReinforcement Learning (RL) | CodeCode Available | 1 |
| Anchor-based Large Language Models | Feb 12, 2024 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes | Feb 8, 2024 | GPU | CodeCode Available | 1 |
| Improving Token-Based World Models with Parallel Observation Prediction | Feb 8, 2024 | GPUPrediction | CodeCode Available | 1 |
| TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph Representation Learning | Feb 8, 2024 | DenoisingFraud Detection | CodeCode Available | 1 |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Feb 7, 2024 | GPULanguage Modeling | CodeCode Available | 1 |