| Diffusion Sampling Correction via Approximately 10 Parameters | Nov 10, 2024 | GPU | CodeCode Available | 1 |
| KeyB2: Selecting Key Blocks is Also Important for Long Document Ranking with Large Language Models | Nov 9, 2024 | Document RankingGPU | —Unverified | 0 |
| Benchmarking 3D multi-coil NC-PDNet MRI reconstruction | Nov 8, 2024 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| Hardware and Software Platform Inference | Nov 7, 2024 | GPULarge Language Model | —Unverified | 0 |
| SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models | Nov 7, 2024 | GPUQuantization | CodeCode Available | 4 |
| Brain Tumour Removing and Missing Modality Generation using 3D WDM | Nov 7, 2024 | GPUPrediction | CodeCode Available | 2 |
| LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration | Nov 6, 2024 | GPUKnowledge Graphs | —Unverified | 0 |
| PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks | Nov 6, 2024 | Binary ClassificationGPU | —Unverified | 0 |
| Reducing Hyperparameter Tuning Costs in ML, Vision and Language Model Training Pipelines via Memoization-Awareness | Nov 6, 2024 | Bayesian OptimizationGPU | CodeCode Available | 0 |
| HRDecoder: High-Resolution Decoder Network for Fundus Image Lesion Segmentation | Nov 6, 2024 | DecoderGPU | CodeCode Available | 1 |
| LiVOS: Light Video Object Segmentation with Gated Linear Matching | Nov 5, 2024 | GPUSemantic Segmentation | CodeCode Available | 1 |
| Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation | Nov 5, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing | Nov 4, 2024 | Computational EfficiencyGPU | CodeCode Available | 2 |
| Context Parallelism for Scalable Million-Token Inference | Nov 4, 2024 | GPULanguage Modeling | —Unverified | 0 |
| DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Nov 4, 2024 | GPURobot Manipulation | CodeCode Available | 2 |
| xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism | Nov 4, 2024 | GPU | CodeCode Available | 7 |
| "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization | Nov 4, 2024 | GPULarge Language Model | —Unverified | 0 |
| RAGViz: Diagnose and Visualize Retrieval-Augmented Generation | Nov 4, 2024 | Answer GenerationGPU | CodeCode Available | 2 |
| Stochastic Communication Avoidance for Recommendation Systems | Nov 3, 2024 | Federated LearningGPU | —Unverified | 0 |
| CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks | Nov 2, 2024 | GPU | —Unverified | 0 |
| Fast and Memory-Efficient Video Diffusion Using Streamlined Inference | Nov 2, 2024 | GPUVideo Generation | CodeCode Available | 1 |
| NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference | Nov 2, 2024 | Code GenerationCPU | CodeCode Available | 0 |
| Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models | Nov 2, 2024 | GPU | —Unverified | 0 |
| Computation-Aware Gaussian Processes: Model Selection And Linear-Time Inference | Nov 1, 2024 | Decision MakingGaussian Processes | —Unverified | 0 |
| HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices | Nov 1, 2024 | Autonomous DrivingGPU | CodeCode Available | 0 |