| AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | Nov 19, 2024 | GPUQuestion Answering | —Unverified | 0 |
| Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Nov 19, 2024 | 3D GenerationGPU | —Unverified | 0 |
| Modeling Multivariable High-resolution 3D Urban Microclimate Using Localized Fourier Neural Operator | Nov 18, 2024 | GPU | —Unverified | 0 |
| Graph Retention Networks for Dynamic Graphs | Nov 18, 2024 | GPUGraph Learning | CodeCode Available | 0 |
| MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs | Nov 18, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models | Nov 18, 2024 | GPU | —Unverified | 0 |
| Towards Accurate and Efficient Sub-8-Bit Integer Training | Nov 17, 2024 | CPUGPU | —Unverified | 0 |
| NeuroNURBS: Learning Efficient Surface Representations for 3D Solids | Nov 16, 2024 | GPURepresentation Learning | —Unverified | 0 |
| Improving training time and GPU utilization in geo-distributed language model training | Nov 16, 2024 | GPULanguage Modeling | —Unverified | 0 |
| MDHP-Net: Detecting an Emerging Time-exciting Threat in IVN | Nov 15, 2024 | DiagnosticGPU | —Unverified | 0 |
| TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models | Nov 15, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Pie: Pooling CPU Memory for LLM Inference | Nov 14, 2024 | CPUGPU | —Unverified | 0 |
| SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Nov 13, 2024 | Decision MakingGPU | CodeCode Available | 0 |
| Optimizing LLM Inference for Database Systems: Cost-Aware Scheduling for Concurrent Requests | Nov 12, 2024 | Decision MakingGPU | —Unverified | 0 |
| On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction | Nov 12, 2024 | DeblurringGPU | —Unverified | 0 |
| FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training | Nov 12, 2024 | GPU | CodeCode Available | 0 |
| OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model | Nov 11, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator | Nov 10, 2024 | GPULanguage Modeling | —Unverified | 0 |
| KeyB2: Selecting Key Blocks is Also Important for Long Document Ranking with Large Language Models | Nov 9, 2024 | Document RankingGPU | —Unverified | 0 |
| Benchmarking 3D multi-coil NC-PDNet MRI reconstruction | Nov 8, 2024 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| Hardware and Software Platform Inference | Nov 7, 2024 | GPULarge Language Model | —Unverified | 0 |
| PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks | Nov 6, 2024 | Binary ClassificationGPU | —Unverified | 0 |
| Reducing Hyperparameter Tuning Costs in ML, Vision and Language Model Training Pipelines via Memoization-Awareness | Nov 6, 2024 | Bayesian OptimizationGPU | CodeCode Available | 0 |
| LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration | Nov 6, 2024 | GPUKnowledge Graphs | —Unverified | 0 |
| Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation | Nov 5, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization | Nov 4, 2024 | GPULarge Language Model | —Unverified | 0 |
| Context Parallelism for Scalable Million-Token Inference | Nov 4, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Stochastic Communication Avoidance for Recommendation Systems | Nov 3, 2024 | Federated LearningGPU | —Unverified | 0 |
| NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference | Nov 2, 2024 | Code GenerationCPU | CodeCode Available | 0 |
| Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models | Nov 2, 2024 | GPU | —Unverified | 0 |
| CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks | Nov 2, 2024 | GPU | —Unverified | 0 |
| HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices | Nov 1, 2024 | Autonomous DrivingGPU | CodeCode Available | 0 |
| Computation-Aware Gaussian Processes: Model Selection And Linear-Time Inference | Nov 1, 2024 | Decision MakingGaussian Processes | —Unverified | 0 |
| A Novel Breast Ultrasound Image Augmentation Method Using Advanced Neural Style Transfer: An Efficient and Explainable Approach | Oct 31, 2024 | GPUImage Augmentation | —Unverified | 0 |
| Cycle-Constrained Adversarial Denoising Convolutional Network for PET Image Denoising: Multi-Dimensional Validation on Large Datasets with Reader Study and Real Low-Dose Data | Oct 31, 2024 | DenoisingGPU | —Unverified | 0 |
| Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware | Oct 31, 2024 | GPUProgram Synthesis | CodeCode Available | 0 |
| Context-Aware Token Selection and Packing for Enhanced Vision Transformer | Oct 31, 2024 | GPUobject-detection | —Unverified | 0 |
| ProMoE: Fast MoE-based LLM Serving using Proactive Caching | Oct 29, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| Application of Audio Fingerprinting Techniques for Real-Time Scalable Speech Retrieval and Speech Clusterization | Oct 29, 2024 | GPURetrieval | —Unverified | 0 |
| Memory-Efficient Point Cloud Registration via Overlapping Region Sampling | Oct 29, 2024 | GPUPoint Cloud Registration | —Unverified | 0 |
| A Message Passing Neural Network Surrogate Model for Bond-Associated Peridynamic Material Correspondence Formulation | Oct 29, 2024 | GPU | —Unverified | 0 |
| Revisiting Reliability in Large-Scale Machine Learning Research Clusters | Oct 29, 2024 | GPU | —Unverified | 0 |
| AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks | Oct 29, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Motion Graph Unleashed: A Novel Approach to Video Prediction | Oct 29, 2024 | GPUOptical Flow Estimation | CodeCode Available | 0 |
| Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs | Oct 29, 2024 | GPURecommendation Systems | CodeCode Available | 0 |
| VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration | Oct 29, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows | Oct 28, 2024 | CPUGPU | —Unverified | 0 |
| FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge | Oct 28, 2024 | GPU | CodeCode Available | 0 |
| Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading | Oct 26, 2024 | CPUGPU | CodeCode Available | 0 |
| Computational Bottlenecks of Training Small-scale Large Language Models | Oct 25, 2024 | GPULanguage Modeling | —Unverified | 0 |