| Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension | Nov 20, 2024 | GPU, MME | Code Available | 3 |
| REDUCIO! Generating 1024×1024 Video within 16 Seconds using Extremely Compressed Motion Latents | Nov 20, 2024 | GPU, Video Generation | Code Available | 3 |
| Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting | Nov 19, 2024 | 3D Generation, GPU | Unverified | 0 |
| Faster Multi-GPU Training with PPLL: A Pipeline Parallelism Framework Leveraging Local Learning | Nov 19, 2024 | GPU | Unverified | 0 |
| AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction | Nov 19, 2024 | GPU, Question Answering | Unverified | 0 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs | Nov 18, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| Modeling Multivariable High-resolution 3D Urban Microclimate Using Localized Fourier Neural Operator | Nov 18, 2024 | GPU | Unverified | 0 |
| Graph Retention Networks for Dynamic Graphs | Nov 18, 2024 | GPU, Graph Learning | Code Available | 0 |
| LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models | Nov 18, 2024 | GPU | Unverified | 0 |
| Towards Accurate and Efficient Sub-8-Bit Integer Training | Nov 17, 2024 | CPU, GPU | Unverified | 0 |
| Improving training time and GPU utilization in geo-distributed language model training | Nov 16, 2024 | GPU, Language Modeling | Unverified | 0 |
| NeuroNURBS: Learning Efficient Surface Representations for 3D Solids | Nov 16, 2024 | GPU, Representation Learning | Unverified | 0 |
| TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models | Nov 15, 2024 | GPU, Language Modeling | Unverified | 0 |
| MDHP-Net: Detecting an Emerging Time-exciting Threat in IVN | Nov 15, 2024 | Diagnostic, GPU | Unverified | 0 |
| Pie: Pooling CPU Memory for LLM Inference | Nov 14, 2024 | CPU, GPU | Unverified | 0 |
| SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Nov 13, 2024 | Decision Making, GPU | Code Available | 0 |
| On Adapting Randomized Nyström Preconditioners to Accelerate Variational Image Reconstruction | Nov 12, 2024 | Deblurring, GPU | Unverified | 0 |
| FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training | Nov 12, 2024 | GPU | Code Available | 0 |
| Optimizing LLM Inference for Database Systems: Cost-Aware Scheduling for Concurrent Requests | Nov 12, 2024 | Decision Making, GPU | Unverified | 0 |
| ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Nov 11, 2024 | GPU, Language Modeling | Code Available | 1 |
| GPU-Accelerated Inverse Lithography Towards High Quality Curvy Mask Generation | Nov 11, 2024 | GPU | Code Available | 1 |
| OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model | Nov 11, 2024 | GPU, Language Modeling | Unverified | 0 |
| AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models | Nov 11, 2024 | Audio Super-Resolution, GPU | Code Available | 2 |
| Accelerating Large Language Model Training with 4D Parallelism and Memory Consumption Estimator | Nov 10, 2024 | GPU, Language Modeling | Unverified | 0 |