| KVDirect: Distributed Disaggregated LLM Inference | Dec 13, 2024 | GPU, Scheduling | Unverified | 0 |
| LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | Dec 13, 2024 | GPU, Mamba | Unverified | 0 |
| HashEvict: A Pre-Attention KV Cache Eviction Strategy using Locality-Sensitive Hashing | Dec 13, 2024 | GPU, Multiple-choice | Unverified | 0 |
| Stellar parameter prediction and spectral simulation using machine learning | Dec 12, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| Benchmarking of GPU-optimized Quantum-Inspired Evolutionary Optimization Algorithm using Functional Analysis | Dec 12, 2024 | Benchmarking, GPU | Unverified | 0 |
| All You Need in Knowledge Distillation Is a Tailored Coordinate System | Dec 12, 2024 | All, Few-Shot Learning | Unverified | 0 |
| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | Dec 12, 2024 | Diversity, GPU | Unverified | 0 |
| Dimensionality Reduction Techniques for Global Bayesian Optimisation | Dec 12, 2024 | Bayesian Optimisation, Dimensionality Reduction | Unverified | 0 |
| COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework | Dec 11, 2024 | GPU, Language Modeling | Unverified | 0 |
| Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Dec 11, 2024 | GPU, Image Segmentation | Code Available | 0 |
| Protecting Confidentiality, Privacy and Integrity in Collaborative Learning | Dec 11, 2024 | CPU, GPU | Unverified | 0 |
| Low-Latency Scalable Streaming for Event-Based Vision | Dec 10, 2024 | Event-based vision, GPU | Unverified | 0 |
| CEEMS: A Resource Manager Agnostic Energy and Emissions Monitoring Stack | Dec 10, 2024 | CPU, GPU | Unverified | 0 |
| Machine learning-driven conservative-to-primitive conversion in hybrid piecewise polytropic and tabulated equations of state | Dec 10, 2024 | CPU, GPU | Unverified | 0 |
| Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds | Dec 10, 2024 | GPU, Texture Synthesis | Unverified | 0 |
| LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models | Dec 10, 2024 | 3D Reconstruction, GPU | Unverified | 0 |
| From Slow Bidirectional to Fast Autoregressive Video Diffusion Models | Dec 10, 2024 | GPU, Video Generation | Unverified | 0 |
| Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios | Dec 9, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| Improving text-conditioned latent diffusion for cancer pathology | Dec 9, 2024 | GPU, Synthetic Data Generation | Code Available | 0 |
| Flexible and Scalable Deep Dendritic Spiking Neural Networks with Multiple Nonlinear Branching | Dec 9, 2024 | Few-Shot Learning, GPU | Unverified | 0 |
| ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance | Dec 9, 2024 | Denoising, GPU | Unverified | 0 |
| Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression | Dec 7, 2024 | GPU | Unverified | 0 |
| Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs | Dec 6, 2024 | Code Generation, Deep Learning | Unverified | 0 |
| GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments | Dec 6, 2024 | GPU | Unverified | 0 |
| Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection | Dec 6, 2024 | GPU, Multi-Object Tracking | Unverified | 0 |