| VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings | Oct 2, 2024 | GPU, Graph Attention | Unverified | 0 |
| TorchSISSO: A PyTorch-Based Implementation of the Sure Independence Screening and Sparsifying Operator for Efficient and Interpretable Model Discovery | Oct 2, 2024 | GPU, Model Discovery | Code Available | 1 |
| Lotus: learning-based online thermal and latency variation management for two-stage detectors on edge devices | Oct 1, 2024 | CPU, Deep Reinforcement Learning | Code Available | 0 |
| ROK Defense M&S in the Age of Hyperscale AI: Concepts, Challenges, and Future Directions | Oct 1, 2024 | Decision Making, GPU | Unverified | 0 |
| MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards | Oct 1, 2024 | GPU, Mixture-of-Experts | Unverified | 0 |
| STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting | Oct 1, 2024 | GPU | Code Available | 1 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPU, Language Modeling | Code Available | 3 |
| ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI | Oct 1, 2024 | GPU, Imitation Learning | Code Available | 7 |
| Characterizing and Efficiently Accelerating Multimodal Generation Model Inference | Sep 30, 2024 | GPU, multimodal generation | Unverified | 0 |
| HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems | Sep 30, 2024 | GPU, Prediction | Unverified | 0 |
| Simple and Fast Distillation of Diffusion Models | Sep 29, 2024 | GPU, Image Generation | Code Available | 3 |
| Simulation-based inference with the Python Package sbijax | Sep 28, 2024 | Bayesian Inference, CPU | Unverified | 0 |
| Analog In-Memory Computing Attention Mechanism for Fast and Energy-Efficient Large Language Models | Sep 28, 2024 | GPU | Code Available | 1 |
| Gradient-free Decoder Inversion in Latent Diffusion Models | Sep 27, 2024 | Decoder, Denoising | Unverified | 0 |
| TensorSocket: Shared Data Loading for Deep Learning Training | Sep 27, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs | Sep 27, 2024 | GPU, Recommendation Systems | Code Available | 1 |
| DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning | Sep 26, 2024 | Domain Adaptation, GPU | Unverified | 0 |
| Input-Dependent Power Usage in GPUs | Sep 26, 2024 | GPU | Code Available | 0 |
| Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores | Sep 26, 2024 | GPU, Management | Unverified | 0 |
| Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit | Sep 26, 2024 | Action Unit Detection, Arousal Estimation | Unverified | 0 |
| LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Sep 26, 2024 | GPU, NeRF | Code Available | 1 |
| MALPOLON: A Framework for Deep Species Distribution Modeling | Sep 26, 2024 | Benchmarking, GPU | Code Available | 1 |
| Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction | Sep 25, 2024 | GPU, Token Reduction | Code Available | 2 |
| Search for Efficient Large Language Models | Sep 25, 2024 | GPU, Model Compression | Code Available | 1 |
| Efficient and generalizable nested Fourier-DeepONet for three-dimensional geological carbon sequestration | Sep 25, 2024 | GPU | Code Available | 0 |