| DisGNet: A Distance Graph Neural Network for Forward Kinematics Learning of Gough-Stewart Platform | Feb 14, 2024 | GPUGraph Neural Network | CodeCode Available | 0 |
| Stochastic Spiking Attention: Accelerating Attention with Stochastic Computing in Spiking Networks | Feb 14, 2024 | GPU | —Unverified | 0 |
| Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning | Feb 14, 2024 | GPU | —Unverified | 0 |
| MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech | Feb 14, 2024 | DecoderGPU | —Unverified | 0 |
| Listening to Multi-talker Conversations: Modular and End-to-end Perspectives | Feb 14, 2024 | GPUspeaker-diarization | —Unverified | 0 |
| MLTCP: Congestion Control for DNN Training | Feb 14, 2024 | GPU | —Unverified | 0 |
| Multi-Level GNN Preconditioner for Solving Large Scale Problems | Feb 13, 2024 | GPU | —Unverified | 0 |
| Graph Feature Preprocessor: Real-time Subgraph-based Feature Extraction for Financial Crime Detection | Feb 13, 2024 | CPUGPU | —Unverified | 0 |
| Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT | Feb 12, 2024 | BenchmarkingChunking | —Unverified | 0 |
| The I/O Complexity of Attention, or How Optimal is Flash Attention? | Feb 12, 2024 | GPU | —Unverified | 0 |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Feb 12, 2024 | Computational EfficiencyCPU | CodeCode Available | 0 |
| Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems | Feb 12, 2024 | GPUobject-detection | CodeCode Available | 0 |
| Cardiac ultrasound simulation for autonomous ultrasound navigation | Feb 9, 2024 | DiagnosticGPU | —Unverified | 0 |
| Anatomizing Deep Learning Inference in Web Browsers | Feb 8, 2024 | CPUDeep Learning | —Unverified | 0 |
| On the Convergence of Zeroth-Order Federated Tuning for Large Language Models | Feb 8, 2024 | Federated LearningGPU | —Unverified | 0 |
| Graph convolutional network as a fast statistical emulator for numerical ice sheet modeling | Feb 7, 2024 | GPUGraph Attention | —Unverified | 0 |
| BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Feb 7, 2024 | GPUNeRF | —Unverified | 0 |
| EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss | Feb 7, 2024 | DecoderGPU | —Unverified | 0 |
| Towards Deterministic End-to-end Latency for Medical AI Systems in NVIDIA Holoscan | Feb 6, 2024 | Edge-computingGPU | —Unverified | 0 |
| torchmSAT: A GPU-Accelerated Approximation To The Maximum Satisfiability Problem | Feb 6, 2024 | Combinatorial OptimizationGPU | —Unverified | 0 |
| Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning | Feb 6, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| Single-GPU GNN Systems: Traps and Pitfalls | Feb 5, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| Time-, Memory- and Parameter-Efficient Visual Adaptation | Feb 5, 2024 | GPUVideo Classification | —Unverified | 0 |
| GPU-Accelerated 3D Polygon Visibility Volumes for Synergistic Perception and Navigation | Feb 5, 2024 | GPU | —Unverified | 0 |
| Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts | Feb 5, 2024 | GPUMixture-of-Experts | —Unverified | 0 |