| Anatomizing Deep Learning Inference in Web Browsers | Feb 8, 2024 | CPUDeep Learning | —Unverified | 0 |
| Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes | Feb 8, 2024 | GPU | CodeCode Available | 1 |
| On the Convergence of Zeroth-Order Federated Tuning for Large Language Models | Feb 8, 2024 | Federated LearningGPU | —Unverified | 0 |
| Improving Token-Based World Models with Parallel Observation Prediction | Feb 8, 2024 | GPUPrediction | CodeCode Available | 1 |
| TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph Representation Learning | Feb 8, 2024 | DenoisingFraud Detection | CodeCode Available | 1 |
| A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction | Feb 7, 2024 | AvgCPU | CodeCode Available | 1 |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Feb 7, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space | Feb 7, 2024 | Concept AlignmentGPU | CodeCode Available | 2 |
| Graph convolutional network as a fast statistical emulator for numerical ice sheet modeling | Feb 7, 2024 | GPUGraph Attention | —Unverified | 0 |
| JAX-Fluids 2.0: Towards HPC for Differentiable CFD of Compressible Two-phase Flows | Feb 7, 2024 | GPU | CodeCode Available | 4 |
| EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss | Feb 7, 2024 | DecoderGPU | —Unverified | 0 |
| BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Feb 7, 2024 | GPUNeRF | —Unverified | 0 |
| Fast Timing-Conditioned Latent Audio Diffusion | Feb 7, 2024 | Audio GenerationGPU | CodeCode Available | 7 |
| BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Feb 6, 2024 | BinarizationGPU | CodeCode Available | 3 |
| Towards Deterministic End-to-end Latency for Medical AI Systems in NVIDIA Holoscan | Feb 6, 2024 | Edge-computingGPU | —Unverified | 0 |
| EscherNet: A Generative Model for Scalable View Synthesis | Feb 6, 2024 | 3D ReconstructionGPU | CodeCode Available | 3 |
| torchmSAT: A GPU-Accelerated Approximation To The Maximum Satisfiability Problem | Feb 6, 2024 | Combinatorial OptimizationGPU | —Unverified | 0 |
| Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning | Feb 6, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts | Feb 5, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| Single-GPU GNN Systems: Traps and Pitfalls | Feb 5, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| Time-, Memory- and Parameter-Efficient Visual Adaptation | Feb 5, 2024 | GPUVideo Classification | —Unverified | 0 |
| 4D-Rotor Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic Scenes | Feb 5, 2024 | GPUNovel View Synthesis | CodeCode Available | 2 |
| GPU-Accelerated 3D Polygon Visibility Volumes for Synergistic Perception and Navigation | Feb 5, 2024 | GPU | —Unverified | 0 |
| Spin: An Efficient Secure Computation Framework with GPU Acceleration | Feb 4, 2024 | CPUGPU | —Unverified | 0 |
| DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms in Vision Transformers | Feb 4, 2024 | Adversarial AttackGPU | CodeCode Available | 0 |