| Fully Differentiable Lagrangian Convolutional Neural Network for Continuity-Consistent Physics-Informed Precipitation Nowcasting | Feb 16, 2024 | GPU | —Unverified | 0 |
| Evaluating Neural Radiance Fields (NeRFs) for 3D Plant Geometry Reconstruction in Field Conditions | Feb 15, 2024 | 3D ReconstructionGPU | —Unverified | 0 |
| ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers | Feb 15, 2024 | GPU | —Unverified | 0 |
| QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference | Feb 15, 2024 | GPUQuantization | CodeCode Available | 2 |
| Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish | Feb 15, 2024 | DecoderGPU | —Unverified | 0 |
| BitDelta: Your Fine-Tune May Only Be Worth One Bit | Feb 15, 2024 | GPU | CodeCode Available | 3 |
| Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Feb 15, 2024 | GPUReinforcement Learning (RL) | CodeCode Available | 1 |
| TinyCL: An Efficient Hardware Architecture for Continual Learning on Autonomous Systems | Feb 15, 2024 | Continual LearningGPU | —Unverified | 0 |
| MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech | Feb 14, 2024 | DecoderGPU | —Unverified | 0 |
| Listening to Multi-talker Conversations: Modular and End-to-end Perspectives | Feb 14, 2024 | GPUspeaker-diarization | —Unverified | 0 |
| Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning | Feb 14, 2024 | GPU | —Unverified | 0 |
| MLTCP: Congestion Control for DNN Training | Feb 14, 2024 | GPU | —Unverified | 0 |
| DisGNet: A Distance Graph Neural Network for Forward Kinematics Learning of Gough-Stewart Platform | Feb 14, 2024 | GPUGraph Neural Network | CodeCode Available | 0 |
| Stochastic Spiking Attention: Accelerating Attention with Stochastic Computing in Spiking Networks | Feb 14, 2024 | GPU | —Unverified | 0 |
| HyCubE: Efficient Knowledge Hypergraph 3D Circular Convolutional Embedding | Feb 14, 2024 | GPUhypergraph embedding | —Unverified | 0 |
| Multi-Level GNN Preconditioner for Solving Large Scale Problems | Feb 13, 2024 | GPU | —Unverified | 0 |
| Graph Feature Preprocessor: Real-time Subgraph-based Feature Extraction for Financial Crime Detection | Feb 13, 2024 | CPUGPU | —Unverified | 0 |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Feb 12, 2024 | Computational EfficiencyCPU | CodeCode Available | 0 |
| Anchor-based Large Language Models | Feb 12, 2024 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT | Feb 12, 2024 | BenchmarkingChunking | —Unverified | 0 |
| The I/O Complexity of Attention, or How Optimal is Flash Attention? | Feb 12, 2024 | GPU | —Unverified | 0 |
| Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems | Feb 12, 2024 | GPUobject-detection | CodeCode Available | 0 |
| Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Feb 10, 2024 | CPUGPU | CodeCode Available | 3 |
| Cardiac ultrasound simulation for autonomous ultrasound navigation | Feb 9, 2024 | DiagnosticGPU | —Unverified | 0 |
| On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference | Feb 9, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Anatomizing Deep Learning Inference in Web Browsers | Feb 8, 2024 | CPUDeep Learning | —Unverified | 0 |
| Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes | Feb 8, 2024 | GPU | CodeCode Available | 1 |
| On the Convergence of Zeroth-Order Federated Tuning for Large Language Models | Feb 8, 2024 | Federated LearningGPU | —Unverified | 0 |
| Improving Token-Based World Models with Parallel Observation Prediction | Feb 8, 2024 | GPUPrediction | CodeCode Available | 1 |
| TASER: Temporal Adaptive Sampling for Fast and Accurate Dynamic Graph Representation Learning | Feb 8, 2024 | DenoisingFraud Detection | CodeCode Available | 1 |
| A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction | Feb 7, 2024 | AvgCPU | CodeCode Available | 1 |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Feb 7, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space | Feb 7, 2024 | Concept AlignmentGPU | CodeCode Available | 2 |
| Graph convolutional network as a fast statistical emulator for numerical ice sheet modeling | Feb 7, 2024 | GPUGraph Attention | —Unverified | 0 |
| JAX-Fluids 2.0: Towards HPC for Differentiable CFD of Compressible Two-phase Flows | Feb 7, 2024 | GPU | CodeCode Available | 4 |
| EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss | Feb 7, 2024 | DecoderGPU | —Unverified | 0 |
| BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Feb 7, 2024 | GPUNeRF | —Unverified | 0 |
| Fast Timing-Conditioned Latent Audio Diffusion | Feb 7, 2024 | Audio GenerationGPU | CodeCode Available | 7 |
| BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Feb 6, 2024 | BinarizationGPU | CodeCode Available | 3 |
| Towards Deterministic End-to-end Latency for Medical AI Systems in NVIDIA Holoscan | Feb 6, 2024 | Edge-computingGPU | —Unverified | 0 |
| EscherNet: A Generative Model for Scalable View Synthesis | Feb 6, 2024 | 3D ReconstructionGPU | CodeCode Available | 3 |
| torchmSAT: A GPU-Accelerated Approximation To The Maximum Satisfiability Problem | Feb 6, 2024 | Combinatorial OptimizationGPU | —Unverified | 0 |
| Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning | Feb 6, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| Approximation Rates and VC-Dimension Bounds for (P)ReLU MLP Mixture of Experts | Feb 5, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| Single-GPU GNN Systems: Traps and Pitfalls | Feb 5, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| Time-, Memory- and Parameter-Efficient Visual Adaptation | Feb 5, 2024 | GPUVideo Classification | —Unverified | 0 |
| 4D-Rotor Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic Scenes | Feb 5, 2024 | GPUNovel View Synthesis | CodeCode Available | 2 |
| GPU-Accelerated 3D Polygon Visibility Volumes for Synergistic Perception and Navigation | Feb 5, 2024 | GPU | —Unverified | 0 |
| Spin: An Efficient Secure Computation Framework with GPU Acceleration | Feb 4, 2024 | CPUGPU | —Unverified | 0 |
| DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms in Vision Transformers | Feb 4, 2024 | Adversarial AttackGPU | CodeCode Available | 0 |