| Pruner: A Speculative Exploration Mechanism to Accelerate Tensor Program Tuning | Feb 4, 2024 | GPUTransfer Learning | CodeCode Available | 1 |
| Scalable and Efficient Temporal Graph Representation Learning via Forward Recent Sampling | Feb 3, 2024 | GPUGraph Representation Learning | CodeCode Available | 0 |
| Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks | Feb 3, 2024 | GPUMolecular Property Prediction | CodeCode Available | 1 |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Feb 2, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| PRIME: Protect Your Videos From Malicious Editing | Feb 2, 2024 | GPU | CodeCode Available | 0 |
| Faster Inference of Integer SWIN Transformer by Removing the GELU Activation | Feb 2, 2024 | GPUimage-classification | —Unverified | 0 |
| Enriched Physics-informed Neural Networks for Dynamic Poisson-Nernst-Planck Systems | Feb 1, 2024 | GPU | —Unverified | 0 |
| An Accurate and Low-Parameter Machine Learning Architecture for Next Location Prediction | Feb 1, 2024 | GPUPrediction | —Unverified | 0 |
| Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces | Feb 1, 2024 | Computational EfficiencyGPU | CodeCode Available | 3 |
| KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | Jan 31, 2024 | GPUQuantization | CodeCode Available | 3 |