| Online Energy Optimization in GPUs: A Multi-Armed Bandit Approach | Oct 3, 2024 | energy management, GPU | Code Available | 0 |
| LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services | Oct 3, 2024 | Benchmarking, GPU | Code Available | 1 |
| Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping | Oct 3, 2024 | GPU, Mixture-of-Experts | Unverified | 0 |
| Learning from Offline Foundation Features with Tensor Augmentations | Oct 3, 2024 | GPU | Unverified | 0 |
| LLMCO2: Advancing Accurate Carbon Footprint Prediction for LLM Inferences | Oct 3, 2024 | GPU, Graph Neural Network | Unverified | 0 |
| Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network | Oct 3, 2024 | GPU, Real-Time Semantic Segmentation | Unverified | 0 |
| Contextual Document Embeddings | Oct 3, 2024 | Contrastive Learning, Document Embedding | Unverified | 0 |
| An Efficient Inference Frame for SMLM (Single-Molecule Localization Microscopy) | Oct 3, 2024 | Deep Learning, GPU | Code Available | 0 |
| Depth Pro: Sharp Monocular Metric Depth in Less Than a Second | Oct 2, 2024 | Depth Estimation, GPU | Code Available | 9 |
| Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Oct 2, 2024 | GPU, Language Modeling | Code Available | 1 |