| Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Aug 14, 2024 | GPU, Language Modeling | Unverified | 0 |
| Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method | Aug 13, 2024 | 4k, All | Unverified | 0 |
| Bridging LLMs and KGs without Fine-Tuning: Intermediate Probing Meets Subgraph-Aware Entity Descriptions | Aug 13, 2024 | GPU, Knowledge Graph Completion | Unverified | 0 |
| Breast-NET: a lightweight DCNN model for breast cancer detection and grading using histological samples | Aug 10, 2024 | Breast Cancer Detection, Breast Cancer Histology Image Classification | Code Available | 0 |
| LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Aug 10, 2024 | GPU, Language Modelling | Code Available | 3 |
| A Versatile Framework for Attributed Network Clustering via K-Nearest Neighbor Augmentation | Aug 10, 2024 | Attribute, Clustering | Code Available | 0 |
| UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling | Aug 9, 2024 | GPU, Language Modeling | Code Available | 3 |
| reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Aug 9, 2024 | Contrastive Learning, Data Augmentation | Code Available | 0 |
| Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications | Aug 9, 2024 | Deep Learning, GPU | Code Available | 0 |
| An Edge AI System Based on FPGA Platform for Railway Fault Detection | Aug 8, 2024 | CPU, Fault Detection | Unverified | 0 |