| Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches | Aug 20, 2024 | GPUModel Compression | —Unverified | 0 |
| Fine-Tuning a Local LLaMA-3 Large Language Model for Automated Privacy-Preserving Physician Letter Generation in Radiation Oncology | Aug 20, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Characteristic Performance Study on Solving Oscillator ODEs via Soft-constrained Physics-informed Neural Network with Small Data | Aug 19, 2024 | CPUGPU | CodeCode Available | 0 |
| Stream-Based Ground Segmentation for Real-Time LiDAR Point Cloud Processing on FPGA | Aug 19, 2024 | CPUGPU | —Unverified | 0 |
| MoDeGPT: Modular Decomposition for Large Language Model Compression | Aug 19, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Demystifying the Communication Characteristics for Distributed Transformer Models | Aug 19, 2024 | Audio GenerationGPU | —Unverified | 0 |
| SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training | Aug 19, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Liquid Fourier Latent Dynamics Networks for fast GPU-based numerical simulations in computational cardiology | Aug 19, 2024 | GPU | CodeCode Available | 0 |
| TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition | Aug 19, 2024 | GPUMulti-Task Learning | CodeCode Available | 0 |
| ELASTIC: Efficient Linear Attention for Sequential Interest Compression | Aug 18, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs | Aug 18, 2024 | DiversityGPU | —Unverified | 0 |
| Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Aug 14, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems | Aug 14, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Bridging LLMs and KGs without Fine-Tuning: Intermediate Probing Meets Subgraph-Aware Entity Descriptions | Aug 13, 2024 | GPUKnowledge Graph Completion | —Unverified | 0 |
| Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method | Aug 13, 2024 | 4kAll | —Unverified | 0 |
| Breast-NET: a lightweight DCNN model for breast cancer detection and grading using histological samples | Aug 10, 2024 | Breast Cancer DetectionBreast Cancer Histology Image Classification | CodeCode Available | 0 |
| A Versatile Framework for Attributed Network Clustering via K-Nearest Neighbor Augmentation | Aug 10, 2024 | AttributeClustering | CodeCode Available | 0 |
| reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Aug 9, 2024 | Contrastive LearningData Augmentation | CodeCode Available | 0 |
| Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications | Aug 9, 2024 | Deep LearningGPU | CodeCode Available | 0 |
| An Edge AI System Based on FPGA Platform for Railway Fault Detection | Aug 8, 2024 | CPUFault Detection | —Unverified | 0 |
| Understanding the Performance and Estimating the Cost of LLM Fine-Tuning | Aug 8, 2024 | GPUMixture-of-Experts | CodeCode Available | 0 |
| Design of a Quality Management System based on the EU Artificial Intelligence Act | Aug 8, 2024 | Document AIGPU | CodeCode Available | 0 |
| Sparse Spiking Neural-like Membrane Systems on Graphics Processing Units | Aug 8, 2024 | GPU | CodeCode Available | 0 |
| Optimization-Driven Adaptive Experimentation | Aug 8, 2024 | GPUThompson Sampling | —Unverified | 0 |
| Arctic-TILT. Business Document Understanding at Sub-Billion Scale | Aug 8, 2024 | document understandingGPU | —Unverified | 0 |