| Efficient fine-tuning of 37-level GraphCast with the Canadian global deterministic analysis | Aug 26, 2024 | GPU | CodeCode Available | 1 |
| Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects | Aug 26, 2024 | GPU | —Unverified | 0 |
| Theoretical Proportion Label Perturbation for Learning from Label Proportions in Large Bags | Aug 26, 2024 | GPUWeakly-supervised Learning | CodeCode Available | 0 |
| More Pictures Say More: Visual Intersection Network for Open Set Object Detection | Aug 26, 2024 | GPUobject-detection | —Unverified | 0 |
| Quantum-Powered Personalized Learning | Aug 25, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection | Aug 25, 2024 | 3D Object DetectionGPU | —Unverified | 0 |
| Batch-FPM: Random batch-update multi-parameter physical Fourier ptychography neural network | Aug 25, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| HGNAS: Hardware-Aware Graph Neural Architecture Search for Edge Devices | Aug 23, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points | Aug 23, 2024 | 3D Reconstruction4D reconstruction | CodeCode Available | 1 |
| Energy-Efficient Spiking Recurrent Neural Network for Gesture Recognition on Embedded GPUs | Aug 23, 2024 | Edge-computingGesture Recognition | —Unverified | 0 |
| Exploiting Student Parallelism for Low-latency GPU Inference of BERT-like Models in Online Services | Aug 22, 2024 | GPU | —Unverified | 0 |
| PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators | Aug 22, 2024 | CPUGPU | —Unverified | 0 |
| Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Aug 21, 2024 | GPUImage Retrieval | —Unverified | 0 |
| Mixed Sparsity Training: Achieving 4 FLOP Reduction for Transformer Pretraining | Aug 21, 2024 | GPU | —Unverified | 0 |
| MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models | Aug 21, 2024 | GPUQuantization | CodeCode Available | 5 |
| Vision HgNN: An Electron-Micrograph is Worth Hypergraph of Hypernodes | Aug 21, 2024 | GPU | —Unverified | 0 |
| Slicing Input Features to Accelerate Deep Learning: A Case Study with Graph Neural Networks | Aug 21, 2024 | GPUGraph Learning | —Unverified | 0 |
| EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Aug 21, 2024 | 3D Instance SegmentationGPU | CodeCode Available | 4 |
| Practical Aspects on Solving Differential Equations Using Deep Learning: A Primer | Aug 21, 2024 | Deep LearningGPU | CodeCode Available | 0 |
| deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural Networks | Aug 20, 2024 | GPUImage Registration | CodeCode Available | 2 |
| UKAN: Unbound Kolmogorov-Arnold Network Accompanied with Accelerated Library | Aug 20, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining | Aug 20, 2024 | 3DGSGPU | —Unverified | 0 |
| Fine-Tuning a Local LLaMA-3 Large Language Model for Automated Privacy-Preserving Physician Letter Generation in Radiation Oncology | Aug 20, 2024 | GPULanguage Modeling | —Unverified | 0 |
| EdgeNAT: Transformer for Efficient Edge Detection | Aug 20, 2024 | Edge DetectionGPU | CodeCode Available | 1 |
| HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models | Aug 20, 2024 | GPULanguage Modelling | CodeCode Available | 1 |
| Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches | Aug 20, 2024 | GPUModel Compression | —Unverified | 0 |
| Near, far: Patch-ordering enhances vision foundation models' scene understanding | Aug 20, 2024 | GPUScene Understanding | —Unverified | 0 |
| LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models | Aug 20, 2024 | GPU | CodeCode Available | 0 |
| Accelerating Goal-Conditioned RL Algorithms and Research | Aug 20, 2024 | GPUreinforcement-learning | CodeCode Available | 3 |
| Stream-Based Ground Segmentation for Real-Time LiDAR Point Cloud Processing on FPGA | Aug 19, 2024 | CPUGPU | —Unverified | 0 |
| Characteristic Performance Study on Solving Oscillator ODEs via Soft-constrained Physics-informed Neural Network with Small Data | Aug 19, 2024 | CPUGPU | CodeCode Available | 0 |
| MoDeGPT: Modular Decomposition for Large Language Model Compression | Aug 19, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Liquid Fourier Latent Dynamics Networks for fast GPU-based numerical simulations in computational cardiology | Aug 19, 2024 | GPU | CodeCode Available | 0 |
| SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training | Aug 19, 2024 | GPULanguage Modeling | —Unverified | 0 |
| TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition | Aug 19, 2024 | GPUMulti-Task Learning | CodeCode Available | 0 |
| Demystifying the Communication Characteristics for Distributed Transformer Models | Aug 19, 2024 | Audio GenerationGPU | —Unverified | 0 |
| Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs | Aug 18, 2024 | DiversityGPU | —Unverified | 0 |
| ELASTIC: Efficient Linear Attention for Sequential Interest Compression | Aug 18, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Aug 16, 2024 | GPUModel Compression | CodeCode Available | 3 |
| Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems | Aug 14, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Aug 14, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Review Learning: Advancing All-in-One Ultra-High-Definition Image Restoration Training Method | Aug 13, 2024 | 4kAll | —Unverified | 0 |
| Bridging LLMs and KGs without Fine-Tuning: Intermediate Probing Meets Subgraph-Aware Entity Descriptions | Aug 13, 2024 | GPUKnowledge Graph Completion | —Unverified | 0 |
| Breast-NET: a lightweight DCNN model for breast cancer detection and grading using histological samples | Aug 10, 2024 | Breast Cancer DetectionBreast Cancer Histology Image Classification | CodeCode Available | 0 |
| LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale | Aug 10, 2024 | GPULanguage Modelling | CodeCode Available | 3 |
| A Versatile Framework for Attributed Network Clustering via K-Nearest Neighbor Augmentation | Aug 10, 2024 | AttributeClustering | CodeCode Available | 0 |
| UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling | Aug 9, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Aug 9, 2024 | Contrastive LearningData Augmentation | CodeCode Available | 0 |
| Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications | Aug 9, 2024 | Deep LearningGPU | CodeCode Available | 0 |
| An Edge AI System Based on FPGA Platform for Railway Fault Detection | Aug 8, 2024 | CPUFault Detection | —Unverified | 0 |