| A Simple Sparse Matrix Vector Multiplication Approach to Padded Convolution | Nov 29, 2024 | CPU, GPU | Code Available | 0 |
| An Integrated Artificial Intelligence Operating System for Advanced Low-Altitude Aviation Applications | Nov 28, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| Improving Accuracy and Generalization for Efficient Visual Tracking | Nov 28, 2024 | CPU, Test-time Adaptation | Unverified | 0 |
| A Runtime-Adaptive Transformer Neural Network Accelerator on FPGAs | Nov 27, 2024 | Computational Efficiency, CPU | Code Available | 0 |
| KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial Recomputation | Nov 26, 2024 | CPU, GPU | Code Available | 0 |
| A Data-Driven Approach to Dataflow-Aware Online Scheduling for Graph Neural Network Inference | Nov 25, 2024 | CPU, GPU | Unverified | 0 |
| Plastic Arbor: a modern simulation framework for synaptic plasticity – from single synapses to networks of morphological neurons | Nov 25, 2024 | CPU, GPU | Code Available | 0 |
| OPMOS: Ordered Parallel Algorithm for Multi-Objective Shortest-Paths | Nov 25, 2024 | Attribute, CPU | Unverified | 0 |
| SMM-Conv: Scalar Matrix Multiplication with Zero Packing for Accelerated Convolution | Nov 23, 2024 | CPU | Unverified | 0 |
| Deep operator network models for predicting post-burn contraction | Nov 21, 2024 | CPU, GPU | Unverified | 0 |
| Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations | Nov 18, 2024 | CPU | Unverified | 0 |
| MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs | Nov 18, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| Generative AI on the Edge: Architecture and Performance Evaluation | Nov 18, 2024 | CPU, Raspberry Pi 5 | Unverified | 0 |
| Towards Accurate and Efficient Sub-8-Bit Integer Training | Nov 17, 2024 | CPU, GPU | Unverified | 0 |
| Pie: Pooling CPU Memory for LLM Inference | Nov 14, 2024 | CPU, GPU | Unverified | 0 |
| Offline Adaptation of Quadruped Locomotion using Diffusion Models | Nov 13, 2024 | CPU | Code Available | 0 |
| Input-Based Ensemble-Learning Method for Dynamic Memory Configuration of Serverless Computing Functions | Nov 12, 2024 | CPU, Ensemble Learning | Unverified | 0 |
| TinyML Security: Exploring Vulnerabilities in Resource-Constrained Machine Learning Systems | Nov 11, 2024 | CPU, Edge-computing | Unverified | 0 |
| Project Tracyn: Generative Artificial Intelligence based Peripherals Trace Synthesizer | Nov 10, 2024 | CPU | Unverified | 0 |
| P-MOSS: Learned Scheduling For Indexes Over NUMA Servers Using Low-Level Hardware Statistics | Nov 5, 2024 | CPU, Scheduling | Unverified | 0 |
| DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads | Nov 5, 2024 | CPU, Deep Learning | Unverified | 0 |
| Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing | Nov 4, 2024 | CPU | Unverified | 0 |
| AI-Ready Energy Modelling for Next Generation RAN | Nov 4, 2024 | CPU | Code Available | 0 |
| NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference | Nov 2, 2024 | Code Generation, CPU | Code Available | 0 |
| DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge | Oct 31, 2024 | CPU, Scheduling | Unverified | 0 |
| Conditioned quantum-assisted deep generative surrogate for particle-calorimeter interactions | Oct 30, 2024 | CPU | Unverified | 0 |
| Cora: Accelerating Stateful Network Applications with SmartNICs | Oct 29, 2024 | CPU | Unverified | 0 |
| AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks | Oct 29, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows | Oct 28, 2024 | CPU, GPU | Unverified | 0 |
| Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading | Oct 26, 2024 | CPU, GPU | Code Available | 0 |
| Multi-objective Optimization in CPU Design Space Exploration: Attention is All You Need | Oct 24, 2024 | All, CPU | Unverified | 0 |
| Structured Connectivity for 6G Reflex Arc: Task-Oriented Virtual User and New Uplink-Downlink Tradeoff | Oct 24, 2024 | ARC, CPU | Unverified | 0 |
| Sensing-Communication-Computing-Control Closed-Loop Optimization for 6G Unmanned Robotic Systems | Oct 24, 2024 | ARC, CPU | Unverified | 0 |
| ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | Oct 23, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs | Oct 22, 2024 | CPU, GPU | Unverified | 0 |
| AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost | Oct 22, 2024 | CPU, GPU | Unverified | 0 |
| Accelerate Coastal Ocean Circulation Model with AI Surrogate | Oct 19, 2024 | CPU, Disaster Response | Unverified | 0 |
| CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment | Oct 16, 2024 | CPU, GPU | Unverified | 0 |
| Towards Arbitrary QUBO Optimization: Analysis of Classical and Quantum-Activated Feedforward Neural Networks | Oct 16, 2024 | CPU, Decoder | Unverified | 0 |
| A Transformer Based Generative Chemical Language AI Model for Structural Elucidation of Organic Compounds | Oct 13, 2024 | CPU, Decoder | Unverified | 0 |
| Unveiling Molecular Secrets: An LLM-Augmented Linear Model for Explainable and Calibratable Molecular Property Prediction | Oct 11, 2024 | CPU, Dimensionality Reduction | Code Available | 0 |
| Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models | Oct 11, 2024 | CPU, GPU | Code Available | 0 |
| Bukva: Russian Sign Language Alphabet | Oct 11, 2024 | CPU, Sign Language Recognition | Code Available | 0 |
| ActNAS : Generating Efficient YOLO Models using Activation NAS | Oct 11, 2024 | CPU, GPU | Unverified | 0 |
| Dense Optimizer : An Information Entropy-Guided Structural Search Method for Dense-like Neural Network Design | Oct 10, 2024 | CPU | Unverified | 0 |
| KV Prediction for Improved Time to First Token | Oct 10, 2024 | Code Completion, CPU | Unverified | 0 |
| An Innovative Solution: AI-Based Digital Screen-Integrated Tables for Educational Settings | Oct 8, 2024 | CPU | Unverified | 0 |
| Fast Object Detection with a Machine Learning Edge Device | Oct 5, 2024 | Autonomous Navigation, CPU | Unverified | 0 |
| Dolphin: A Programmable Framework for Scalable Neurosymbolic Learning | Oct 4, 2024 | CPU, Deep Learning | Unverified | 0 |
| Predictive Attractor Models | Oct 3, 2024 | CPU | Unverified | 0 |