Accelerating GenAI Workloads by Enabling RISC-V Microkernel Support in IREE
2025-07-07Code Available0· sign in to hype
Adeel Ahmad, Ahmad Tameem Kamal, Nouman Amir, Bilal Zafar, Saad Bin Nasir
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/iree-org/ireeOfficial★ 3,671
Abstract
This project enables RISC-V microkernel support in IREE, an MLIR-based machine learning compiler and runtime. The approach begins by enabling the lowering of MLIR linalg dialect contraction ops to linalg.mmt4d op for the RISC-V64 target within the IREE pass pipeline, followed by the development of optimized microkernels for RISC-V. The performance gains are compared with upstream IREE and Llama.cpp for the Llama-3.2-1B-Instruct model.