Meeting in the Middle: A Co-Design Paradigm for FHE and AI Inference
Bernardo Magri, Benjamin Marsh, Paul Gebheim
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
Modern cloud inference creates a two sided privacy problem where users reveal sensitive inputs to providers, while providers must execute proprietary model weights inside potentially leaky execution environments. Fully homomorphic encryption (FHE) offers cryptographic guarantees but remains prohibitively expensive for modern architectures. We argue that progress requires co-design where specializing FHE schemes/compilers for the static structure of inference circuits, while simultaneously constraining inference architectures to reduce dominant homomorphic cost drivers. We outline a meet in the middle agenda and concrete optimization targets on both axes.