StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold
Zhizhong Li, Sina Sajadmanesh, Jingtao Li, Lingjuan Lyu
Abstract
Low-rank adaptation (LoRA) has been widely adopted as a parameter-efficient technique for fine-tuning large-scale pre-trained models. However, it still lags behind full fine-tuning in performance, partly due to its insufficient exploitation of the geometric structure underlying low-rank manifolds. In this paper, we propose a geometry-aware extension of LoRA that uses a three-factor decomposition USV^T. Analogous to the structure of singular value decomposition (SVD), it separates the adapter's input and output subspaces, V and U, from the scaling factor S. Our method constrains U and V to lie on the Stiefel manifold, ensuring their orthonormality throughout the training. To optimize on the Stiefel manifold, we employ a flexible and modular geometric optimization design that converts any Euclidean optimizer to a Riemannian one. It enables efficient subspace learning while remaining compatible with existing fine-tuning pipelines. Empirical results across a wide range of downstream tasks, including commonsense reasoning, math and code generation, image classification, and image generation, demonstrate the superior performance of our approach against the recent state-of-the-art variants of LoRA. Code is available at https://github.com/SonyResearch/stella.
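To make the idea concrete, below is a minimal PyTorch sketch of a three-factor low-rank update ΔW = USV^T in which U and V are kept orthonormal via a tangent-space projection followed by a QR retraction onto the Stiefel manifold. This is an illustrative reconstruction based only on the abstract, not the paper's actual implementation or API; all names, dimensions, and the choice of retraction are assumptions, and the official code at the repository above should be consulted for the authors' method.

```python
import torch

# Illustrative sketch (not the paper's API): three-factor adapter
# delta_W = U @ S @ V.T with U, V constrained to have orthonormal columns.
d_out, d_in, r = 256, 128, 8  # hypothetical layer and rank sizes

# Initialize U (d_out x r) and V (d_in x r) with orthonormal columns.
U, _ = torch.linalg.qr(torch.randn(d_out, r))
V, _ = torch.linalg.qr(torch.randn(d_in, r))
S = torch.zeros(r, r)  # scaling factor, zero-initialized like LoRA's B

def delta_w(U, S, V):
    # Low-rank update added to the frozen pre-trained weight.
    return U @ S @ V.T

def stiefel_step(X, euclidean_grad, lr):
    # One gradient step on the Stiefel manifold (Euclidean metric):
    # 1) project the Euclidean gradient onto the tangent space at X,
    # 2) take a step, 3) retract back onto the manifold via QR.
    sym = (X.T @ euclidean_grad + euclidean_grad.T @ X) / 2
    tangent = euclidean_grad - X @ sym   # tangent-space projection
    Q, R = torch.linalg.qr(X - lr * tangent)
    return Q * torch.sign(torch.diagonal(R))  # fix QR sign ambiguity

# Example: one step with a dummy gradient preserves orthonormality.
U = stiefel_step(U, torch.randn_like(U), lr=1e-2)
print(torch.allclose(U.T @ U, torch.eye(r), atol=1e-5))  # True
```

The abstract's "convert any Euclidean optimizer to a Riemannian one" plausibly corresponds to wrapping a standard optimizer so that, at each step, gradients are projected to the tangent space and the update is followed by a retraction, as in `stiefel_step` above; the paper should be read for the authors' exact construction.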