SOTAVerified

Chi-square and normal inference in high-dimensional multi-task regression

2021-07-16Unverified0· sign in to hype

Pierre C Bellec, Gabriel Romon

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

The paper proposes chi-square and normal inference methodologies for the unknown coefficient matrix B^* of size p T in a Multi-Task (MT) linear model with p covariates, T tasks and n observations under a row-sparse assumption on B^*. The row-sparsity s, dimension p and number of tasks T are allowed to grow with n. In the high-dimensional regime p n, in order to leverage row-sparsity, the MT Lasso is considered. We build upon the MT Lasso with a de-biasing scheme to correct for the bias induced by the penalty. This scheme requires the introduction of a new data-driven object, coined the interaction matrix, that captures effective correlations between noise vector and residuals on different tasks. This matrix is psd, of size T T and can be computed efficiently. The interaction matrix lets us derive asymptotic normal and ^2_T results under Gaussian design and sT+s(p/s)n0 which corresponds to consistency in Frobenius norm. These asymptotic distribution results yield valid confidence intervals for single entries of B^* and valid confidence ellipsoids for single rows of B^*, for both known and unknown design covariance . While previous proposals in grouped-variables regression require row-sparsity s n up to constants depending on T and logarithmic factors in n,p, the de-biasing scheme using the interaction matrix provides confidence intervals and ^2_T confidence ellipsoids under the conditions (T^2,^8p)/n 0 and allowing row-sparsity s n when \|^-1e_j\|_0 T n up to logarithmic factors.

Tasks

Reproductions