Semi-supervised Inference for Block-wise Missing Data without Imputation

Shanshan Song, Yuanyuan Lin, Yong Zhou.

Year: 2024, Volume: 25, Issue: 99, Pages: 1−36


Abstract

We consider statistical inference for single or low-dimensional parameters in a high-dimensional linear model under a semi-supervised setting, wherein the data are a combination of a labelled block-wise missing data set of a relatively small size and a large unlabelled data set. The proposed method utilises both labelled and unlabelled data without any imputation or removal of the missing observations. The asymptotic properties of the estimator are established under regularity conditions. Hypothesis testing for low-dimensional coefficients is also studied. Extensive simulations are conducted to examine the theoretical results. The method is evaluated on the Alzheimer’s Disease Neuroimaging Initiative data.
