Abstract
For complete ultrahigh-dimensional data, sure independent screening methods can effectively reduce the dimensionality while retaining all the active variables with high probability. However, few screening methods have been developed for ultrahigh-dimensional survival data subject to censoring. We propose a censored cumulative residual independent screening method that is model-free and enjoys the sure independent screening property. Active variables tend to be ranked above the inactive ones in terms of their association with the survival times. Compared with several existing methods, our model-free screening method works well with general survival models; it is invariant to monotone transformations of the response and requires substantially weaker moment conditions. Numerical studies demonstrate the usefulness of the censored cumulative residual independent screening method, and the new approach is illustrated with a gene expression data set.
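The screening workflow described above — compute a marginal utility for every covariate, rank the covariates by it, and retain the top few — can be sketched generically. The utility below (absolute correlation with log observed time among uncensored subjects) is a hypothetical placeholder for illustration only, not the censored cumulative residual statistic proposed in the paper:

```python
import math
import random

def marginal_utility(x, time, delta):
    """Toy marginal utility: |correlation| between a covariate and log observed
    time among uncensored subjects (hypothetical stand-in for the censored
    cumulative residual statistic)."""
    xs = [xi for xi, d in zip(x, delta) if d == 1]
    ys = [math.log(ti) for ti, d in zip(time, delta) if d == 1]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    sx = math.sqrt(sum((a - mx) ** 2 for a in xs))
    sy = math.sqrt(sum((b - my) ** 2 for b in ys))
    return abs(cov / (sx * sy))

def screen(X, time, delta, d):
    """Rank the p covariates by marginal utility and retain the top d."""
    ranked = sorted(range(len(X)),
                    key=lambda k: -marginal_utility(X[k], time, delta))
    return set(ranked[:d])

# Simulated example: p = 50 covariates, only covariate 0 affects survival.
random.seed(1)
n, p = 200, 50
X = [[random.gauss(0.0, 1.0) for _ in range(n)] for _ in range(p)]
T = [random.expovariate(math.exp(2.0 * X[0][i])) for i in range(n)]  # true model
C = [random.expovariate(0.25) for _ in range(n)]                     # censoring
time = [min(t, c) for t, c in zip(T, C)]
delta = [1 if t <= c else 0 for t, c in zip(T, C)]
selected = screen(X, time, delta, d=10)   # covariate 0 should be retained
```

The naive correlation utility used here is not robust to censoring; the paper's statistic is designed precisely to handle censored responses in a model-free way.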
References
Bitouzé D, Laurent B, Massart P (1999) A Dvoretzky–Kiefer–Wolfowitz type inequality for the Kaplan–Meier estimator. Annales de l’Institut Henri Poincaré (B) Probab Stat 35:735–763
Candes E, Tao T (2007) The Dantzig selector: statistical estimation when \(p\) is much larger than \(n\). Ann Stat 35:2313–2351
Cook AJ, Gold DR, Li Y (2007) Spatial cluster detection for censored outcome data. Biometrics 63:540–549
Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360
Fan J, Lv J (2008) Sure independence screening for ultrahigh dimensional feature space. J R Stat Soc Ser B 70:849–911
Fan J, Song R (2010) Sure independence screening in generalized linear models with NP-dimensionality. Ann Stat 38:3567–3604
Fan J, Samworth R, Wu Y (2009) Ultrahigh dimensional feature selection: beyond the linear model. J Mach Learn Res 10:2013–2038
Fan J, Feng Y, Wu Y (2010) High-dimensional variable selection for Cox’s proportional hazards model. In: Borrowing strength: theory powering applications—a Festschrift for Lawrence D. Brown, Institute of Mathematical Statistics 6:70–86
Fan J, Feng Y, Song R (2011) Nonparametric independence screening in sparse ultra-high-dimensional additive models. J Am Stat Assoc 106:544–557
Gorst-Rasmussen A, Scheike T (2013) Independent screening for single-index hazard rate models with ultrahigh dimensional features. J R Stat Soc Ser B 75:217–245
Hoeffding W (1948) A non-parametric test of independence. Ann Math Stat 19:546–557
Lin DY, Wei LJ, Ying Z (1993) Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika 80:557–572
Li R, Zhong W, Zhu L (2012) Feature screening via distance correlation learning. J Am Stat Assoc 107:1129–1139
Rosenwald A, Wright G, Wiestner A, Chan WC et al (2003) The proliferation gene expression signature is a quantitative integrator of oncogenic events that predicts survival in mantle cell lymphoma. Cancer Cell 3:185–197
Serfling RJ (1980) Approximation theorems of mathematical statistics. Wiley, New York
Song R, Lu W, Ma S, Jeng XJ (2014) Censored rank independence screening for high-dimensional survival data. Biometrika 101:799–814
Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B 58:267–288
Tibshirani R (2009) Univariate shrinkage in the Cox model for high dimensional data. Stat Appl Genet Mol Biol 8:1–18
Wu Y, Yin G (2015) Conditional quantile screening in ultrahigh-dimensional heterogeneous data. Biometrika 102:65–76
Zhang CH (2010) Nearly unbiased variable selection under minimax concave penalty. Ann Stat 38:894–942
Zhao SD, Li Y (2012) Principled sure independence screening for Cox models with ultra-high-dimensional covariates. J Multivar Anal 105:397–411
Zhu LP, Li L, Li R, Zhu LX (2011) Model-free feature screening for ultrahigh dimensional data. J Am Stat Assoc 106:1464–1475
Zou H (2006) The adaptive Lasso and its oracle properties. J Am Stat Assoc 101:1418–1429
Acknowledgements
We would like to thank the editor, the associate editor, and the referees for their insightful comments, which greatly improved this work. This research was supported in part by grants (11371299, 11571263, 11671311) from the National Natural Science Foundation of China and a grant (17125814) from the Research Grants Council of Hong Kong.
Appendix: Theoretical Proofs
Proof of Theorem 1
Let
and define
Straightforward calculations entail that
where
\({{\mathcal {O}}}_{ik}=(X_i,\Delta _i,Z_{ik})\), and the definitions of kernels \(h_{1}({{\mathcal {O}}}_{ik};{{\mathcal {O}}}_{jk};G,H)\) and \(h_{2}({{\mathcal {O}}}_{ik};{{\mathcal {O}}}_{jk};{{\mathcal {O}}}_{lk};G,H)\) in the U-statistics are clear from the context. Likewise, we have
where \(\widehat{D}_{ks}, s=1,2,\) are obtained by replacing G and H in \(\widetilde{D}_{ks}\) with \(\widehat{G}_n\) and \(\widehat{H}_n\) respectively.
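Replacing G and H by the Kaplan–Meier estimates \(\widehat{G}_n\) and \(\widehat{H}_n\) is justified because the Kaplan–Meier estimator converges uniformly at an exponential rate, via the Dvoretzky–Kiefer–Wolfowitz-type inequality of Bitouzé et al. (1999) invoked below. A minimal simulation of this sup-norm convergence over a truncated range, mirroring the truncation at \(\tau\) in condition C1 (the exponential-survival setup is illustrative only):

```python
import math
import random

def kaplan_meier(time, delta):
    """Kaplan-Meier estimate of S(t) = P(T > t): returns the event times and
    the estimated survival probability just after each event."""
    order = sorted(range(len(time)), key=lambda i: time[i])
    at_risk, s = len(time), 1.0
    points, surv = [], []
    for i in order:
        if delta[i] == 1:              # event: multiply in the KM factor
            s *= 1.0 - 1.0 / at_risk
            points.append(time[i])
            surv.append(s)
        at_risk -= 1                   # subject leaves the risk set either way
    return points, surv

def sup_error(n, tau=2.0, seed=7):
    """Sup-norm distance between the KM estimate and the true survival
    function exp(-t), over event times up to the truncation point tau."""
    random.seed(seed)
    T = [random.expovariate(1.0) for _ in range(n)]   # true S(t) = exp(-t)
    C = [random.expovariate(0.5) for _ in range(n)]   # independent censoring
    obs = [min(t, c) for t, c in zip(T, C)]
    delta = [1 if t <= c else 0 for t, c in zip(T, C)]
    pts, surv = kaplan_meier(obs, delta)
    return max(abs(s - math.exp(-t)) for t, s in zip(pts, surv) if t <= tau)

err_small, err_large = sup_error(50), sup_error(5000)   # error shrinks with n
```

The exponential tail of this sup-norm error is exactly what allows the plug-in terms \(\widehat{D}_{ks}-\widetilde{D}_{ks}\) to be controlled uniformly over all \(p_n\) covariates in the argument that follows.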
First, we derive the exponential tail probability bound of \(P\big (\big |\Vert \widehat{d_k}\Vert _n^2-\Vert \widetilde{d_k}\Vert _n^2\big |\ge \upsilon n^{-\alpha }\big )\) for any positive constants \(\upsilon \) and \(\alpha \in [0, 1/2)\). Consider \(P(|\widehat{D}_{k1}-\widetilde{D}_{k1}|\ge \upsilon n^{-\alpha }/2)\) and note that
By condition C1 and the boundedness of the indicator function, there exists a constant \(c_1\) such that
Setting \(c_2=\min \{G(\tau ),H(\tau )\}\), we immediately have
where \(c_3=c_1/c_2\).
Using a similar argument, along with some tedious calculations, we also have
where \(c_{4}\) is a constant. It follows from (A.3) and Theorem 1 of Bitouzé et al. (1999) that
where \(\mu _1\) is a constant. Similarly, we also have
where \(\mu _2\) is a constant. Combining (A.1), (A.2), (A.4) and (A.5), we have
Second, we derive the exponential tail probability bound of \(P\big (\big |\Vert \widetilde{d_k}\Vert _n^2-\Vert d_k\Vert _n^2\big |\ge \upsilon n^{-\alpha }\big )\) for any positive constants \(\upsilon \) and \(\alpha \in [0, 1/2)\).
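The exponential tail bound developed in this second step is the classical Chernoff argument for an average of \(m\) i.i.d. bounded terms; a generic sketch, stated with the constant \(c_5\) satisfying \(P(|h_2|<c_5)=1\) and \(m=[n/3]\) as they appear later in the proof:

```latex
% For W = m^{-1}\sum_{i=1}^m W_i with W_i i.i.d. and |W_i| \le c_5 a.s.:
\begin{align*}
P(W - EW \ge \epsilon)
  &\le e^{-\xi\epsilon}\, E\bigl[e^{\xi(W - EW)}\bigr]
       && \text{(Markov inequality)}\\
  &=   e^{-\xi\epsilon}\,\Bigl(E\bigl[e^{(\xi/m)(W_1 - EW_1)}\bigr]\Bigr)^{m}
       && \text{(independence)}\\
  &\le \exp\Bigl(-\xi\epsilon + \tfrac{\xi^{2}c_5^{2}}{2m}\Bigr)
       && \text{(Hoeffding's lemma)}\\
  &=   \exp\Bigl(-\tfrac{\epsilon^{2}m}{2c_5^{2}}\Bigr)
       && \text{at the minimizer } \xi = \epsilon m/c_5^{2}.
\end{align*}
```

The choice \(\xi =\epsilon m/c_{5}^2\) made below is exactly this minimizer, and the resulting exponent \(\epsilon^2 m/(2c_5^2)\) is the source of the rate obtained at the end of this step.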
Note that \(\Vert d_k\Vert _n^2=E\{h_{2}({{\mathcal {O}}}_{ik};{{\mathcal {O}}}_{jk};{{\mathcal {O}}}_{lk};G,H)\}=E(\widetilde{D}_{k2})\). Employing the Markov inequality, we obtain that, for any \(\epsilon >0\) and \(\xi >0\),
Serfling (1980, Section 5.1.6) showed that any U-statistic can be represented as an average of averages of i.i.d. random variables. We can rewrite
where \(\Sigma _{n!}\) denotes the summation over all possible permutations of \((1,\ldots ,n)\), and each \(D_{2}({{\mathcal {O}}}_{1k};\cdots ;{{\mathcal {O}}}_{nk};G,H)\) is an average of \(m\equiv [n/3]\) i.i.d. random variables. Denote \(\psi (\xi )=E[\exp \{\xi h_{2}({{\mathcal {O}}}_{ik};{{\mathcal {O}}}_{jk};{{\mathcal {O}}}_{lk};G,H)\}]\). Jensen’s inequality yields that
As a result,
Under condition C1, there exists a positive constant \(c_{5}\) such that \(P(|h_2|<c_{5})=1\). It follows from Lemma 1 in Li et al. (2012) that
which immediately entails that
by choosing \(\xi =\epsilon m/c_{5}^2\). It further follows from the symmetry of the U-statistic that
Using a similar argument, we also have
where \(c_{6}\) is a positive constant such that \(P(|h_1|< c_{6})=1\) and \(m^{*}=[n/2]\). Obviously, under condition C1, there exist constants \(c_{7}\) and \(c_{8}\) such that \(0\le \Vert d_k\Vert _n^2=E(\widetilde{D}_{k2})\le E|\widetilde{D}_{k2}|\le c_{7}\) and \(0\le E(\widetilde{D}_{k1})\le E|\widetilde{D}_{k1}|\le c_{8}\) for any \(1\le k \le p_n\). Taking \(\epsilon =\upsilon n^{-\alpha }\) and n large enough such that \((3n-2)n^{-2}E(\widetilde{D}_{k2})<\upsilon n^{-\alpha }\) and \((n-1)n^{-2}E(\widetilde{D}_{k1})<\upsilon n^{-\alpha }\), we have
by noting that \(m^*\ge m\ge n/4\), where \(c_{9}=1/(8c_{6}^2)\) and \(c_{10}=1/(8c_{5}^2)\). It follows from (A.6) and (A.7) that
where \(\eta =\min \{2c_{4}^{-2}\upsilon ^{2},c_{10}\upsilon ^2\}\). Immediately, we have
which proves the first part of Theorem 1 by taking \(c=8\upsilon \).
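For completeness, the uniform statement in this first part follows from the componentwise bound (A.8) via a union bound over the \(p_n\) covariates; schematically (a sketch consistent with the rate \(n^{1-2\alpha}\) implied by the constants above):

```latex
\begin{align*}
P\Bigl(\max_{1\le k\le p_n}\bigl|\Vert \widehat{d_k}\Vert _n^2
       -\Vert d_k\Vert _n^2\bigr|\ge c\,n^{-\alpha}\Bigr)
 &\le \sum_{k=1}^{p_n}
      P\bigl(\bigl|\Vert \widehat{d_k}\Vert _n^2
      -\Vert d_k\Vert _n^2\bigr|\ge c\,n^{-\alpha}\bigr)\\
 &=   O\bigl(p_n \exp(-\eta\, n^{1-2\alpha})\bigr),
\end{align*}
```

with \(\eta\) as defined above, so the bound vanishes whenever \(\log p_n = o(n^{1-2\alpha})\).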
If \(\mathcal {A}\nsubseteq \widehat{\mathcal {A}}\), then there must exist some \(k\in \mathcal {A}\) such that \(\Vert \widehat{d_k}\Vert _n^2< cn^{-\alpha }\). It follows from condition C2 that \(|\Vert \widehat{d_k}\Vert _n^2-\Vert d_k\Vert _n^2|>cn^{-\alpha }\) for some \(k\in \mathcal {A}\), which implies that \(\{\mathcal {A}\nsubseteq \widehat{\mathcal {A}}\}\subseteq \{|\Vert \widehat{d_k}\Vert _n^2-\Vert d_k\Vert _n^2|>cn^{-\alpha }\) for some \(k\in \mathcal {A}\}\). As a result, \(\{\max _{k\in \mathcal {A}}|\Vert \widehat{d_k}\Vert _n^2-\Vert d_k\Vert _n^2|\le cn^{-\alpha }\}\subseteq \{\mathcal {A}\subseteq \widehat{\mathcal {A}}\}\). Using (A.8), we have
where \(a_n=|\mathcal {A}|\). Thus, the proof of Theorem 1 is completed. \(\square \)
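The averages-of-averages representation cited from Serfling (1980, Section 5.1.6) can be checked numerically: averaging, over all permutations of the sample, the mean of the kernel evaluated on \(m=[n/3]\) disjoint triples reproduces the U-statistic exactly. A small sketch with a hypothetical bounded symmetric kernel (not the paper's \(h_2\)):

```python
import itertools
import math
import random

def u_statistic(data, h, r=3):
    """Third-order U-statistic: average of h over all r-subsets of the sample."""
    combos = list(itertools.combinations(data, r))
    return sum(h(*c) for c in combos) / len(combos)

def block_average(perm, h, r=3):
    """Average of h over the m = [n/r] disjoint blocks of one permutation;
    each block average is a mean of m i.i.d. terms."""
    m = len(perm) // r
    return sum(h(*perm[r * i: r * i + r]) for i in range(m)) / m

h = lambda x, y, z: min(x, y, z)      # hypothetical bounded symmetric kernel
random.seed(0)
data = tuple(random.random() for _ in range(6))   # small n so n! is feasible
U = u_statistic(data, h)
perms = list(itertools.permutations(data))
perm_avg = sum(block_average(p, h) for p in perms) / math.factorial(6)
# perm_avg coincides with U: averaging the block means over all permutations
# symmetrizes the kernel back to the U-statistic.
```

Because each block average is a mean of i.i.d. bounded terms, the Chernoff bound for i.i.d. averages transfers to the U-statistic by Jensen's inequality, which is precisely how the proof above obtains its exponential rate.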
Proof of Theorem 2
Under assumption (i), we rewrite
If \(k\notin \mathcal {A}\), then assumption (ii) implies that
for any t and z. As a result, \(\Vert d_k\Vert _n^2=E\{d_k(X,Z_k)^2\}=0\). It follows from condition C2 that \(\max _{k\notin \mathcal {A}}\Vert d_k\Vert _n^2 < \min _{k\in \mathcal {A}}\Vert d_k\Vert _n^2\). On the other hand, \(\Vert d_k\Vert _n^2=0\) directly implies that \(k\notin \mathcal {A}\) under condition C2. Thus, the first part of Theorem 2 is proved.
Under condition C2 and assumptions (i) and (ii), coupled with (A.9), we have
which completes the proof of Theorem 2. \(\square \)
Zhang, J., Yin, G., Liu, Y. et al. Censored cumulative residual independent screening for ultrahigh-dimensional survival data. Lifetime Data Anal 24, 273–292 (2018). https://doi.org/10.1007/s10985-017-9395-2