Skip to main content
Log in

Integrated likelihoods in parametric survival models for highly clustered censored data

  • Published:
Lifetime Data Analysis Aims and scope Submit manuscript

Abstract

In studies that involve censored time-to-event data, stratification is frequently encountered due to different reasons, such as stratified sampling or model adjustment due to violation of model assumptions. Often, the main interest is not in the clustering variables, and the cluster-related parameters are treated as nuisance. When inference is about a parameter of interest in presence of many nuisance parameters, standard likelihood methods often perform very poorly and may lead to severe bias. This problem is particularly evident in models for clustered data with cluster-specific nuisance parameters, when the number of clusters is relatively high with respect to the within-cluster size. However, it is still unclear how the presence of censoring would affect this issue. We consider clustered failure time data with independent censoring, and propose frequentist inference based on an integrated likelihood. We then apply the proposed approach to a stratified Weibull model. Simulation studies show that appropriately defined integrated likelihoods provide very accurate inferential results in all circumstances, such as for highly clustered data or heavy censoring, even in extreme settings where standard likelihood procedures lead to strongly misleading results. We show that the proposed method performs generally as well as the frailty model, but it is superior when the frailty distribution is seriously misspecified. An application, which concerns treatments for a frequent disease in late-stage HIV-infected people, illustrates the proposed inferential method in Weibull regression models, and compares different inferential conclusions from alternative methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  • Aalen OO, Borgan Ø, Gjessing HK (2008) Survival and event history analysis: a process point of view. Springer, New York

    Book  MATH  Google Scholar 

  • Andersen PK, Borgan Ø, Gill RD, Keiding N (1993) Statistical models based on counting processes. Springer, New York

    Book  MATH  Google Scholar 

  • Barndorff-Nielsen OE, Cox DR (1994) Inference and asymptotics. Chapman and Hall, London

    Book  MATH  Google Scholar 

  • Bartolucci F, Bellio R, Salvan A, Sartori N (2015) Modified profile likelihood for fixed-effects panel data models. Econ Rev (to appear). doi:10.1080/07474938.2014.975642

  • Bellio R, Guolo A (2015) Integrated likelihood inference in small sample meta-analysis for continuous outcomes. Scand J Stat (to appear). doi:10.1111/sjos.12172

  • Bellio R, Sartori N (2006) Practical use of modified maximum likelihoods for stratified data. Biom J 48:876–886

    Article  MathSciNet  Google Scholar 

  • Carlin PB, Hodges JS (1999) Hierarchical proportional hazards regression models for highly stratified data. Biometrics 55:1162–1170

    Article  MATH  Google Scholar 

  • Cohn DL, Fisher EJ, Peng JS, Hodges JS, Chesnut J, Child CC, Franchino B, Gibert CL, El-Sadr W, Hafner R, Korvick J, Ropka M, Heifets L, Clotfelter J, Munroe D, Horsburgh CR, and Terry Beirn Community Programs for Clinical Research on AIDS (1999) A prospective randomized trial of four three-drug regimens in the treatment of disseminated mycobacterium avium complex disease in AIDS patients: excess mortality associated with high-dose clarithromycin. Clin Infect Dis 29:125–133

  • Cox DR, Reid N (1987) Parameter orthogonality and approximate conditional inference (with discussion). J R Stat Soc Ser B Met 49:1–39

    MathSciNet  MATH  Google Scholar 

  • De Bin R (2012) Integrated likelihood for the treatment of nuisance parameters. Ph.D. Thesis, Department of Statistical Sciences, University of Padua. http://paduaresearch.cab.unipd.it/4872

  • De Bin R, Sartori N, Severini TA (2015) Integrated likelihoods in models with stratum nuisance parameters. Electron J Stat 9:1474–1491

    Article  MathSciNet  MATH  Google Scholar 

  • Kalbfleish JD, Prentice RL (2002) The statistical analysis of failure times data. Wiley, Hoboken

    Book  Google Scholar 

  • McCullagh P, Tibshirani R (1990) A simple method for the adjustment of profile likelihoods. J R Stat Soc Ser B Met 52:325–344

    MathSciNet  MATH  Google Scholar 

  • Munda M, Rotolo F, Legrand C (2012) Parfm: parametric frailty models in R. J Stat Softw 51(11)

  • Pierce DA, Bellio R (2006) Effects of the reference set on frequentist inferences. Biometrika 93:425–438

    Article  MathSciNet  MATH  Google Scholar 

  • R Core Team (2013) R: A language and environment for statistical computing. R Foundation for Statistical Computing , Vienna. URL http://www.R-project.org/

  • Sartori N (2003) Modified profile likelihoods in models with stratum nuisance parameters. Biometrika 90:533–549

    Article  MathSciNet  Google Scholar 

  • Severini TA (2000) Likelihood methods in statistics. Oxford University Press, Oxford

    MATH  Google Scholar 

  • Severini TA (2007) Integrated likelihood functions for non-Bayesian inference. Biometrika 94:529–542

    Article  MathSciNet  MATH  Google Scholar 

  • Severini TA (2010) Likelihood ratio statistics based on an integrated likelihood. Biometrika 97:481–496

    Article  MathSciNet  MATH  Google Scholar 

  • Severini TA (2011) Frequency properties on inferences based on an integrated likelihood function. Stat Sin 21:433–447

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

We thank the reviewers for their detailed and constructive comments which led to a substantial improvement of the paper. The research was supported by ‘Progetto Ateneo 2013’ (CPDA 131553), University of Padua. The second author acknowledges the financial support of the CARIPARO Foundation Excellence - grant 2011/2012.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giuliana Cortese.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 157 KB)

Supplementary material 2 (zip 106 KB)

Appendix

Appendix

1.1 Expected score for the ith cluster

In the following, we prove that the expected score for the ith cluster has the form given in (8).

Let us consider, for \(i=1,\ldots ,n\), the score functions for the nuisance parameters reported in Eq. (7). We need now to compute the expected score of \(\ell ^i_{\lambda _i} (\psi , \lambda _i) \) with respect to the processes in the true parameter \((\psi _0, \lambda _{i0})\). For this purpose, we proceed as follows.

For the leading integral on the right-hand side of (7), we use the Doob-Meyer decomposition \( dN_{ij}(t) = h_{ij}(t; \psi _0, \lambda _{i0}) \, Y_{ij}(t) \,dt + dM_{ij}(t)\), where \(h_{ij}(t; \psi _0, \lambda _{i0}) \, Y_{ij}(t) dt \) is a unique predictable process and \(dM_{ij}(t)\) is a martingale increment. Therefore, we obtain

$$\begin{aligned}&E \left\{ \ell ^i_{\lambda _i} (\psi , \lambda _i); \psi _0, \lambda _{i0} \right\} \\&= \sum _{j=1}^k E \left\{ \left( \int _0^{\infty } \frac{\partial }{\partial \lambda _i} \log h_{ij} (t; \psi , \lambda _i) \, h_{ij}(t; \psi _0, \lambda _{i0}) \, Y_{ij}(t) \, dt \right. \right. \\&\qquad \qquad \qquad + \int _0^{\infty } \frac{\partial }{\partial \lambda _i} \log h_{ij} (t ; \psi , \lambda _i) \,dM_{ij}(t) \\&\qquad \qquad \qquad \left. \left. - \int _0^{\infty } \frac{\partial }{\partial \lambda _i} h_{ij} (t ; \psi , \lambda _i)~ Y_{ij}(t) \,dt \right) ; \psi _0, \lambda _{i0} \right\} . \end{aligned}$$

The second integral on the right-hand side is a stochastic integral with respect to a martingale, and is itself also a martingale, since its integrand is a bounded predictable process. Therefore, the expected value of this integral is equal to zero. Consider now the expected values of the remaining two integrals with respect to the process \(h_{ij}(t; \psi _0, \lambda _{i0}) \, Y_{ij}(t)\); interchanges of integrals follow from Fubini’s theorem and we obtain the following general expression for the expected score of the ith cluster:

$$\begin{aligned}&E \left\{ \ell _{\lambda _i}^i (\psi , \lambda _i); \psi _0, \lambda _{i0} \right\} \nonumber \\&\quad = \sum _{j=1}^k \left\{ \int _0^{\infty } \frac{\partial }{\partial \lambda _i} \log h_{ij} (t ; \psi , \lambda _i) \, h_{ij} (t ; \psi _0, \lambda _{i0}) \, y_{ij}(t; \psi _0, \lambda _{i0})\, dt \right. \nonumber \\&\qquad \qquad \quad \left. - \int _0^{\infty } \frac{\partial }{\partial \lambda _i} h_{ij} (t ; \psi , \lambda _i) \, y_{ij} (t ; \psi _0, \lambda _{i0}) \, dt \right\} , \end{aligned}$$
(21)

where \( y_{ij}(t; \psi _0, \lambda _{i0}) = E\left\{ Y_{ij}(t); \psi _0, \lambda _{i0} \right\} \). Rewriting this expression with \( \frac{\partial }{\partial \lambda _i} \log h_{ij} (t ; \psi , \lambda _i) = \frac{\partial }{\partial \lambda _i} h_{ij} (t ; \psi , \lambda _i) / h_{ij} (t ; \psi , \lambda _i) \) gives the final equation in (8).

1.2 Expected score for the ith cluster under type I censoring

We show here the derivations of the expected score equation for the ith cluster given in (10).

Under type I censoring, let us assume that the censoring times for each cluster are constant, i.e., \(c_{i1}=\ldots = c_{ik}=c_i\) for \(i=1,\ldots ,n\). Then, we have \( y_{ij}(t; \psi _0, \lambda _{i0}) = S_{ij} (t; \psi _0, \lambda _{i0})\) for \( t \le c_i\), and \( y_{ij}(t; \psi _0, \lambda _{i0}) = 0\) for \( t > c_i\). By inserting these expression in the expected score Eq. (21), together with \( h_{ij}(t)= p_{ij}(t) / S_{ij}(t)\), it follows that

$$\begin{aligned} E \left\{ \ell _{\lambda _i}^i (\psi , \lambda _i); \psi _0, \lambda _{i0} \right\}= & {} \sum _{j=1}^k \left\{ \int _0^{c_i} \frac{\partial }{\partial \lambda _i} \log p_{ij} (t ; \psi , \lambda _i) ~ p_{ij} (t ; \psi _0, \lambda _{i0}) \, dt \right. \\&\qquad \quad - \int _0^{c_i} \frac{\partial }{\partial \lambda _i} \log S_{ij} (t ; \psi , \lambda _i) ~ p_{ij} (t ; \psi _0, \lambda _{i0}) \, dt \\&\qquad \quad \left. - \int _0^{c_i} \frac{\partial }{\partial \lambda _i} h_{ij} (t ; \psi , \lambda _i) ~ S_{ij} (t ; \psi _0, \lambda _{i0}) \, dt \right\} . \end{aligned}$$

Observe that if \( g'(t) = \frac{\partial }{\partial \lambda _i} h_{ij} (t ; \psi , \lambda _i) = \frac{\partial }{\partial \lambda _i} \left( - \frac{d}{dt} \log S_{ij}(t; \psi , \lambda _i) \right) = - \frac{d}{dt} \left( \frac{\partial }{\partial \lambda _i} \log S_{ij}(t; \psi , \lambda _i) \right) \), then \(g(t)= - \frac{\partial }{\partial \lambda _i} \log S_{ij} (t; \psi , \lambda _i)\). Therefore, by using this fact and integration by parts for the third integral on the right-hand side, we have

$$\begin{aligned} \begin{aligned} \int _0^{c_i} \frac{\partial }{\partial \lambda _i} h_{ij} (t ; \psi , \lambda _i) ~ S_{ij} (t ; \psi _0, \lambda _{i0}) \, dt =&- \frac{\partial }{\partial \lambda _i} \log S_{ij} (c_i ; \psi , \lambda _i) ~ S_{ij} (c_i ; \psi _0, \lambda _{i0}) \\&- \int _0^{c_i} \frac{\partial }{\partial \lambda _i} S_{ij} (t ; \psi , \lambda _i) ~ p_{ij} (t ; \psi _0, \lambda _{i0}) \, dt. \\ \end{aligned} \end{aligned}$$

Using this result for the third integral on the right-hand side of the expected score, we obtain

$$\begin{aligned} \begin{aligned} E \left\{ \ell _{\lambda _i}^i (\psi , \lambda _i); \psi _0, \lambda _{i0} \right\} = \sum _{j=1}^k&\left\{ \int _0^{c_i} \frac{\partial }{\partial \lambda _i} \log p_{ij} (t ; \psi , \lambda _i) ~ p_{ij} (t ; \psi _0, \lambda _{i0}) \, dt \right. \\&\quad \left. +\, \frac{\partial }{\partial \lambda _i} \log S_{ij} (c_i ; \psi , \lambda _i) ~ S_{ij} (c_i ; \psi _0, \lambda _{i0}) \right\} . \end{aligned} \end{aligned}$$

Finally, Eq. (10) follows by changing the notation to \( \eta _{ij} (t; \psi , \lambda _i) = \frac{\partial }{\partial \lambda _i} \log p_{ij}(t;\psi , \lambda _i), \) and \( H_{ij} (t; \psi , \lambda _i) = \frac{\partial }{\partial \lambda _i} log S_{ij}(t;\psi , \lambda _i) \), and setting the expected score equal to zero.

1.3 Integrated likelihood for the Weibull regression model

For the regression model (19), the expected score for the nuisance \(\alpha _i\) is given by

$$\begin{aligned} E\{ \ell ^i_{\alpha _i} (\psi _1,\psi _2,\alpha _i); \hat{\psi }_1, \hat{\psi }_2, \omega _i \} = \sum _j \left( \frac{\partial \eta _{ij}}{\partial \alpha _i} \right) E \left\{ \ell _{\eta _{ij}}^i (\psi _1, \eta _{ij}); \hat{\psi }_1, \phi _{ij} \right\} . \end{aligned}$$
(22)

An explicit expression can be obtained by applying the theory in Sect. 4 to a Weibull\((\eta _{ij}, \psi _1)\). Thus, \(\eta _{ij}, \psi _1\) and \(\phi _{ij}\) play here the same role as, respectively, \(\lambda _i, \psi \) and \(\phi _i\) in Sect. 4, with the exception that they depend on the index j because of the presence of covariates. Therefore, we can get \( E \{ \ell _{\eta _{ij}}^i (\psi _1, \eta _{ij}); \hat{\psi }_1, \phi _{ij} \} \) by replacing \(\lambda _i, \psi \) and \(\phi _i\) with \(\eta _{ij}, \psi _1\) and \(\phi _{ij}\) in expression (15). Then, given that \(\partial \eta _{ij} / \partial \alpha _i=-\eta _{ij}\) and using the parameterizations in (20), we can find a final expression for (22).

Setting (22) equal to zero leads to the relation between \(\alpha _i\) and the ZSE parameter \(\omega _i\), i.e.,

$$\begin{aligned} \alpha _i = -\frac{1}{\psi _1} \log \left[ \frac{\hat{\psi }_1 \sum _j e^{-\hat{\psi }_1 (\omega _i + \hat{\psi }_2 x_{ij})} \int _0^{\infty } t^{\hat{\psi }_1 -1} y_{ij}(t; \hat{\psi }_1,\hat{\psi }_2,\omega _i) dt }{\psi _1 \sum _j e^{-\psi _1 (\alpha _i + \psi _2 x_{ij})} \int _0^{\infty } t^{\psi _1 -1} y_{ij}(t; \hat{\psi }_1,\hat{\psi }_2,\omega _i) dt } \right] . \end{aligned}$$

This equation can also be obtained directly by replacing \(\lambda _i, \psi \) and \(\phi _i\) with \(\eta _{ij}, \psi _1\) and \(\phi _{ij}\) in (16).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cortese, G., Sartori, N. Integrated likelihoods in parametric survival models for highly clustered censored data. Lifetime Data Anal 22, 382–404 (2016). https://doi.org/10.1007/s10985-015-9337-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10985-015-9337-9

Keywords

Navigation