Название | The Statistical Analysis of Doubly Truncated Data |
---|---|
Автор произведения | Prof Carla Moreira |
Жанр | Медицина |
Серия | |
Издательство | Медицина |
Год выпуска | 0 |
isbn | 9781119500476 |
An important difference of double truncation when compared to one‐sided truncation is that, with doubly truncated data, the NPMLE of the probability distribution has no explicit form. In fact, the NPMLE may be non‐unique and even non‐existing (Xiao and Hudgens, 2019); see Chapter 2. Several iterative algorithms that have been proposed to compute the NPMLE in practice (Efron and Petrosian, 1999; Shen, 2010) will be reviewed in this book, and simulated and real data examples will be analysed with existing libraries of the software R
. Semiparametric and parametric alternatives to the NPMLE will be introduced too; these approaches avoid some of the aforementioned potential issues of non‐uniqueness or non‐existence of the NPMLE, also reducing the variance at the price of introducing some bias in estimation. Also, resampling procedures, testing problems, smoothing methods, regression models and multi‐state data analysis under double truncation will be presented.
1.4 Real Data Examples
In this section we introduce the datasets that will be used throughout the book for illustration purposes. All of them suffer from double truncation. These examples are available within the last update of the DTDA
package (Moreira et al., 2021a).
1.4.1 Childhood Cancer Data
The Childhood Cancer Data were gathered from the IPO (Instituto Português de Oncologia) of Porto, Portugal, by the RORENO (Registro Oncológico do Norte) service. The information corresponds to all children diagnosed from cancer between 1 January 1999 (
Because of the interval sampling, the age at diagnosis
Interestingly, the observed values for
Table 1.1 Descriptive statistics for Childhood Cancer Data: sample size
Group |
|
Mean (SD) | |
---|---|---|---|
All | 406 | 6.47 (4.50) | |
By gender | Female | 178 | 6.43 (4.51) |
Male | 228 | 6.51 (4.51) | |
|