Home | About | Journals | Submit | Contact Us | Français |

**|**Cancer Inform**|**v.7; 2009**|**PMC2883306

Formats

Article sections

Authors

Related links

Cancer Inform. 2009; 7: 271–280.

Published online 2009 December 14.

PMCID: PMC2883306

Eppley Cancer Institute, University of Nebraska Medical Center, 986805 Nebraska Medical Center, Omaha, NE. Email: ude.cmnu@mrehss

Copyright © 2009 the author(s), publisher and licensee Libertas Academica Ltd.

This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.

This article has been cited by other articles in PMC.

A simple, computationally efficient procedure for analyses of the time period and birth cohort effects on the distribution of the age-specific incidence rates of cancers is proposed. Assuming that cohort effects for neighboring cohorts are almost equal and using the Log-Linear Age-Period-Cohort Model, this procedure allows one to evaluate temporal trends and birth cohort variations of any type of cancer without prior knowledge of the hazard function. This procedure was used to estimate the influence of time period and birth cohort effects on the distribution of the age-specific incidence rates of first primary, microscopically confirmed lung cancer (LC) cases from the SEER9 database. It was shown that since 1975, the time period effect coefficients for men increase up to 1980 and then decrease until 2004. For women, these coefficients increase from 1975 up to 1990 and then remain nearly constant. The LC birth cohort effect coefficients for men and women increase from the cohort of 1890–94 until the cohort of 1925–29, then decrease until the cohort of 1950–54 and then remain almost unchanged. Overall, LC incidence rates, adjusted by period and cohort effects, increase up to the age of about 72–75, turn over, and then fall after the age of 75–78. The peak of the adjusted rates in men is around the age of 77–78, while in women, it is around the age of 72–73. Therefore, these results suggest that the age distribution of the incidence rates in men and women fall at old ages.

It is well recognized that aging plays a fundamental role in the development of cancer in the human adult population. To describe the relationship between cancer incidence rates and the age of cancer presentation several mathematical models have been proposed (see, for example,^{1}^{–}^{5} and references therein). In these models, the distribution of incidence rates are presented as a set of numbers, *I _{i}*

Often, five-year-long age intervals are considered. For instance, 100 years of human life span can be divided on 20 five-year intervals: 0–4, 5–9, 10–14, …, 95–99. The center of these intervals can represent the corresponding age. The values *I _{i}*

The cross-sectional studies of the cancer incidence rate distributions in aging are different from the *longitudinal* studies of the analogous distributions. In the cross-sectional studies, the number of new cancer diagnoses can be counted simultaneously for different cohorts of people at a given time period, while in the longitudinal studies this data must be obtained for the same cohort of people but in different time periods. Each of these types of studies, cross-sectional vs. longitudinal, has their own advantages and disadvantages. For instance, it is clear that data for cross-sectional studies can be obtained much faster than for the longitudinal studies. In fact, to perform the aforementioned cross-sectional study, one has to collect data over a time period of five years (2000–2004), while for the analogous longitudinal studies, using a cohort of people born, say, in 1905–1909, to get data for all of the considered age intervals one must collect the corresponding incidence rates over 100 years. In addition, studies of cross-sectional data can provide clues to possible time period effects during which data was collected. For instance, implementation of new diagnostic techniques in a particular time period could influence the detection of a given cancer type at earlier ages (*age-period* effect). On the other hand, longitudinal studies (in contrast to cross-sectional data) can determine the influence of cohort effects on the age distribution of cancer rates (*age-cohort* effect). For example, dietary and life-style habits characteristic for a given generation of people can affect the cancer incidence rates.

In this connection, it must be emphasized that cross-sectional and longitudinal studies of cancer incidence rates performed independently can provide inconsistent or even confusing results. For instance, recently, using the SEER (Surveillance Epidemiology and End Results) database,^{6} Harding and colleagues^{4} analyzed the distribution of age-specific cancer incidence rates. For the vast majority of the examined cancers they found that the rates collected during three time periods, 1979–83, 1989–93 and 1999–2003 (cross-sectional data) increase up to the age of about 80 years and then fall at the oldest ages. However, the longitudinal data for lung cancer (LC) presented by Holford^{7} showed that the LC cohort risk increases with age, while the LC time period risk falls at old age. Moreover, it was suggested that there was no turnover, if both time period and cohort effects on cancer rates were considered.^{8}

Accounting for time period and cohort effects (*age-period-cohort* model) represents a main challenge for mathematical modeling of relationship between cancer incidence rates and age of cancer presentation. This is because mathematically this problem falls into a category of so called *identifiability* problems with multiple estimators.^{9} In general, parameters to be determined (i.e. estimates for time period and cohort effects) as a solution of the considered problem, cannot be unambiguously identified. In other words, multiple estimators can provide equally good solutions for the problem and “true” age-period-cohort effects are difficult (if not impossible) to estimate simultaneously.^{9}^{–}^{15} The only hope for solving this problem (obtain consistent estimation for period and cohort effects) is to utilize an additional assumption on the data that is used.

Until recently, mathematical modeling of the relationship between cancer incidence rates and the age of cancer presentation has been performed exclusively using cross-sectional data.^{1}^{–}^{4} To address this short-coming, we are proposing a simple, computationally efficient procedure for analyses of time period and birth cohort effects on the distribution of the age-specific incidence rates of cancers. Assuming that cohort effects for neighboring cohorts are almost equal and using the Log-Linear Age-Period-Cohort (LLAPC) Model, this procedure allows one to evaluate temporal trends and birth cohort variations of any type of cancer without prior knowledge of the hazard function. The proposed approach was used to analyze the influence of the time period and birth cohort effects on the LC incidence rate distributions. Only first primary, microscopically confirmed cases from the SEER9 database^{6} over the period of 1975–2004 were considered. Using a novel approach, which is valid for any hazard function, we demonstrated that the time period trends in men and women are different in LC, while the cohort trends are similar. We also demonstrated that the distribution of these incidence rates falls at old ages, even after accounting for time period and birth cohort effects.

We describe a novel, computationally efficient procedure for the analysis of the time period and birth cohort effects in the frame of the LLAPC model. This procedure is tested on the example of LC.

In our study, we used data from only the SEER registries^{6} that correspond to the following nine (SEER9) areas: Atlanta, Connecticut, Detroit, Hawaii, Iowa, New Mexico, San Francisco-Oakland, Seattle-Puget Sound, and Utah. We used these nine registries, rather than the current set of seventeen, because the longitudinal nature of our study requires us to use data dating back two decades when there were only nine registries. First primary, microscopically confirmed LC cases from the SEER9 database for patients with known gender and race were considered to be “filtered” data, whereas the cases where such filtering was not performed were considered to be “raw” data. We used only filtered data that are more reliable and homogeneous than raw data.^{5}^{,}^{16} The incidence rates, *I*(*t*), expressed per 100,000 persons and age-adjusted by the direct method to the 2000 United States standard population,^{17} and their standard errors, *SE*, were utilized. The data were combined in six five-year cross-sectional time periods: 1975–79, …, 2000–2004. The gender-specific incidence rates were grouped into 18 five-year age groups: 17 groups, ranging from 0 to 84 years, and the 18th group that included all cases for ages 85+.

Table 1 presents approximations of the observed incidence rates, *I _{i}*

$${I}_{i,j}({t}_{i})={v}_{j}{u}_{l}h({t}_{i})\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}i=1,\hspace{0.17em}\dots ,n;\hspace{0.17em}\hspace{0.17em}j=1,\hspace{0.17em}\dots ,\hspace{0.17em}m;\hspace{0.17em}l=1,\hspace{0.17em}\dots ,\hspace{0.17em}k;$$

(1)

where *i*, *j*, and *l* denote the given age interval, time period, and cohort, correspondingly; *n*, *m*, and *k* are numbers of the age intervals, time periods, and cohorts, correspondingly.

Presentation of the observed incidence rates by hazard function, *h*(*t*), and the time period (*v*) and birth cohort (*u*) coefficients.

In this table, the approximations of the cross-sectional data for six time periods 1975–79, …, 2000–2004 (index *j* = 1, …, 6) are shown in columns, while the approximations of the incidence rates for the same cohort groups (longitudinal data) are located along diagonals. We used only the data for the groups over age 30 (*i* = 7, …, 18), because the incidence rates for these groups were significant (according to SEER practice, the number of cases should exceed 15 to be statistically significant). We consider 17 birth cohorts (*l* = 1, …, 17), corresponding to birth year ranges of 1890–94, …, 1970–74. From this table one can see that *l* can be presented as *l* = *j* − *i* + 18.

Assuming that the numbers of cases have a Poisson distribution and the mathematical form of the hazard function is known *a priori* and the LLAPC model is used, one can make adjustments by using the maximum likelihood method for assessing the birth cohort and time period effect coefficients as well as parameters of the hazard function. These coefficients can be estimated by anchoring one time period coefficient (*v* = 1) and one birth cohort effect coefficient (*u* = 1).^{8}^{,}^{18}^{,}^{19} *Note*: the results of this procedure depend on the hazard function, and also the time period and cohort, to which the coefficients are anchored.

Below we describe a procedure that provides results independent of the hazard function. The hazard function values, presented in Table 1, can be canceled out by dividing the corresponding elements of the neighboring columns with indices *j* and *j* + 1 or *j* + 1 and *j*. Then from (1), one can obtain a pair of systems:

$$\begin{array}{c}\frac{{I}_{i,j}({t}_{i})}{{I}_{i,j+1}({t}_{i})}=\frac{{v}_{j}}{{v}_{j+1}}\frac{{u}_{l}}{{u}_{l+1}}\\ i=7,\hspace{0.17em}\dots ,18;\hspace{0.17em}j=1,\hspace{0.17em}\dots ,5;\hspace{0.17em}l=j-i+18\end{array}$$

(2)

$$\begin{array}{c}\frac{{I}_{i,j+1}({t}_{i})}{{I}_{i,j}({t}_{i})}=\frac{{v}_{j+1}}{{v}_{j}}\frac{{u}_{l+1}}{{u}_{l}}\\ i=7,\hspace{0.17em}\dots ,18;\hspace{0.17em}j=1,\hspace{0.17em}\dots ,5;\hspace{0.17em}l=j-i+18\end{array}$$

(3)

In (2) and (3), *I _{i}*

$$\begin{array}{c}C{V}_{i,j}=\frac{S{E}_{i,j}}{{I}_{i,j}({t}_{i})},\\ i=7,\hspace{0.17em}\dots ,18;\hspace{0.17em}j=1,\hspace{0.17em}\dots ,\hspace{0.17em}6;\hspace{0.17em}l=j-i+18\end{array}$$

(4)

*Note*: (2) provides 12 × 5 conditional equations for assessing five ratios of the time period coefficients (*v _{j}*/

In order to solve this identifiability problem, additional assumptions are required.^{9}^{–}^{15} Assuming that any pair of the neighboring cohorts has the cohort effect coefficient ratio close to 1, these ratios can be set equal to 1 in (2) and (3). The rationale behind this assumption is that the adjacent cohorts usually overlap in time intervals and thus values of their cohort effect coefficient should be close. Now for estimating five ratios, *v _{j}*/

$$\frac{{I}_{i,j}({t}_{i})}{{I}_{i,j+1}({t}_{i})}=\frac{{v}_{j}}{{v}_{j+1}}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}i=7,\hspace{0.17em}\dots ,\hspace{0.17em}18;\hspace{0.17em}\hspace{0.17em}j=1,\hspace{0.17em}\dots ,\hspace{0.17em}5.$$

(5)

$$\frac{{I}_{i,j+1}({t}_{i})}{{I}_{i,j}({t}_{i})}=\frac{{v}_{j+1}}{{v}_{j}}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}i=7,\hspace{0.17em}\dots ,18;\hspace{0.17em}\hspace{0.17em}j=1,\hspace{0.17em}\dots ,5.$$

(6)

When coefficients of variation (4) are small, the standard errors of the ratios, *I _{i,j}*(

$$\begin{array}{l}S{E}^{2}\left(\frac{{I}_{i,j}({t}_{i})}{{I}_{i,j+1}({t}_{i})}\right)\\ \hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}={\left(\frac{{I}_{i,j}({t}_{i})}{{I}_{i,j+1}({t}_{i})}\right)}^{2}\left(\frac{S{E}_{i,j}^{2}({t}_{i})}{{I}_{i,j}^{2}({t}_{i})}+\frac{S{E}_{i,j+1}^{2}({t}_{i})}{{I}_{i,j+1}^{2}({t}_{i})}\right)\end{array}$$

(7)

$$\begin{array}{l}S{E}^{2}\left(\frac{{I}_{i,j+1}({t}_{i})}{{I}_{i,j}({t}_{i})}\right)\\ \hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}={\left(\frac{{I}_{i,j+1}({t}_{i})}{{I}_{i,j}({t}_{i})}\right)}^{2}\left(\frac{S{E}_{i,j+1}^{2}({t}_{i})}{{I}_{i,j+1}^{2}({t}_{i})}+\frac{S{E}_{i,j}^{2}({t}_{i})}{{I}_{i,j}^{2}({t}_{i})}\right)\end{array}$$

(8)

It can be shown that when the numerators and denominators in (5) and (6) are normally distributed and their coefficients of variation are small, then the ratios presented in (5) and (6) will be also normally distributed. In fact, let us assume that we have random variables *A*_{1} = *a*_{1} + _{1} and *A*_{2} = *a*_{2} + _{2}, where *a*_{1} ≠ 0 and *a*_{2} ≠ 0 are constants and _{1} and _{2} are normally distributed random variables with zero means and standard deviations σ_{1} and σ_{2}, correspondingly. When coefficients of variation are small (i.e. σ_{1}/*a*_{1} 1 and σ_{2}/*a*_{2} 1), then one can express the *A*_{1}/*A*_{2} and *A*_{2}/*A*_{1} ratios in the bivariate Taylor series around *a*_{1}/*a*_{2} and *a*_{1}/*a*_{2}, and consider their linear approximations:

$$\begin{array}{l}\frac{{A}_{1}}{{A}_{2}}=\frac{{a}_{1}+{\varepsilon}_{1}}{{a}_{2}+{\varepsilon}_{2}}=\frac{{a}_{1}}{{a}_{2}}+\frac{1}{{a}_{2}}{\varepsilon}_{1}-\frac{{a}_{1}}{{a}_{2}^{2}}{\varepsilon}_{2}\\ \frac{{A}_{2}}{{A}_{1}}=\frac{{a}_{2}+{\varepsilon}_{2}}{{a}_{1}+{\varepsilon}_{1}}=\frac{{a}_{2}}{{a}_{1}}+\frac{1}{{a}_{1}}{\varepsilon}_{2}-\frac{{a}_{2}}{{a}_{1}^{2}}{\varepsilon}_{1}\end{array}$$

Because _{1} and _{2} are normally distributed variables, these linear combinations will be also normally distributed.

In the considered incidence rate data, coefficients of variation are less than 0.1, therefore the errors of the observed incidence rate ratios of the systems (5) and (6) can be considered as normally distributed. For estimation of *v _{j}*/

After anchoring any time period coefficient (for example, assuming that *v*_{6} = 1 and *SE*(*v*_{6}) = 0), one can obtain step by step the following estimates of *v _{j}*

$$\begin{array}{c}{v}_{5}^{*}={\left(\frac{{v}_{5}}{{v}_{6}}\right)}^{*};\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}{v}_{4}^{*}={\left(\frac{{v}_{4}}{{v}_{5}}\right)}^{*}\hspace{0.17em}{v}_{5}^{*};\hspace{0.17em}{v}_{3}^{*}={\left(\frac{{v}_{3}}{{v}_{4}}\right)}^{*}\hspace{0.17em}{v}_{4}^{*};\\ {v}_{2}^{*}={\left(\frac{{v}_{2}}{{v}_{3}}\right)}^{*}\hspace{0.17em}{v}_{3}^{*};\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}{v}_{1}^{*}={\left(\frac{{v}_{1}}{{v}_{2}}\right)}^{*}{v}_{2}^{*}\end{array}$$

(9)

For the other anchored time period coefficients, the estimates of *v _{j}*

After estimating the time period coefficients, the incidence rates can be corrected for the time effects and the following system can be obtained from (2) and (3):

$$\begin{array}{c}\frac{{I}_{i,j}^{T}({t}_{i})}{{I}_{i,j+1}^{T}({t}_{i})}=\frac{{u}_{l}}{{u}_{l+1}}\\ l=1,\hspace{0.17em}\dots ,17;\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}j=1,\hspace{0.17em}\dots ,\hspace{0.17em}5;\hspace{0.17em}\hspace{0.17em}i=j-l+18\end{array}$$

(10)

$$\begin{array}{c}\frac{{I}_{i,j+1}^{T}}{{I}_{i,j}^{T}}=\frac{{u}_{l+1}}{{u}_{l}}\\ l=1,\hspace{0.17em}\dots ,\hspace{0.17em}17;\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}j=1,\hspace{0.17em}\dots ,\hspace{0.17em}5;\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}i=j-l+18\end{array}$$

(11)

Here
${I}_{i,j}^{T}({t}_{i})$ denotes incidence rates corrected for the time effects. By the standard rules of error propagation, one can calculate the *SE* of the incidence rates ratios presented on the left side of (10) and (11). Now, there are 12 × 5 conditional equations for assessing 16 ratios of the cohort effect coefficients *u _{l}*/

$$\begin{array}{c}{u}_{10}^{*}={\left(\frac{{u}_{10}}{{u}_{9}}\right)}^{*},\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}{u}_{11}^{*}={\left(\frac{{u}_{11}}{{u}_{10}}\right)}^{*}\hspace{0.17em}{u}_{10}^{*},\hspace{0.17em}\dots ,\\ {u}_{17}^{*}={\left(\frac{{u}_{17}}{{u}_{16}}\right)}^{*}{u}_{16}^{*}\end{array}$$

(12)

and

$$\begin{array}{c}{u}_{8}^{*}={\left(\frac{{u}_{8}}{{u}_{9}}\right)}^{*},\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}{u}_{7}^{*}={\left(\frac{{u}_{7}}{{u}_{8}}\right)}^{*}{u}_{8}^{*},\hspace{0.17em}\dots ,\\ {u}_{1}^{*}={\left(\frac{{u}_{1}}{{u}_{2}}\right)}^{*}{u}_{2}^{*}\end{array}$$

(13)

After evaluating time period and cohort effect coefficients, one can divide the initial incidence rates, *I _{i}*

$$\begin{array}{c}{I}_{i,j}^{*}({t}_{i})=\frac{{I}_{i,j}({t}_{i})}{{v}_{j}^{*}{u}_{l}^{*}},\\ i=7,\hspace{0.17em}\dots ,\hspace{0.17em}18;\hspace{0.17em}\hspace{0.17em}j=1,\hspace{0.17em}\dots ,\hspace{0.17em}6;\hspace{0.17em}\hspace{0.17em}\hspace{0.17em}l=j-i+18\end{array}$$

(14)

The aforementioned approach looks similar to one used in.^{18} However, the approach used in^{18} is based on the assumption that the birth cohort effects are absent. This allows the authors of^{18} to evaluate coefficients *v _{j}*. Then, using the obtained time period effects, they correct the observed incidence rates and after that, estimate the

Table 2 shows the time period distributions (presented in columns) of the first primary, microscopically confirmed incidence LC rates for women. The observed patterns of the cross-sectional data are shown along columns and longitudinal data along diagonals. The cross-sectional and longitudinal data for the three consecutive cohorts that contain observations for the elderly exhibit turnovers at old age. Analogous observations for men also have turnovers (data not shown).

Filtered LC incidence rates for women collected in the SEER9 database during 1975–2004. Arrows show cohorts that turn over.

The longitudinal patterns shown in Table 2 are different from those presented in.^{7} In,^{7} using the raw LC incidence rates for women collected in Connecticut during the years of 1940–1984, it was shown that the longitudinal risks always increased with age (see Table 1 in^{7}). This discrepancy can be explained by the fact that in contrast to,^{7} where raw data was used, we analyzed filtered SEER9 data collected during 1975–2004.

Figures 1A and and2A2A show the distributions of the filtered incidence rates for men and women, correspondingly.

After estimating the time period and birth cohort effects, the incidence rates were adjusted to the 2000–2004 time period and to the 1945–1949 birth cohort. The adjusted rates are shown in Figures 1B and and2B.2B. As can be seen, the adjusted LC incidence rate distributions for both men and women have turnovers at old ages.

The adjusted rates for men and women increase (starting from the age of 30), reach the maximum and then fall at old ages. The differences are only in the age at which the distributions reach the maximum, and in the maximum values of the corresponding incidence rates. For men, this maximum is near the age of 77–78, while for women, it is near the age of 72–73. These patterns are different from the linear patterns (up to the age of 85) obtained in^{8} by accounting for time period and cohort effects on cancer. Again, this discrepancy can be explained by the use of raw data and an *a priori* assumed form of the hazard function for the time period and birth cohort adjustments used in,^{8} whereas we utilized only filtered data and our approach is independent of the hazard function.

Panels C and D of Figures 1 and and22 show the changes of the time period and cohort effect coefficients for men and women, correspondingly. The time period effect coefficients for men increase from the year 1975 to 1980 and then decrease until 2004. For women, these effects increase from 1975 to 1990 and then remain nearly constant. The birth cohort effect coefficients for men and women are similar; they increase from the cohort of 1890–94 until the cohort of 1925–29, then decrease until the cohort of 1950–54 and after that remain almost unchanged. It is possible that the observed temporal differences of the LC rates in men and women can be explained by the gender-specific smoking habits as it was suggested in^{21} (see also references in that paper).

For analyses of the time period and birth cohort effects on the distribution of the age-specific incidence rates of cancers, a simple, computationally efficient procedure, which does not require any prior knowledge of the hazard function, was proposed. Our approach uses the LLAPC model and assumes that cohort effects for neighboring cohorts are almost equal. The proposed procedure was used for analyzing the influence of the time period and birth cohort effects on the LC incidence rate distributions. However, this procedure can be applied for different types of cancers as well as for epidemiological studies of chronic diseases.

We found that the incidence rates of first primary, microscopically confirmed LC cases from the SEER9 database, adjusted by period and cohort effects, increase for both women and men, then turn over (at ages of about 72–73 and 77–78 for women and men, correspondingly) and fall at older ages. Thus, by utilizing the longitudinal and cross-sectional data and by accounting for time period and cohort effects, we have demonstrated that the LC incidence rates have a turnover at old ages, and the age at which this turnover takes place, is gender-specific. The explanation of this phenomenon should be a subject for future studies.

**Disclosures**

The authors report no conflicts of interest.

1. Armitage P, Doll R. The age distribution of cancer and a multistage theory of carcinogenesis. Br J Cancer. 1954;8:1–12. [PMC free article] [PubMed]

2. Cook PJ, Doll R, Fellingham SA. A mathematical model for the age distribution of cancer in man. Int J Cancer. 1969;4:93–112. [PubMed]

3. Pompei F, Wilson R. Age distribution of cancer: the incidence turnover at old age. Hum Ecol Risk Assess. 2001;7:1619–50. [PubMed]

4. Harding C, Pompei F, Lee E, Wilson R. Cancer Suppression at Old Age. Cancer Res. 2008;68:4465–78. [PubMed]

5. Mdzinarishvili T, Gleason MX, Kinarsky L, Sherman S. A generalized beta model for age distribution of cancers: application to pancreatic and kidney cancer. Cancer Inform. 2009;7:183–97. [PMC free article] [PubMed]

6. Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov) Limited-Use Data (1973–2004), National Cancer Institute, DCCPS, Surveillance Research Program, Cancer Statistics Branch, released April 2007, based on the November 2006 submission.

7. Holford TR. Understanding the effects of age, period, and cohort on incidence and mortality rates. Annu Rev Public Health. 1991;12:425–57. [PubMed]

8. Meza R, Jeon J, Moolgavkar SH, Luebeck EG. Age-specific incidence of cancer: phases, transitions, and biological implications. Proc Natl Acad Sci U S A. 2008;105:16284–9. [PubMed]

9. Fu WJA. Smoothing Cohort Model in Age-Period-Cohort Analysis with Applications to Homicide Arrest Rates Lung Cancer Mortality Rates. Sociol Method Res. 2008;36:327–61.

10. Mason KO, Mason WM, Winsborough HH, Poole WK. Some Methodological Issues in the Cohort Analysis of Archival Data. Am Sociol Rev. 1973;38:242–58.

11. Rogers WL. Estimable Functions in Age, Period and Cohort Effects. Am Sociol Rev. 1982;47:774–87.

12. Clayton D, Schifflers E. Models for temporal variation in cancer rates. I: age-period and age-cohort models. Statistics in Medicine. 1987;6:449–67. [PubMed]

13. Clayton D, Schifflers E. Models for temporal variation in cancer rates. II: age-period-cohort models. Statistics in Medicine. 1987;6:469–81. [PubMed]

14. Moolgavkar SH, Lee JAH, Stevens RG. Analysis of vital statistical data. In: Rothman K, Greenland S, editors. Modern Epidemiology. 2nd Ed. Lippincott-Raven, PA: 1998. pp. 482–97.

15. Yang Y, Fu WJ, Land K. A Methodological Comparison of Age-Period-Cohort Models: The Intrinsic Estimator and Conventional Generalized Linear Models. Sociol Methodol. 2004;34:75–110.

16. Ries LAG, Young JL, Keel GE, Eisner MP, Lin YD, Horner MJ, editors. SEER Survival Monograph: Cancer Survival Among Adults: U.S. SEER Program, 1988–2001, Patient and Tumor Characteristics National Cancer Institute, SEER Program, NIH Pub. No. 07-6215, Bethesda, MD: 2007

17. Surveillance, Epidemiology, and End Results (SEER) Program Standard Populations (Millions) for Age-Adjustment [cited 2009 Feb 2]Available from: http://seer.cancer.gov/stdpopulations/stdpop.singleagesthru99.txt

18. Luebeck EG, Moolgavkar SH. Multistage carcinogenesis and the incidence of colorectal cancer. Proc Natl Acad Sci U S A. 2002;99:15095–100. [PubMed]

19. Moolgavkar SH, Meza R, Turim J. Pleural and peritoneal mesotheliomas in SEER: age effects and temporal trends, 1973–2005. Cancer Causes Control. 2009;20(6):935–44. [PubMed]

20. Lindberg V. Guide to uncertainties and error propagation Rochester, NY: c1999–2003 [updated 2003 Aug; cited 2009 Feb 2]. Available from: http://www.rit.edu/cos/uphysics/uncertainties/Uncertainties.html

21. Zheng T, Holford TR, Boyle P, et al. Time trend and the age-period-cohort effect on the incidence of histologic types of lung cancer in Connecticut, 1960–1989. Cancer. 2006;74(5):1556–67. [PubMed]

Articles from Cancer Informatics are provided here courtesy of **SAGE Publications**

PubMed Central Canada is a service of the Canadian Institutes of Health Research (CIHR) working in partnership with the National Research Council's national science library in cooperation with the National Center for Biotechnology Information at the U.S. National Library of Medicine(NCBI/NLM). It includes content provided to the PubMed Central International archive by participating publishers. |