In this section we show two approaches to discretizing a continuous covariate and testing for its significance based on the generalized estimating equation approach. Here we assume that the response for individual
i, i = 1, …,
n, is possibly multivariate and is denoted by
Yi = (
Yi1, …,
YiJ)t. We allow for the
Y to be continuous or discrete outcomes. Let
μi = E[
Yi] = (μ
i1, …, μ
iJ)t be the mean vector for individual
i. We assume a generalized linear model (GLM) framework, so that the distribution of the
Yij belongs to the same family with mean μ
ij and common dispersion parameter ϕ (
Liang and Zeger, 1986). Also, let
Xi be the continuous covariate associated with individual
i. We assume that the mean and the dichotomized covariate,
Zδ, are related by
where
g(·) is a known link function, α
j for
j = 1, …,
J, δ and γ are unknown parameters and

is the dichotomized covariate obtained from
X, namely
We are interested in estimating the threshold parameter δ as well as the inference on the parameter γ. For a given δ, estimates of parameters can be based on the generalized estimating equations (GEEs) suggested by
Liang and Zeger (1986):
where
β = (γ, α
1, …, α
j)
T and
Vi is a working covariance matrix. GEE estimates of
β,
![[beta]](/corehtml/pmc/pmcents/bgrcirc.gif)
are found by solving
Eq. (2). A sandwich estimator can be used to estimate the covariance matrix of
β:
where
and
Standard packages such as GEESE in R or PROC GENMOD in SAS can be used to obtain estimates and their variances for a given δ.
In our setup, we are interested on the hypothesis H
0 : γ = 0. One test that can be used is the Wald test given by

, which has a standard normal distribution when
H0 is true and δ is known
a priori. A second test is the generalized score test (
Boos, 1992). For the two-sample problem with no covariates,
Liu et al. (2008) show that the score statistic for the test H
0 : γ = 0 is given by
where
n0 is the number of observations with

,
n1 is the number of observations with

is the estimator of α
j under the null hypothesis,

. Under the null hypothesis,

has a chi-square distribution with one degree of freedom when δ is known
a priori.
In order to estimate δ, one common approach is to compute either

for values of δ in some range and pick up the value that maximizes

. This procedure is appropriate for estimating δ, but an adjustment must be made in order to make an inference for γ and preserve the type-I error rate.
The first adjustment that can be made is a GEE version of the approach in
Lausen and Schumacher (1992,
1996). Both the generalized score statistic and the Wald statistic can be used. In order to apply this approach, the possible range of threshold values must be restricted to [
X(nε);
X(n(1−ε))], where
X(k) is the
kth-order statistic of the continuous variable and 0 < ε < 0.5. Let
C (δ) be the test statistic for the two-sample problem with groups defined by δ. It can be shown that
as
n → ∞, where
W0 is a Brownian bridge. Details of the proof can be found in
Lausen and Schumacher (1992) and
Billingsley (1968, chap. 4).
Miller and Siegmund (1982) show that, for
b > 0,
where
![[var phi]](/corehtml/pmc/pmcents/x03C6.gif)
(·) is the standard normal density function. This result motivates the following approximation for a corrected
p-value:
where
z = Φ
−1(1 −
p/2), Φ(·) is the standard normal cumulative distribution function and
p is the unadjusted
p-value computed at the maximum of the test statistic. Notice that the approximation is valid for
p < 0.5. Inferences for γ can be made by computing the corrected
p-value for the maximum value of the test statistic.
The second approach considered here is a modification of the method in
Contal and O’Quigley (1999). To construct the test statistic, recall that, for the two-sample problem without covariates, assuming that the working covariance matrix is the identity matrix, the score function can be written as
where

is the linear predictor and μ
ij =
g−1(η
ij). Let

be the score evaluated under the null hypothesis.
Theorem 1.
Let {ξ
1, ξ
2, …}
be a stationary and ergodic process for which its conditional expected value E(ξ
i|ξ
1, …, ξ
i−1)
is equal to zero with probability 1
and for which the expected value
is positive and finite. Define
If
for p ![[set membership]](/corehtml/pmc/pmcents/x2208.gif)
[0, 1];
then Sn(
p)
W(
p)
as n → ∞
, where W(
p)
is a Brownian motion process. In our setup, in order to apply this result, we must first put the data in increasing order with respect to the continuous covariate
X. Now, with the ordered data, define
and assume that
Yi
Yi′. With this assumption, we can then see that the condition E(ξ
i|ξ
1, …, ξ
i−1) = 0 is verified. It is important to notice that we are assuming that individuals are independent, but observations within the same individual may be correlated. Also, under the null hypothesis H
0 : γ = 0, recalling that we are assuming a common dispersion parameter for distribution of the responses, we have that the
Yi are independent and identically distributed (i.i.d.), so the ξ
i are also independent and identically distributed. Therefore, the ξ
i are stationary and ergodic. Finally, under the null hypothesis, we have
where
VY is the covariance matrix of
Yi and
ġ is the vector of elements

, which is finite. With the ordered data, it is clear that there is a value δ
p for which

. In fact, any value of δ
p between
X([np]) and
X([np]+1) satisfies

, so the midpoint can be used. If we define
we have
Sn(
p)
W(
p) as
n → ∞. Also, the process defined by
Sn(
p) −
pSn(1) converges to a Brownian bridge (
O’Quigley, 2008, pg. 245).
In practice, we replace the unknown quantities in the ξ
i by consistent estimators under the null, so we work with
Ŝn(
p). Following
O’Quigley (2008, pg. 236), also based on Slutsky’s theorem, the large sample properties of the resulting test statistic will not be affected. Also, in general, the values of α
j and υ
2 are not known and we substitute by their estimators. The parameters α
j can be estimated using the estimating equations under the null hypothesis, and for υ
2 we can use the sample variance

. Hence
Therefore, because
Ŝn(1) = 0, we have that
Ŝn(
p) =
Ŝn(
p) −
pŜn(1) (
Klein and Wu, 2004) and, finally,
Ŝn(
p)
W0(
p) as
n → ∞, where
W0(
p) is a Brownian bridge. It has been shown that the distribution of the extreme value of the Brownian bridge (
Billingsley, 1999, pg. 103) is given by
If we order the ξ by the continuous covariate for which we want to find a cutpoint, under the null hypothesis this sequence is formed by i.i.d. observations and the sequence of
Ŝn(
p) constructed with the ξ ordered converges to a Brownian bridge. We can then use as test statistic the maximum of
Ŝn(
p),
p ![[set membership]](/corehtml/pmc/pmcents/x2208.gif)
[0, 1]. If the null hypothesis is rejected, we estimate as cutpoint any value between
X(m) and
X(m+1), where
X(m) is the
mth value of the ordered
Z and
m =
np*, where
p* is the value of
p for which |
Ŝn(
p)| is maximum. An important remark that should be made here is regarding the use of pseudo-observations. Although the pseudo-observations are computed based on a jackknife technique (and, therefore, they are computed based on (
n − 1) observations), it can be shown that under some conditions they can be approximated by independent and identically distributed variables (
Graw et al., 2009), so the results for the ξ remain valid.