In this section, we present the multiple testing formulation of the problem of identifying expression patterns/trends over ordered categories simultaneously for all the genes, having introduced some notations and definitions related to multiple testing.
Let
μij denote the mean response of the
jth variable (e.g. gene),
j = 1, … ,
m, in the
ith ordered category,
i = 1, … ,
p. A problem of biological interest is to group genes by the inequalities among the mean responses, known as directional patterns or order restrictions. Some common inequalities of interest are
μ1j
μ2j ![[less-than-or-eq, slant]](/corehtml/pmc/pmcents/les.gif)
…
μpj (monotone pattern),
μ1j
μ2j ![[less-than-or-eq, slant]](/corehtml/pmc/pmcents/les.gif)
…
μij
μ(i+1)j ![[gt-or-equal, slanted]](/corehtml/pmc/pmcents/ges.gif)
…
μpj,
i = 2, … ,
p − 1 (umbrella order with peak
μij). Let
δij =
μi+1j −
μij,
i = 1, … ,
p − 1,
j = 1, … ,
m. Then, the above inequalities of interest or any other inequalities can be restated in terms of the signs of the
δij's. Let
δj = (
δ1j, … ,
δqj)′, where
q =
p − 1. Suppose we test
and decide for a rejected
H0j which component
δij's are non-zero before declaring their signs to be positive or negative depending on the values of the corresponding test statistics. The declared signs of the
δij's then determine a possible inequality or directional pattern. For instance, in the case of
q = 4, suppose for a given gene
j, the
δj = (
δ1j, … ,
δ4j) is found significantly different from a null vector, with
δ1j and
δ2j declared to be positive and negative, respectively, and
δ3j and
δ4j zeros. Then, the corresponding directional pattern is
μ1j <
μ2j <
μ3j =
μ4j =
μ5j. We can test
H0j against
H1j for all the genes applying a suitable multiple testing method. Thus, given
p ordered categories for each gene, the task of identifying directional patterns of the mean expressions over these categories for all the genes is being formulated as a multiple testing problem where
H0j is tested against
H1j simultaneously for all the genes and the signs of the
δij's are determined subsequent to the rejection of the corresponding
H0j.
For multiple testing of
H0j against
H1j,
j = 1, … ,
m, we need
p-values that will provide a valid test for each of these individual testing problems and will allow us to make decisions on the individual
δij's once a
H0j is rejected. For that, we consider for each
j the
p-value available for testing each component null hypothesis

against the corresponding component alternative hypothesis

, for
i = 1, … ,
q, and apply a suitable combination method pooling these
qp-values by treating
H0j as an intersection of the subfamily of these
q component null hypotheses, that is,

, and
H1j as a union of the corresponding
q alternative hypotheses, that is,

. Before we discuss appropriate combination methods to be used, let us explain how to obtain these component
p-values and state the underlying assumptions.
For every
i = 1,

,
q and
j = 1,

,
m, suppose we use the absolute value of a test statistic
Tij for testing

against

. Let
Tij ~
Fij(
t,
δij) for some continuous cdf F, which is symmetric about 0 under

and gets stochastically larger or smaller as
δij either increases or decreases from 0. In other words, with
Fij(
t,
δij) denoting the cdf of
Tij at t under the parameter
δij, we have
Fij(
t,
δij)
![[less-than-or-eq, slant]](/corehtml/pmc/pmcents/les.gif)
or
Fij(
t, 0) according as
δij > or < 0, and

. Under this setting, a right-tailed test based on the absolute value of
Tij will be considered for testing

against

, with the corresponding two-sided
p-value being defined as
ij = 2 min {
Fij(
Tij, 0), 1 −
Fij(
Tij, 0)}. By the assumed distributional property of
Tij, it is easy to verify that under

, the two-sided
p-value
ij satisfies
Given
p-values for testing

against

, for
i = 1, … ,
q, a number of combination methods (or methods of pooling the
p-values) are available in the literature for testing the intersection null hypothesis

against the alternative

. Among these, however, the Bonferroni and Simes methods are often used in multiple testing and allow one to make decisions on the individual
δij's. For a review of these methods, one may see
Bernhard et al. (2004). Let
(1)j
(q)j be the ordered versions of
ij,
i = 1,

,
q, for a fixed
j = 1, … ,
m. Then, in the Bonferroni test, the pooled (or adjusted)
p-value is given by
Pj =
q
(1)j; whereas, in the Simes test, it is given by
Pj = min
1
i
q {
q
(i)j/
i}. While the Bonferroni test does not require any dependence structure in the underlying
p-values, the Simes test requires a certain type of positive dependence condition that is often satisfied in multiple testing applications (
Sarkar and Chang, 1997). Upon rejection of
H0j using the Bonferroni pooled
p-value at a level
α, the
ith component null hypothesis

can be rejected if
ij
α/
q. For the test based on the Simes pooled
p-value,

corresponding to every
ij
(Rj)j is rejected, where

, if the maximum exists; otherwise,
Rj = 0.
Now, suppose the pooled
p-value
Pj, based on either Bonferroni or Simes test, is available to us for every
j = 1, … ,
m, to carry out a multiple testing procedure to test
H0j against
H1j simultaneously for all
j = 1, … ,
m. We will use the multiple testing method of
Benjamini and Hochberg (1995) (the BH method) that is designed to control the false discovery rate (FDR). The FDR, for any given multiple testing procedure, is the expected proportion of false rejections (Type I errors) among all rejections, an overall measure of Type I error rate commonly used in microarray studies. More formally, with
V the number of falsely rejected true null hypotheses among
H1, … ,
Hm and
R the total number of rejected hypotheses among
H1, … ,
Hm, it is defined as
where
R![[logical or]](/corehtml/pmc/pmcents/x2228.gif)
1 = max(
R, 1). This method with a control of the FDR at a given level
α is a stepup test that, given ordered
p-values
P(1)
P(m) with the corresponding null hypotheses
H(1),
H(m), finds
k = max {1
j
m :
P(j)
jα/
m} and rejects those
H(j) for which
P(j)
P(k), provided this maximum exists, otherwise, accepts all the null hypotheses.
When a
H0j :
δj =
0 is rejected using the BH method and further decisions are being made on the signs of the component
δij's in the corresponding
δj, a directional error might occur due to wrong assignments of the signs. For instance, if there is a component
δij in
δj = (
δ1j, … ,
δqj) that is truly positive (or negative) but declared to be negative (or positive) while deciding on the signs of the
δij's upon rejection of
H0j :
δj =
0, a directional error occurs. So, we need to control such directional errors as well. A convenient and practical way of doing that would be to use an error rate combining both Type I and directional errors in the FDR framework and make sure that it is controlled. One such error rate is the mixed directional FDR (mdFDR), the sum of the FDR and the pure directional FDR (dFDR). The dFDR is defined as
where
S denotes the total number of false null hypotheses among
H1, … ,
Hm that are correctly rejected but at least one directional error has been made while deciding upon the signs of the components. In other words,
S is the number of rejected hypotheses
Hj's such that
δj ≠ 0 and for some
i = 1, … ,
q,
δij is declared to be positive when
δij ![[less-than-or-eq, slant]](/corehtml/pmc/pmcents/les.gif)
0, or
δij is declared to be negative when
δij ![[gt-or-equal, slanted]](/corehtml/pmc/pmcents/ges.gif)
0. Thus, more formally, the mdFDR is defined as
the expected proportion of Type I and directional errors among all rejections.
It is important to point out that the goal of this paper is to identify expression patterns of m genes over p ordered categories. For each gene it is biologically relevant to consider its expression pattern as a whole across p ordered categories rather than viewing this to be a problem of testing qm separate hypotheses which ignores the intrinsic biological structure present in the problem. Thus, rather than viewing it as a problem of performing qm tests, we treat it as a problem of performing a set of m tests each involving q-dimensional hypothesis. In addition, we want to emphasize that while making directional decisions for the components of δj, no directional errors are being made when δj = 0. In contrast, when making directional decisions regarding a non-null δj, a directional error is made if a component δij for which δij = 0 is declared to be positive or negative.
In the next section, we will develop methods to control the mdFDR. This will extend the following directional BH procedure of
Benjamini and Yekutieli (2005) from dimension one (i.e.,
q = 1) to a general dimension.
Definition 1 (The level-α directional BH Procedure)
- (1) Apply the BH method at level α to test H0j : δ1j = 0 against H1j : δ1j ≠ = 0 simultaneously for j = 1, … , m, based on the two-sided p-values
1j j = 1, … m. - (2) Let R denote the total number of null hypotheses rejected.
- (3) For every j = 1,
, m, with
, declare δ1j > or < 0 according as T1j > 0 or < 0.
It controls the mdFDR at α under independence of the underlying test statistics.