Suppose that
K = 1 and that

with probability 1 for
k = 0, 1, so that no subject dies in neither the actual world nor in the hypothetical world in which
g is enforced in the population. Thus, for
k = 0, 1,
Ok =
Lk since both
Tk and
Rk are deterministic and hence can be ignored. Suppose that
Lk and
Ak are binary variables (and so are therefore

and

) and that the treatment regime
g specifies that
Assume that
Assumption PO imposes two requirements,
Because by definition of regime
g,

, then requirement (
11) can be re-expressed as
Since indicators can only take the values 0 or 1 and

,
l0 = 0, 1 (by assumption (
10)), the preceding equality is equivalent to
that is to say,
By the definition of λ
0 (·|·) (see (
3) in ORR-I), the last display is equivalent to
Likewise, because

, and because

by the fact that

, requirement (
12) can be re-expressed as
or equivalently, (again because the events

and

have the same probability by

,
Under the assumption (
10), the last display is equivalent to
which, by the definition of λ
0 (·|·, ·, ·) in ((
3), ORR-I), is, in turn, the same as
We conclude that in this example, the assumption PO is equivalent to the conditions (
13) and (
14). We will now analyze what these conditions encode.
Condition (
13) encodes two requirements:
- i) the requirement that in the actual world there exist subjects with L0 = 1 and L0 = 0 (i.e. that the conditioning events L0 = 1 and L0 = 0 have positive probabilities), for otherwise at least one of the conditional probabilities in (13) would not be defined, and
- ii) the requirement that in the actual world there be subjects with L0 = 0 that take treatment A0 = 1 and subjects with L0 = 1 that take treatment A0 = 0, for otherwise at least one of the conditional probabilities in (13) would be 0.
Condition i) is automatically satisfied, i.e. it does not impose a restriction on the law of
L0, by the fact that

(since baseline covariates cannot be affected by interventions taking place after baseline) and the fact that we have assumed that

,
l0 = 0, 1.
Condition ii) is indeed a non-trivial requirement and coincides with the interpretation of the PO assumption given in section 3.1 for the case
k = 0. Specifically, in the world in which
g were to be implemented there would exist subjects with
L0 = 0. In such world the subjects with
L0 = 0 would take treatment

, then the PO assumption for
k = 0 requires that in the actual world there also be subjects with
L0 = 0 that at time 0 take treatment
A0 = 1. Likewise the PO condition also requires that for
k = 0 the same be true with 0 and 1 reversed in the right hand side of each of the equalities of the preceding sentence. A key point is that (
11) does not require that in the observational world there be subjects with
L0 = 0 that take
A0 = 0, nor subjects with
L0 = 1 that take
A1 = 1. The intuition is clear. If we want to learn from data collected in the actual (observational) world what would happen in the hypothetical world in which everybody obeyed regime
g, we must observe people in the study that obeyed the treatment at every level of
L0 for otherwise if, say, nobody in the actual world with
L0 = 0 obeyed regime
g there would be no way to learn what the distribution of the outcomes for subjects in that stratum would be if
g were enforced. However, we don t care that there be subjects with
L0 = 0 that do not obey
g, i.e. that take
A0 = 0, because data from those subjects are not informative about the distribution of outcomes when
g is enforced.
Condition (
14) encodes two requirements:
- iii) the requirement that in the actual world there be subjects in the four strata (L0 = 0, L1 = 0, A0 = 1), (L0 = 0, L1 = 1, A0 = 1), (L0 = 1, L1 = 0, A0 = 0) and (L0 = 1, L1 = 1, A0 = 0) (i.e. that the conditioning events in the display (14) have positive probabilities), for otherwise at least one of the conditional probabilities would not be defined, and
- iv) the requirement that in the actual world there be subjects in every one of the strata (L0 = 0, L1 = 0, A0 = 1), (L0 = 0, L1 = 1, A0 = 1), (L0 = 1, L1 = 1, A0 = 0) that have A1 = 0 at time 1 and the requirement that there be subjects in stratum (L0 = 1, L1 = 0, A0 = 0) that have A1 = 1 at time 1, for otherwise at least one of the conditional probabilities in (14) would be 0.
Given condition ii) and the sequential randomization (SR) and consistency (C) assumptions, condition iii) is automatically satisfied, i.e. it does not impose a further restriction on the joint distribution of (
L0,
L1,
A0). To see this, first note that by condition (ii) the strata (
L0 = 0,
A0 = 1) and (
L0 = 1,
A0 = 0) are non-empty. So condition (iii) is satisfied provided
But
and

by (
10). An analogous argument shows that

. Finally, condition (iv) is a formalization our interpretation of assumption PO in section 3.1 for
k = 1. In the world in which
g was implemented there would exist subjects that would have all four combination of values for

. However, subjects with

will only have

, so in this hypothetical world we will see at time 1 only four possible recorded histories,

,

,

and

. In this hypothetical world subjects with any of the first three possible recorded histories will take

and subjects with the last one will take

. Thus, in the actual world we must require that there be subjects in each of the first three strata (
L0 = 0,
L1 = 0,
A0 = 1), (
L0 = 0,
L1 = 1,
A0 = 1), (
L0 = 1,
L1 = 0,
A0 = 0) that take
A1 = 0 and subjects in the stratum (
L0 = 1,
L1 = 1,
A0 = 0) that take
A1 = 1. A point of note is that we don t make any requirement about the existence of subjects in strata other than the four mentioned in (iii) or about the treatment that subjects in these remaining strata take. The reason is that subjects that are not in the four strata of condition (iii) have already violated regime
g at time 0 so they are uninformative about the outcome distribution under regime
g.