Before describing our new approach, we briefly summarize Chen’s results. He considered the following parametric and semiparametric approaches to the estimation of
ψ0: a prospective likelihood approach under the model

that assumes that one has correctly modelled the nuisance baseline function
f (
Y |
a0, L); a retrospective likelihood approach under the model

that assumes that one has correctly specified a model for the nuisance baseline function
g(
A |
y0, L); a joint likelihood approach under the intersection model that assumes that both models

and

are correct; and a doubly robust locally semiparametric efficient approach under the union model

of § 1.
In his doubly robust approach, Chen establishes that in the semiparametric model
![[mathematical script A]](/corehtml/pmc/pmcents/x1D49C.gif)
characterized by the sole restriction
(1), the density
h(
A, Y |
L) can be written as
h(
A, Y |
L; ψ0), where
f (
y |
L, A =
a0) and
g(
a |
Y =
y0,
L) are the unknown conditional densities that generated the data and are solely restricted by ∫
γ (
y, a, L)
f (
y |
L, A =
a0)
g(
a |
Y =
y0, L)
dμ(
a, y) < ∞ almost everywhere. Then, he specifies parametric models
f (
Y |
a0, L; θ) and
g(
A |
y0, L; α) for the unknown nuisance baseline functions
f (
y |
a0, L) and
g(
a |
y0, L), obtains profile estimates
![[theta w/ hat]](/corehtml/pmc/pmcents/x03B8x0302.gif)
(
ψ) and
![[alpha]](/corehtml/pmc/pmcents/agrcirc.gif)
(
ψ) of the nuisance parameters
θ and
α and calculates the efficient score
Ŝeff (
ψ)
Seff {
![[theta w/ hat]](/corehtml/pmc/pmcents/x03B8x0302.gif)
(
ψ),
![[alpha]](/corehtml/pmc/pmcents/agrcirc.gif)
(
ψ)
, ψ} for
ψ in the semiparametric model
![[mathematical script A]](/corehtml/pmc/pmcents/x1D49C.gif)
evaluated at the law [
γ (
y, a, l; ψ)
, f {
y |
a0, l; ![[theta w/ hat]](/corehtml/pmc/pmcents/x03B8x0302.gif)
(
ψ)},
g{
a |
y0,l; ![[alpha]](/corehtml/pmc/pmcents/agrcirc.gif)
(
ψ)}] indexed by {
![[theta w/ hat]](/corehtml/pmc/pmcents/x03B8x0302.gif)
(
ψ)
, ![[alpha]](/corehtml/pmc/pmcents/agrcirc.gif)
(
ψ)
, ψ}. Next, he estimates
ψ0 with the solution
eff to
Pn{
Ŝeff (
ψ)} = 0, where
Pn(
H) = n
−1 ∑
i Hi, and proves that
eff is regular and asymptotically linear and thus consistent and asymptotically normal under the union model

. Further general results of
Robins and Rotnitzky (2001) imply that
Ŝeff (
ψ) is also the efficient score for
ψ in model

under the law [
γ (
y, a, l; ψ)
, f {
y |
a0, l; ![[theta w/ hat]](/corehtml/pmc/pmcents/x03B8x0302.gif)
(
ψ)}
, g{
a |
y0, l; ![[alpha]](/corehtml/pmc/pmcents/agrcirc.gif)
(
ψ)}]. It follows that the estimator
eff is locally semiparametric efficient under model

at the intersection submodel with both nuisance models correct; that is,
eff attains the semiparametric efficiency bound for the model

when both nuisance models happen to hold.
By definition, the efficient score
Seff = Π (
Sψ |

) for a parameter
ψ in a given model is the projection of the score
Sψ for
ψ onto the orthocomplement

to the nuisance tangent space Λ
nuis in the Hilbert space
L2
L2(
FO) of zero-mean functions of
p dimensions,
T
t(
A, Y, L) =
t(
O), with inner product
EFO (
T2)
E(
T2), and corresponding squared norm ‖
T‖
2 =
E(
TTT), where
FO is the distribution function that generated the data. Chen proves that for model
![[mathematical script A]](/corehtml/pmc/pmcents/x1D49C.gif)
, the set
contains all functions that have zero-mean conditional on both (
A, L) and (
Y, L). When both
A and
Y contain continuous components and
ψ0 ≠ 0,
Chen (2007) finds that this projection and therefore
Seff do not exist in closed form and must be computed using the iterative alternating conditional expectations algorithm. Each iteration requires the evaluation, by numerical integration, of conditional expectations, which seriously limits the practicality of Chen’s approach, particularly when
A and/or
Y have two or more continuous components.
The main contribution of our paper is to show that, even though the projection Π(
R |

) of a given random variable
R =
r(
Y, A, L) into the orthocomplement

does not exist in closed form when both
A and
Y contain continuous components, the set

does have a closed-form representation, which appears to be new. We use our representation to obtain doubly robust estimators, i.e. consistent and asymptotically normal estimators of
ψ0 in the union model

, that are nearly as efficient as
eff under the intersection submodel, yet do not require the alternating conditional expectations algorithm. Moreover, our closed-form representation of

is of independent interest, with applications beyond the present paper. For example,
Vansteelandt et al. (2008) use our representation to construct multiple robust estimators of the parameter encoding the interaction on an additive and multiplicative scale between two exposures
A1 and
A2 in their effects on an outcome
Y.
In the special situation where either
Y or
A has finite support,
Bickel et al. (1993) provide a closed-form expression for Π(
R |

), which Chen, however, did not use to give a closed-form expression for
Seff. We remedy this oversight and obtain doubly robust locally-efficient closed-form estimating functions when
Y and/or
A has finite support; some emphasis is given to the important case of dichotomous Y which, incidentally, coincides with the semiparametric logistic regression model.
In the following, for a vector
υ we write
υ
2 =
υυT. To simplify notation, we suppose
y0 =0 and
a0 = 0 throughout, so that
γ (
Y, 0
, L; ψ) =
γ (0
, A, L; ψ) =
γ (
Y, A, L; 0) = 1. We shall also use the following definition.
Definition 1. Given conditional densities f†(Y | L) and g†(A | L), the density h†(Y, A | L) = f†(Y | L)g†(A | L) that makes A and Y conditionally independent given L is an admissible independence density if the joint law of (Y, A) given L under h†(·, ·| L) is absolutely continuous with respect to the true law of (Y, A) given L with probability one. Furthermore, E†(· |·, L) denotes conditional expectations with respect to h†(Y, A | L).