Technological advances facilitating the acquisition of large arrays of biomarker data have led to new opportunities to understand and characterize disease progression over time. This creates an analytical challenge, however, due to the large numbers of potentially informative markers, the high degrees of correlation among them, and the time-dependent trajectories of association. We propose a mixed ridge estimator, which integrates ridge regression into the mixed effects modeling framework in order to account for both the correlation induced by repeatedly measuring an outcome on each individual over time, as well as the potentially high degree of correlation among possible predictor variables. An expectation-maximization algorithm is described to account for unknown variance and covariance parameters. Model performance is demonstrated through a simulation study and an application of the mixed ridge approach to data arising from a study of cardiometabolic biomarker responses to evoked inflammation induced by experimental low-dose endotoxemia.
Keywords: biomarkers, cardiovascular disease (CVD), mixed effects, repeated measures, ridge regression



~ MVN (0, σ2I) is an n × 1 vector of independent errors. It is straightforward to show that the least squares and maximum likelihood solutions to this equation are given by
= (XTX)−1XTY and Var(
is found through an iterative procedure that involves recursively updating Ŷ and S based on the current value of
= DZTV−1(Y – X
(t) = θ0 and
(t+1) = Z
(t+1)ZT +
2(t+1)I