Fourier transform infrared (FT-IR) spectroscopic imaging
1 provides simultaneous chemical and structural information from heterogeneous materials of interest
2 and is being used increasingly for biomedical studies, especially involving cells and tissues.
3, 4, 5, 6, 7, 8 Most biomedical samples, however, are chemically complex. Hence, their analysis often relies on treating the spectrum as a characteristic signature of the identity and/or physiologic state of the sample. Many studies seek to find the unique spectral signature or differences in spectral signatures between given classes of samples from a statistical, rather than purely biochemical, perspective. These classes may be tissue with different grades of disease or different cell types within the same tissue type, for example. Finding an IR imaging-based approach that can distinguish between disease states is of tremendous technological and medical importance as it can potentially improve diagnostic information, reduce costs and prevent errors. The tasks in this approach would be to discover differences in spectral properties of classes and develop a computer algorithm such that every spectrum (pixel) can be classified into a particular class without using dyes, stains or human supervision.
9 Though conceptually straightforward, this approach is exceptionally challenging not only because of the subtle differences between various components and disease states in tissue but also because of the variation in IR spectra that obscures differences between disease states. This variation may overwhelm differences due to disease states and is a prime cause of the failure of many analytical methods in providing robust diagnostic protocols. Quantification of the sources of analytic variability and redressing them, hence, are topics of much interest in IR spectroscopy
10 and other analytical technologies.
11,12,13Analytic variability can arise from (a) noise in signal measurement,
10,14 (b) differences within the tissue that leads to differences both within a given sample and between samples from the same patient, (c) differences between patients due to biologic diversity, (d) differences due to sample handling in different clinical settings or research groups and (e) causes not falling into any of the above categories. The variation may also be understood to be biological, technical or residual. Biological variation arises from different biological characteristics of samples such as patients, tissues, cells, subcellular components, etc. It is natural and expected variation, and often of interest in an experiment. Technical variation is attributable to both sample preparation and analytical techniques. Potential sources of technical variation include tissue acquisition,
15,16 fixation,
17 and sectioning, placement of tissue section on the slide
16 and post-preparation handling.
18 The process of data acquisition also introduces variation, such as measurement noise.
19 Although thoroughly identified, these potential sources of variation may not completely explain the total variation in a measurement. Residual variation refers to the unexplained variation in the experiment; for example, environmental conditions – room temperature and humidity – that may not be part of the sample or acquisition characteristics. Accordingly, residual variation will usually be present and, on occasion, can have a substantial impact on the analysis. In such a case, we may either re-examine potential sources of variation and/or re-design the experiment.
Understanding the relative importance of each of these factors and explaining the variance observed in large scale tissue studies is critical for developing any real-world application. While an understanding of the contributions of variance by various sources can result in improved protocol designs, the lack of such understanding brings into question the performance of any developed protocol
20. Hence, in this manuscript, we develop a framework to understand variability and its sources in IR spectroscopic imaging of tissue. This understanding may be extended to other analytical techniques and imaging modalities, in general, and may be used to improve the practice of IR spectroscopic imaging for biomedical analysis, in particular. The first challenge to understanding variability is to obtain a data set of sufficient diversity and size. Tissue microarrays (TMAs),
21 to this end, are an excellent tool and have been used previously in a number of studies.
22, 23, 24, 25 TMAs consist of many tissue samples arranged in a grid pattern (), in which multiple samples may be included from the same person and a population of different people is included. Multiple TMAs may further be employed to increase sample set diversity and size, both in the populations of patients as well as clinical settings and handling of samples. The second challenge is to quantify the effect of sources by determining their contribution to the total variance, which can be accomplished by applying analysis of variance (ANOVA) models to the acquired data set.
ANOVA is a popular statistical model for partitioning the total variance of the measured quantity in an experiment into various identifiable factors (or sources of variation). Consider a TMA consisting of tissue samples from several patients, and
n samples were taken from each patient. Here, patient is the only factor, and the difference between patients in terms of IR spectra is of interest. In this setting, the IR absorbance or any other combination of spectral features, which we term “metric”, can be expressed as
yjk = μ + β
j + ε
jk where
yjk is IR metric of the
kth sample from the
jth patient, μ is the overall mean, β is patient effect (
j = 1, …,
nβ), and ε is residual error effect. The total variance can be partitioned into two,

indicate the variance of patient effect and residual effect, respectively. Partitioning of variance can be carried out by computing sum of squares (SS) and mean squares (MS). The total SS is calculated as the sum of between-patient SS, sum of squared differences between the overall mean and patient means and within-patient SS (or residual SS), sum of squared differences between patient means and individual metric. Calculating MS, which is SS divided by degrees of freedom (df), the variances can be estimated by equating MS and expected mean square (EMS). EMS of patient effect and residual effect are

, respectively. Dividing the estimated variances by the total variance allows us to obtain the portion of variance explained by patient effect. The larger the portion of variance due to patient effect, the bigger is the difference between IR metrics of the patients due to a characteristic of the patients themselves. Further, the significance of the differences can be assessed by conducting a hypothesis test,
F-test.
F-test statistic, which is the ratio of between-patient MS and within-patient MS, is computed, and is compared to the
F-distribution with between-patient and within-patient df, resulting in
p-value for the test. A low
p-value denotes that the metric difference between patients is statistically significant. We note that the model can be extended to reflect additional variables; for example, including histologic class, the model becomes
yjk = μ + β
j + δ
l + βδ
jl + ε
jlk where δ is histologic class effect (
l = 1, …,
nδ) and βδ is the interaction effect between patient effect and histologic class effect. Two factors interact if the effect of one factor changes with changes in contributions from the other factor. Both β and δ are designated as main effect, which is the effect of a factor averaged across the levels of other factors (see
Supporting Information for details). Other analogous models, for example a
no-subcellular component model and
within-histologic class model (), can be constructed by adding measurement error and replacing patient and histologic class with core and subcellular component, respectively in these cases. ANOVA has been applied for analyzing several types of spectroscopic imaging data: chemical compounds
26, 27, collagen types
28, skin lesions
29, and plant species
30, 31, 32, but, to our knowledge, has not been applied to spectroscopic imaging data from tissues. To systematically apply this methodology for tissue analysis, we present appropriate ANOVA models () for different experimental designs of IR imaging data from TMAs, evaluate the statistical significance of the sources of variance, estimate variance contributions of the identified sources, and quantify the relative contributions of the sources to the total variation in the data. Finally, after examining the effect of the sources of variance, we also find the most discriminative spectral metrics and address the aspects of FT-IR imaging and TMA techniques that can be improved for better diagnostic protocols.
| Table 1Summary of ANOVA models in the manuscript. Here, y represents IR absorption of a pixel, and μ is the overall mean. α, β, γ, δ, and denote array, patient, core, histologic class, and subcellular component (more ...) |