National Children’s Study subject selection. Study populations should be selected to maximize spatial exposure contrasts for the pollutants of interest. Because multiple pollutants are of interest for the National Children’s Study, priorities must be established to allow identification of individuals with a wide range of exposure profiles for those key pollutants of study interest.
Issues to consider include spatial scale variations of pollutants, in order to select a study population that maximizes exposure contrasts (). identifies the spatial scales of variability for ambient pollutants to consider in the study design for the National Children’s Study. The scales are categorized as regional (100–1,000 km), urban (4–50 km), neighborhood (50 m to 4 km), and household (≤50 m, including outdoor and indoor microenvironments). For some exposures, contrast in exposure can be achieved by considering indoor sources and behavior (e.g., smoking vs. nonsmoking homes), if indoor-source pollutant health effects are of interest. For PM, the spatial scale variability of importance depends on the constituents of interest. For example, elemental carbon (EC) from ambient primary combustion processes varies on urban and neighborhood scales. Indoor sources from combustion also contribute to personal EC exposure (
LaRosa et al. 2002). In contrast, particulate sulfates typically vary on a regional scale. To maximize exposure gradients to EC, subjects would need to be selected on a neighborhood scale, such as based on distance to busy roadways. Sulfates’ regional nature would be better reflected in a subject selection scheme involving different regions of the United States.
| Table 2Spatial scales of variability for ambient air pollutants. |
To select subjects based on exposure contrasts for ambient pollutants (e.g., ozone, sulfate), exposure data on geographic variation in levels and spatial gradients over time are needed. For criteria pollutants, existing data are available from a national network of monitoring stations. Data for many other pollutants of biologic interest may be sparse or nonexistent (e.g., EC and air toxics). In addition, for other pollutants with both indoor and outdoor sources (e.g., PM mass, nitrogen oxides, volatile organic compounds), much of the variability in exposure is driven by indoor source activity and/or very proximate local sources (e.g., traffic). For these pollutants, levels may need to be measured or modeled with the appropriate spatial and temporal resolution in pilot studies to ascertain the appropriate spatial, temporal, and behavioral determinants. In addition to variable pollutant source strengths, subject-specific temporal–spatial–physical patterns of activity may meaningfully affect both within and between-group exposure assignments. Capturing this variability in applicably useful ways for large study population studies is challenging and often a multi-faceted approach using self-administered questionnaires, walk-through surveys, instrument deployments, and sentinel monitoring.
Because several pollutants of interest for the National Children’s Study are regional in nature, subject selection from areas with contrasting pollution profiles is likely to be most informative. The national scope of the National Children’s Study provides the opportunity to maximize the number of study profiles. For example, the constituents of PM <2.5 μm in diameter (PM
2.5) within a region are highly correlated, but between regions the correlations may be lower. PM
2.5 sulfate is higher in the eastern United States and lower in the western United States, whereas PM
2.5 nitrate is lower in the eastern United States and higher in the western United States. Therefore, the comparable effect of these PM
2.5 constituents may be separable by study design. Replication of pollution profiles in different regions is also important to allow for effects of geographic variables such as weather and other confounding variables to be controlled in the analyses (
Jerrett et al. 2003a,
2003b;
Krewski et al. 2000;
Peters 1997;
Peters et al. 1999a). Exposures within homes with common sources are also highly correlated and may be separated by design.
An example of the integration of these approaches is the Southern California Children’s Health Study (CHS), a study performed by investigators in the University of Southern California (USC)/University of California at Los Angeles Children’s Environmental Health Center. The USC CHS is a multiyear cohort study of several thousand southern California school children (
Berhane et al. 2004;
Kunzli et al. 2003;
Peters 1997). The primary USC CHS research question is whether ambient air pollution causes chronic adverse respiratory health effects during childhood and adolescent growth and development. Almost 12,000 children from schools in 13 southern California communities have been recruited into five cohorts since the study began in 1993.
Communities were selected to maximize differences in outdoor air pollutant concentrations. To distinguish the effects of different pollutants, communities were selected to minimize the spatial correlations between three priority study pollutants [O
3, nitrogen dioxide, and PM < 10 μm in diameter (PM
10)]. However, the full quasi-factorial design could not be fulfilled because all the potential pollution profiles do not occur in nature. Specific community selections were based on historical air pollution levels for several years before study inception, exposure patterns, and census demographic data. Because of differences in the number of locations at which pollutants were measured and the frequency and type of measurements made, data available for selecting communities were more reliable for O
3 than for PM
10, and more reliable for PM
10 than for NO
2. Demographically heterogeneous communities were selected because they would be more likely to exhibit overlapping distributions of confounding risk factors and would allow adjustments for confounding in the analysis. Replication of exposure profiles was employed to improve the chance of including demographically comparable communities and to allow estimation of residual variance within pollution profiles. Additional details have been described previously (
Berhane et al. 2004;
Peters et al. 1999a,
1999b). This design resulted in contrasting exposure profiles for O
3 and a package of correlated pollutants (PM
10, PM
2.5, and NO
2) primarily of mobile source origin. This approach can be extended to other pollutants, such as ultrafine particles whose concentrations may also vary on a localized scale of ≤50 m. Selecting subjects within communities based on the distance between the home and the nearest busy roadway or other traffic density metric may maximize the exposure contrasts of ultrafines within the profiles of other pollutants such as O
3.
Other potential valuable exposure sampling designs might consider “matrix” sampling approaches, which would draw on subsets of subjects for specific substudies or specialty projects. In the larger perspective however, maximizing differences in community exposure profiles can provide a rich population base from which to develop and inform multiple studies seeking to optimize the National Children’s Study effort.
Exposure metrics. Because of the large size, long duration, and diversity of outcomes and exposures of interest in the proposed National Children’s Study, the exposure assessment effort should rely on modeling to provide estimates for the entire cohort, supported by subject-derived questionnaire data. Necessary survey information on temporal–spatial–physical patterns of activity and household characteristics can be collected for the entire cohort, and targeted exposure substudies can be performed in selected subsamples of study subjects.
Issues to consider include modeling for large-scale investigations over long periods (e.g., the National Children’s Study), which is currently the only feasible approach for assigning exposure estimates for the entire cohort. This is especially true for ambient air pollutants that display significant spatial variation on urban, neighborhood, or household spatial scales.
A variety of exposure assessment modeling approaches are available, including proximity-based, geostatistical, land-use regression (LUR), dispersion, integrated meteorologic emission, and hybrid approaches involving personal sampling in combination with one or more of the above methods (Jerrett et al. 2004). Each model varies by data input requirements, software/hardware, technical expertise, and resulting accuracy and extrapolation potential.
Modeled estimates can be refined using targeted substudies designed to measure levels at geographic locations over time on the scale of spatial and temporal variation of the pollutants under study. The time resolution of the exposure estimate needs to be appropriately matched to outcomes to capture effects of frequency, magnitude, and duration of peak or episodic exposure events that may have effects during windows of vulnerability. Long-term average exposures, including average peak levels or hours above threshold levels, are likely more important for relationships with chronic disease, but this assumption needs to be evaluated for specific agents and outcomes of focus in the National Children’s Study.
Data availability and quality for model input are critically important. Central-site monitoring data can be used to assign exposure for outdoor environments, but the utility of this assignment will depend on the relative variability of the pollutant across the sampling area of interest (intra- vs. intercommunity variability issues). Estimates of indoor concentrations require individual information on home operating conditions, home source profiles and activity, factors influencing the penetration of outdoor pollutants and/or the dilution of pollutants of indoor origin (
LaRosa et al. 2002;
Navidi et al. 1999). Information about temporal, spatial, and physical activity patterns are also important but are likely to have insufficient time resolution over the period of study interest. Broader categories of “usual” patterns of activity, household operation, and susceptibility factors can be considered as modifying factors for the exposure–response relationship using available central-site monitoring data (
Gauderman et al. 2000;
Janssen et al. 2002).
An existing national system of central site monitors collects continuous data on criteria air pollutants and more limited data on hazardous air pollutants [
U.S. Environmental Protection Agency (EPA) 2004]. It is possible to add additional instruments to monitoring sites to measure additional pollutants or speciate PM at reasonable cost. However, the use of central-site monitoring data for epidemiology studies requires a quality assurance activity beyond that which is used for regulatory activities as well as methods to address missing data issues. The Health Effects Institute recently funded a study to compile existing estimates of air toxics into a coherent national database. When available, these data may contribute to the National Children’s Study, and selection of the sampling sites for the National Children’s Study should take into account the location of existing and upcoming monitoring data. No similar monitoring network exists to assess exposure from indoor sources, which may need to rely on questionnaire information and substudies across regions.
Modeling of pollutants with large intra-community variation requires additional community measurements. Substudies can be designed to exploit obtainable information for modeling study subject exposures (
Jerrett et al. 2005). These additional microenvironmental measurements can be used for fitting models to better estimate exposure, for model validation, and for assessment of errors in exposure assignments. Calibration studies using repeated personal monitoring may be designed and conducted to validate the exposure estimates and correct for exposure error in the analysis (
Berhane et al. 2004;
Fraser and Stram 2001;
Mallick et al. 2002;
Stram et al. 1995).
An illustration of these approaches may be seen in the USC CHS. The USC CHS framework employed a hierarchical approach for estimating exposure, ranging from the coarsest spatial estimates based on community pollutant levels measured at a single central monitor per community, to the finest spatial-scale estimates based on integrated models for individual exposure assessment. The framework involved the following pollutant measurement and modeling levels: a) continuous monitoring of O3, NO2, and PM10, and of PM2.5 mass and composition on a time-integrated 14-day basis, at a central monitoring station in each community; b) measurement of selected pollutants at multiple locations within each community; and c) adjustment of the central site monitor to the levels around children’s homes and schools based on a limited number of field measurements. This framework is augmented by a) modeling of vehicle emissions using geostatistical methods and spatial dispersion models, b) estimating outdoor pollutant concentrations at schools and homes for the entire study population using spatial statistical models in a hybrid microenvironmental approach, and c) modeling individual exposure estimates for the entire study population using unified modeling methods that integrated information with different spatial and temporal resolutions. These unified methods include community monitored pollutant levels, studies of indoor and outdoor levels in homes and schools; step counters; questionnaire-based data on time–activity patterns including commuting patterns, traffic patterns, and housing characteristics; and appropriate accounting of uncertainty in the exposure estimates.
The USC CHS developed a microenvironmental exposure model that, in principle, can provide estimates of exposures to pollutants of ambient origin in five microenvironments. These include residential outdoors, residential indoors, school outdoors, school indoors, and inside vehicles. The exposure model uses individual-level time–activity and housing survey data, residence and school-level traffic model estimates, and community-level air quality measurement data and regional transport factors to estimate short-term and long-term individual exposures. The model estimates show the largest amount of within-community variations in individual exposures of any of the models; however, validating these types of models is difficult and resource intensive (
Peters 1997).
Newer modeling strategies such as LUR models are promising. LUR employs the pollutant of interest as the dependent variable and proximate land use, traffic, and physical environmental variables as independent predictors. The methodology seeks to predict pollution concentrations at a given site based on surrounding land use and traffic characteristics. The incorporation of land use variables into the interpolation algorithm detects small-area variations in air pollution more effectively than do standard methods of interpolation (i.e., kriging) (
Briggs et al. 1997,
2000;
Lebret et al. 2000). These methods are promising for the National Children’s Study because they can be extrapolated, based on land use coverage, without need for extensive monitoring in each location. Most major urban centers maintain land use information, and the U.S. Census has much of the information needed on population density and employment structures. The National Children’s Study could support the monitoring needed to calibrate LUR models that are regionally representative of broad land use and emission patterns. Derived coefficients could then be applied to other places within the region without need for extensive monitoring.
Use of limited substudies for exposure refinement. Assessment of some exposures of interest will require individual measurements of exposures using snapshots of personal and microenvironmental exposures over short periods and/or in selected microenvironments.
Issues to consider include the large number of interrelated factors that are important in designing exposure substudies. These include the substudy’s purpose, the population sample to include, whether personal or microenvironmental samples should be collected, the respondent burden, study feasibility, sample collection and analytic costs, temporal variation of exposure, subject activity patterns, household operation by residents, and uses in model validation and calibration.
These elements are nicely illustrated in the Columbia Pregnancy Cohort Study (PCS), a study performed by the Columbia University Center for Children’s Environmental Health, which has focused on the effects of pre- and postnatal exposures to air pollution on birth outcomes and neurodevelopmental and respiratory health outcomes in childhood via through recruitment and follow-up of pregnant women and their offspring (
Miller et al. 2001;
Perera et al. 2003,
2004a;
Tonne et al. 2004;
Whyatt et al. 2003). In the Columbia PCS, direct air pollution exposure assessment begins in the third trimester of pregnancy with collection of a 48-hr personal sample of PM
2.5 and vapors for each pregnant woman. These samples are analyzed for polycyclic aromatic hydrocarbon (PAH) and pesticide concentrations (i.e., a “snapshot” measurement representing “usual” exposure). In a validation substudy, the investigators also collected sequential 2-week integrated indoor samples, analyzed for the same variables as above, for the entire third trimester (preferred over the personal snapshot as an exposure surrogate of third-trimester exposures, but obviously more intensive laborwise, cost-wise, and subjectwise). A home dust sample was also collected during the third trimester from subjects and analyzed for standard allergens relevant to maternal exposures and possible prenatal sensitization, based on evidence emerging from the Columbia PCS (
Miller et al. 2001).
Another time interval of study exposure interest was the first 2 years of life, when infants/toddlers spend substantial amounts of time in the home; this may be a critical exposure window for development of allergy and asthma. Columbia PCS homes were visited when the child reached 1 year of age, and a dust sample was collected for allergen analysis. Additional sampling was performed in a subset of 25% of the homes, where 2-week samples of indoor and outdoor air PM2.5, black carbon, and NO2 were collected. These samples are being used to develop and test a spatial LUR model that will then be used to estimate exposures in the full cohort that are representative of those occurring in early childhood.
As a part of its investigations of childhood asthma in Baltimore, Maryland, the Johns Hopkins Center for Asthma in the Urban Environment (JHU Center) has conducted an intervention trial and a cohort study of asthma morbidity (
Breysse et al. 2005;
Swartz et al. 2004). The exposure assessment efforts for these studies include indoor and outdoor air pollution as well as indoor allergens in approximately 400 homes. The major focus of these studies was indoor air where investigators assessed 3-day average indoor PM
10, PM
2.5, NO
2, O
3, and nicotine at 3-month intervals (
Breysse et al. 2005). In addition, 3-day time resolved PM was assessed using a data-logging nephalometer. Ambient PM air pollution was assessed using a monitoring site centrally located to the study area.
Results from these studies demonstrate the importance of assessing indoor air. Children, particularly young children, spend the great majority of their time in the home. Others have noted (
Wallace et al. 2004) that indoor PM concentrations are generally higher than outdoor levels, and cigarette smoking as well as other household activities are responsible for this increase. In some cases, the PM contribution from cigarette smoking to indoor PM is greater than that penetrating from outdoor air. The JHU Center results indicate, for example, that a single cigarette contributes between 1 and 2 μg/m
3 to indoor PM. In addition, a strategy that uses repeat measures allows larger time frame variability to be assessed (e.g., seasonal).
Results from the Michigan Center for the Environment and Children’s Health demonstrate the importance of focusing on the home as an important microenvironment for children’s exposure (
Keeler et al. 2002;
Yip et al. 2004). An important lesson from these studies is that home-based exposure assessments are feasible for studies involving hundreds of children and need to be considered in the National Children’s Study. This conclusion is particularly true for newborn children who spend essentially all of their time in the home. The microenvironments of importance include the indoor environment in a range of housing types, because there is a growing recognition that housing quality is an important predictor of indoor air pollution and can affect outdoor pollution penetration rates as well as being a general risk factor for poor health (
Kingsley 2003).
As described above, the USC CHS experience suggests that exposure assignment accuracy can be improved by conducting substudies with a limited number of measurements extended temporally and spatially. In evaluating the minimal sampling needed to successfully predict long-term exposures in study communities, USC CHS investigators found that the intraclass correlation between estimated annual average of pollutants, based on 2-week subset measurements, and the true annual average was greater than 0.9 for O3, NO2, and nitric oxide in southern California, if two winter, two summer, and one spring sample were obtained. Greater numbers of samples did not appreciably improve the correlation. These results indicate that accurate estimates of the pollutant annual average levels can be obtained at homes, schools, and other central site locations with a limited number of samples. Local measurements can then be combined with concurrent central site measurements to estimate neighborhood and household scale concentrations for the entire cohort. Although the optimum number of samples may differ by region of the country or in different neighborhoods within communities, depending on the pollutants of interest and geographic and temporal variation in the processes driving air pollution, this general strategy may be of use in planning efficient National Children’s Study substudies.
Analytic and interpretation issues. Understanding issues of spatial/temporal correlations of air pollutants, the surrogacy of specific pollutants for components of the complex mixture, and the exposure misclassification inherent in exposure estimates will be critical in analyzing and interpreting National Children’s Study findings.
Issues to consider include the fact that air pollutants occur as complex mixtures of gases and particles, but coexisting constituents may covary, based on their common sources or photochemical pathways. The ambient level of one pollutant may therefore be a surrogate for other pollutants arising from the same source, so interpretation of findings for individual pollutants must account for this surrogacy (
Manchester-Neesvig et al. 2003;
Sarnat et al. 2001). Identification of pollutant sources therefore provides a potentially important mechanism to evaluate source-specific health effects and can ultimately lead to effective strategies for reducing population exposure.
Substudies among subjects in differing geographic locations may be useful for defining pollutant relationships. For example, in assessing PM, chemical tracers have been identified that can serve as “fingerprints” for individual sources, or source types, of air pollution (
Laden et al. 2000;
Manchester-Neesvig et al. 2003;
Sarnat et al. 2002). This type of information can be used to apportion contributions to the measured PM mass on a per sample basis, along with providing data critical to the assessment and interpretation of health effects associated with individual chemical components of PM. Quantitative assessments of source contributions for large data sets are often determined using a statistical receptor modeling approach. This type of data analysis is best suited to longitudinal study designs and can be limiting because it may require collection of a large number of samples to obtain robust results.
The recent successful development and deployment of several types of continuous portable PM mass and number monitors offer the potential for producing real-time (< 5-min interval) data. The continuous data collection format of these samplers allows a better understanding of source emission patterns and exposures, especially in urban environments, and can be used to enhance investigations of short-term peak exposures. These highly time-resolved exposure data can be coupled with personal time–activity pattern data to quantitatively identify exposures from specific emission sources. To date, real-time PM samplers do not yet offer the ability to determine PM chemical speciation. A combination of methodologic approaches (employing chemical tracers and continuous PM number and mass count information) may improve the ability to identify specific sources and source types contributing to the measured exposure to PM and other pollutants.
Exposure misclassification is a critical issue for exposure assessment efforts, especially modeled exposures. In most large cohort studies, it is not possible to accurately measure the true personal exposure of individuals over the time interval that is most relevant for the outcomes of interest. Thus, virtually all exposure assessments provide at best estimates of true exposures, with some error. Errors may arise because of temporal factors (e.g., the exposure metric captures only a snapshot of the relevant time interval) or spatial factors (e.g., the exposure metric is collected at a location different from where the subject lives and breathes). Additionally, inherent imprecision in the specific method selected for study application may also result in some measurement error. For the results of the study to ultimately be interpretable, it is important in designing the study for investigators to analyze the nature of the exposure misclassification errors that are likely to be present. Quantitative estimates of exposure errors can be obtained by carrying out calibration substudies where results from more complete exposure metrics are compared with results from the modeled metrics (
Berhane et al. 2004;
Fraser and Stram 2001;
Mallick et al. 2002;
Sarnat et al. 2001;
Stram et al. 1995). Bayesian statistical frameworks may assist with assessing the impact of measurement error on the exposure–response relationships (
Berhane et al. 2004).
Modifiers of exposure–outcome relationships. “Usual” temporal, spatial, and physical patterns of activity can be used as modifiers of the exposure–outcome relationships. Highly time-resolved activity information over the study period of interest may not be necessary, and is not likely to be available, for all National Children’s Study participants throughout the study. Personal exposure estimates, based on time in microenvironments, are likely to be associated with large uncertainties. “Usual” patterns of activity, such as time usually spent outdoors, can be collected by questionnaire and used as modifiers of exposure–outcome relationships (
Gauderman et al. 2002). Activity-level assignments may be important in moving from exposure to delivered dose of an airborne pollutant to the lung. For example, for asthma prevalence and incidence, USC CHS investigators saw little association with community levels of exposure. However, when physical activity was considered, O
3 was strongly associated with asthma incidence (where variation entered from increased ventilation rates associated with exercise and likely increased dose to the lung). An important challenge for the National Children’s Study is assessing activity patterns among mothers, infants, and young children.
For extremely large study populations for which individual questionnaires may be impractical to administer and/or collect, randomized sampling schemes or oversampling in certain nested subsamples of possible increased interest may be worth careful consideration.
Use of biomarkers. Biomarkers of exposure offer utility for evaluation of specific exposures that have multiple routes of exposure. For specific airborne pollutants, exposure assessments may need to consider multiple routes of human exposure. In addition to inhalation, dermal absorption and oral ingestion may be important pathways of exposure for pollutants of interest with regard to young children, infants, and pregnant or lactating mothers. The use of exposure biomarkers is one potentially valuable approach in this area (
Weaver et al. 1998). Interpreting the relationship between these markers and exposures, however, is a complex function of the timing and routes of exposure, and of the pollutant toxicokinetics. As discussed above, temporal–spatial–physical patterns of activity will almost surely affect this dynamic in important ways, from modification of ventilation rates to facilitated dermal absorption during periods of elevated, increased, or extended activities. As exposure assessment tools, biomarkers offer the potential advantage of integrating the net effect of all of these factors in producing a given internal dose for a given individual. Such measurements may better represent true health-relevant exposures for an individual than any external measure of exposure can.
Biomarker measurements are substantially integrated into the exposure and health assessment designs of the Columbia PCS. From an exposure perspective, biomarkers focus on DNA-bound PAHs (
Perera et al. 2004a,
2004b), pesticides in blood plasma and meconium (
Perera et al. 2003;
Whyatt et al. 2001,
2003,
2004), and the environmental tobacco smoke (ETS) metabolite cotinine in urine (
Perera et al. 2004b), beginning with maternal and infant cord blood samples at birth, and continuing with follow-up assessments in the child at 2 and 5 years of age. PAH-DNA adducts also can be viewed as early measures of procarcinogenic health effects (
Perera et al. 2004b). Other effect-related biomarkers focus on the time course of sensitization to environmental allergens, including measurements of maternal, cord-blood, and child IgE, and production of proinflammatory cytokines or proliferation of mononuclear cells in response to specific allergens (
Miller et al. 2001).
The integration of newly developed pesticide biomarkers within the epidemiologic design of the Columbia PCS has made possible significant new advances in our understanding of the health effects and patterns of exposures to pesticides among urban women and children (
Perera et al. 2003;
Whyatt et al. 2001,
2003,
2004). A wide range of pesticides have been shown to be quantifiable in the plasma of women and their newborns, with significant correlations between maternal and cord blood levels in many cases (
Whyatt et al. 2003). For some but not all pesticides, correlations also were demonstrated between plasma levels at birth (either cord blood or maternal) and air measurements collected during the third trimester of pregnancy. Cord plasma, but not air, levels of the insecticide chlorpyrifos and diazinon were significantly associated with decreased birth weight and length (
Whyatt et al. 2004). Of particular significance, levels of several pesticides in both air and plasma showed significant declines across women enrolled before and after the U.S. EPA insecticide phase-out (
Whyatt et al. 2003). Furthermore, associations with adverse birth outcomes were significant only for infants born before the phase-out (
Whyatt et al. 2004). These findings illustrate the utility of well-targeted biomarker measurements, in conjunction with health and external exposure measures, for birth cohort studies.
Cotinine and nicotine as markers for ETS, an important source of PM exposure, has a long history of use in biomonitoring. Hair nicotine has the potential to provide estimates of ETS exposure over a 2–3 month period or longer (
Jaakkola and Jaakkola 1997), and other nicotine metabolites (e.g. cotinine) may be useful indicators of both exposure and bioavailability.