Although many neuronal targets for treating the underlying disorder in FXS have emerged, and early translational work has begun, there are still many uncertainties about how to best demonstrate the treatment effects in clinical trials. FXS serves as a good model to develop such designs for other developmental disorders, particularly because FXS is a single genetic disorder in which affected individuals all have the same basic cellular defect, a mouse model is available, some information on synaptic function of FMRP in brain is known, and aspects of FXS model more common disorders with likely mechanistic overlap, including ASDs, ADHD, and learning disabilities.
Some lessons that have been learned from early clinical trials completed to date in FXS subjects are as follows. It is very difficult to identify good cognitive outcome measures for drug effect that can be performed by FXS subjects crossing the entire range of function because of problems with basal scores and ceilings (
Berry-Kravis et al, 2006,
2009), and more work to validate appropriate measures is needed. Predicting the outcome that will improve most in a complex disorder during initial trials is essentially guesswork, when the treatment targets the underlying disorder and not a specific behavior. Therefore, it is important to run small exploratory trials initially to obtain information from which to choose the primary outcome in subsequent larger trials, and set inclusion criteria to select for subjects with problems in the area targeted by the drug. The factor structure of existing key rating scales such as the ABC (Maltas
et al, 2011, personal communication) may not be appropriate for FXS if the measure was developed in a non-FXS cohort. This may prevent detection of improvement in a behavioral area because of dilution of scores with items that are not contributing to that behavior. The placebo effect in FXS trials is large (
Berry-Kravis et al, 2011), most likely because many families are aware of current research. Parents who fill out the rating scales meant to detect medication effect are hoping for and expecting improvement from targeted treatments. Thus, findings from short open-label trials may not be reproduced in placebo-controlled trials. A placebo lead-in (period of placebo treatment for all subjects before randomization to active drug or placebo) is a strategy for FXS trials to limit the placebo effect and allow the treatment effect to be more evident. Recruitment is very difficult and prolonged when concomitant medications are not allowed (
Erickson et al, 2011a), and thus reasonable concomitant medications that are not predicted to have a major interaction with the agent being studied, will need to be allowed in FXS trials. In fact, this will make a prediction possible about whether the agent being tested can add incrementally to currently existing therapy. Finally, it appears likely that subgroups of patients will respond better to different targeted agents, even in a uniform disorder such as FXS. There may be biomarkers or markers of clinical severity that define a treatment response just in a subset of patients, as in the socially impaired group in the R-baclofen study (
Wang et al, 2010b) and the fully methylated group in the AFQ056 study (
Jacquemont et al, 2011).
The design and evaluation of outcome measures for trials of targeted treatments in FXS and neurodevelopmental disorders has proven to be a major hurdle. Outcome measures chosen need to test a broad ability range to prevent low or high-functioning individuals from showing ceiling or floor effects, overcome problems with co-operation and variable performance, be shown to be reproducible, and quantify core defects and correlate with quality of life and true functional improvement. Only a subset of outcome measures utilized in recent trials have turned out to fulfill the majority of these criteria (
Berry-Kravis et al, 2006,
2008a,
2008b,
2009) suggesting better, FXS-specific measures are needed. Only recently have investigators begun to develop templates to test the feasibility, reproducibility, and validity assessment before using the measure in a trial (
Berry-Kravis et al, 2008b;
Hessl et al, 2009;
Knox and Berry-Kravis, 2009;
Farzin et al, 2011). Choice of outcome measures must also balance use of accepted behavioral measures, which are generally caregiver rating scales (such as the ABC), with precedent for use in drug registration/FDA approval
versus use of novel measures (
Hessl et al, 2009;
Knox and Berry-Kravis, 2009;
Farzin et al, 2011) that are more quantitative and may objectively measure core phenotypes and electrophysiology (such as eye tracking or PPI). These novel quantitative measures advance treatment science, but have no precedent for registration, do not clearly predict a specific functional outcome, and are often expensive and difficult to run at multiple sites.