Our results demonstrate that tDCS at 2 mA is not associated with effective blinding when compared with the commonly used sham using this electrode montage and stimulation procedure. For a proportion of tDCS naïve participants, blinding is maintained, but the probability of a participant correctly identifying the stimulation condition is greater than would be expected by chance. Given the high agreement in the second session, the threat to participant blinding appears substantially worse for crossover trials. Participants were more confident in their judgement where they judged that they were receiving active stimulation after the first session though this difference diminished by the second stimulation session.
It is highly likely that the sensory effects of active stimulation were responsible for compromising participant blinding. Familiarity with the experience of stimulation and the ability to compare between sessions amplified this issue after the second stimulation session. Reports of persistent itch or tingling during stimulation in response to the adverse events question are suggestive of this. Most participants probably would not consider these sensations to be adverse effects and so only a minority reported them. Assessor blinding was also compromised in a substantial proportion (60%) of active stimulation in both sessions and this represents an important potential source of bias, regardless of study design, in studies where outcomes are assessed in the immediate post stimulation period.
The current finding has substantial implications for much of the existing literature relating to tDCS. For example, 2 mA intensity and similar electrode montages have been used in almost all trials of tDCS for chronic pain
[11], the majority of sham controlled tDCS trials for depression,
[6],
[13],
[14] and all trials of tDCS for reducing cravings
[16],
[18]–
[20]. All of these studies have reported superior efficacy of active stimulation over sham and while some
[13],
[14] report adequate participant blinding, the issue of assessor blinding was not assessed. While we cannot predict the degree of influence that inadequate blinding may have had in these studies, non-specific effects of interventions are known to be important in such clinical conditions
[21],
[22]. Further, there is evidence that incomplete blinding leads to exaggerated effects in clinical studies with subjective outcomes
[23], and that placebo effects are larger with physical placebo interventions
[22]. Thus, we contend that clinical studies that have used 2 mA tDCS should be interpreted with renewed caution. This point is emphasised by the recognised phenomenon that trials of new clinical interventions are often associated with small study effects and a publication bias that influence the evidence base, with a propensity for negative studies to not reach full publication
[24],
[25].
How might blinding of tDCS at this intensity be improved? Assessor blinding might be ensured by having the participant wear headgear that conceals the area under the electrodes. It is possible that longer ramping times may improve participant blinding but this may not be sufficient where participants are aware of sensations throughout the stimulation period. McFadden et al.
[26] demonstrated that the pre-application of topical anaesthetics to electrode sites substantially reduced (but did not abolish) the sensations associated with stimulation, although the same process would be difficult at more posterior locations in participants with hair. Any modified sham protocol will require rigorous testing to ensure adequate blinding.
An alternative approach may be to reduce stimulation intensity. Indeed, it is not clear that higher stimulation intensities are necessary in clinical studies
[11]. Effects on cortical excitability have been clearly demonstrated at intensities of 1 mA
[1] and there is evidence to suggest that successful participant blinding is achievable under these conditions
[9],
[10]. Using intensities of 1 mA in future research may represent a more methodologically sound option, although it is plausible that reducing the intensity may reduce potential efficacy. Future studies of tDCS may benefit from other methods to optimise blinding, for example de facto masking
[27], in which the treatment is not blinded but both treatments are presented as the active one. De facto masking might be more problematic if a non-stimulation sham is used that carries less credibility with participants but would seem very possible if the “sham” condition is active tDCS over a distinct brain area that is not hypothesized to elicit specific treatment effects.
That we found inadequate blinding using a therapy widely held as blindable
[8] raises the possibility that clinical trials of other therapies are vulnerable to similar problems. One obvious example is in trials of TENS, in which the sham condition often involves a deactivated TENS unit and as such there will be distinct differences in the experience of stimulation. It is important to also acknowledge that inadequate blinding is not the only threat to the validity of clinical trials and continued attention should be paid in the design of trials to ensuring rigour in the selection and allocation process of future trials
[28].
Our study has some limitations. We did not investigate the perceptual correlates of stimulation in any detail. We took this decision so as to minimise the risk that participants would over-scrutinize the experience of tDCS, which we felt would not accurately reflect the conditions of the average clinical trial. As such we cannot tell with confidence which factors most impacted on blinding. The VAS scale that we used to measure participants’ confidence in their judgements has not been specifically validated for that task and may have lacked sensitivity and validity, although this would not confound our results so much as reduce our power to detect non-blinding. The predominance of female participants might plausibly have affected our results. There is some evidence that differences exist between males and females, in pain threshold and pain evoked by a standard noxious stimulus, but the nature of the difference depends upon the type of stimulus and the context in which it is tested (see
[29]for a review). There is also some debate as to whether pain thresholds vary in females according to stage of their menstrual cycle
[30]–
[32]. However, randomisation of the order of stimulation should mitigate any potential impact of these issues on our data. Finally the persistence of noticeable skin redness that persists beyond the immediate post-stimulation period represents a further risk to participant blinding and suggests that our results may underestimate the scale of the problem.
In conclusion, contrary to the assumption of blinding, which underpins the growing support of tDCS for clinical conditions, our data show that both participant and assessor blinding is compromised at 2 mA intensity when using this electrode montage and stimulation procedure. The findings have important implications for the interpretation of studies which have utilised this approach and for the design of future tDCS studies.