Accurate coding of temporal information has direct behavioral relevance for the computation of sound source location. Birds and mammals show exquisite sensitivity to interaural time differences (ITDs): when sound comes from one side of the body, it reaches one ear before the other. The brain uses these ITDs to compute sound location in the horizontal (azimuthal) plane (
Konishi 2003;
Yin 2002).
There is general agreement that the basic sensitivity for ITD and binaural correlation arises through a cross-correlation like comparison of inputs to the two ears (
Batra and Yin 2004;
Joris and Yin 2007;
Yin et al. 1987). The cross correlator neurons act as coincidence detectors (reviews in
Grothe 2003;
Konishi 2003;
Yin 2002). The coincidence detection is performed separately and in parallel in many narrowly tuned frequency channels. The sound waveform is encoded by phase-locked neural discharges in the auditory nerve, i.e. by a precise correlation between the phase of the stimulus and the firing of spikes. Coincidence detection between such inputs from each ear gives rise to a discharge pattern that varies cyclically as a function of interaural phase difference, showing a maximum when both inputs are in phase and a minimum when they are 180° out of phase. Thus, sensitivity to interaural phase differences (IPDs) is created. IPD is a relative measure of time and, knowing the stimulus period, can be translated into absolute ITD. In fact, within each narrowly tuned frequency channel, IPD and ITD are interchangeable. ITD is the physical cue to the azimuthal position of a sound source. A current controversy centers on the question of how the coding of a range of ITDs enables the nervous system to precisely localize sound sources along the azimuthal plane.
In principle, an array of coincidence detectors could be set up, situated along interdigitating or counter-current delay line inputs from each ear. In such a circuit, the delay lines introduce successively greater input delays to the coincidence detectors they contact serially. In consequence, each individual coincidence detector fires maximally at the phase difference between its inputs that exactly compensates for the conduction delay introduced at its place. Such a circuit, generating a place map of interaural phase difference at each frequency is well known as the place-code model or Jeffress model, after
Jeffress (1948). However, the task of ITD coding is affected by both head size and the ability to phase lock. The sharpness of ITD selectivity of the individual coincidence detectors increases for neurons with higher characteristic frequency because their temporal precision is greater. For example, the spikes of an auditory neuron phase-locking to a 5kHz stimulus (with a period of 200 μs) show a temporal dispersion of about ±40 μs around the preferred phase; for a neuron phase-locking to 1 kHz (with a period of 1 ms) the temporal dispersion is typically ±100 μs (
Köppl 1997). In coincidence detector neurons using such inputs, this results in correspondingly steeper slopes for the 5 kHz and shallower slopes for the 1 kHz ITD selectivity curves (
Batra and Yin 2004). Animals with smaller heads that naturally experience a smaller ITD range therefore have less precise information available at equivalent frequencies than animals with larger heads.
Animals with smaller heads also do not have the option of simply using higher frequencies. As the above example illustrates, phase-locking is a process that demands increasing temporal precision in spike generation with increasing frequency. Due to the biophysical limitations of the cell membranes involved, phase-locking faces a clear upper frequency limit. For the auditory neurons providing the input to the coincidence detector circuits discussed here, this upper limit varies between 3 and 10kHz in different species (review in
Köppl 1997).
The basic problem of the interaction between head size and the frequency range available for creating the neural code of ITD was formalized in a model of IPD representation (
Harper and McAlpine 2004). Assuming that the ITDs an animal naturally encounters should be coded with maximal accuracy,
Harper and McAlpine (2004) argued that the neural representation of IPD within the population of the first binaural coincidence detectors should conform to either one of two distinct strategies, depending on head size and frequency range. One is a homogeneous distribution of the maxima of their selectivity curves (hereafter called best IPD), collectively covering the physiological ITD range of the animal within each frequency band. Although the model does not address the question as to how the distribution is achieved, such a distribution is consistent with the Jeffress model and an orderly representation of best IPDs along input delay lines. The second strategy of ITD coding is characterized by a non-homogeneous distribution of best IPD, with distinct subpopulations of neurons within each frequency band. The best IPDs of each population fall within a narrow range and often outside the physiological range of the animal. Instead of the maxima, the slopes of the IPD-selectivity curves cover the physiological range, and each slope covers most of this range. Various terms and variations have been suggested for this broad category of models in the past, summarized as Left–Right Count-Comparison models by
Colburn and Kulkarni (2005). Here, the term two-channel model will be used, emphasizing the fact that all the coincidence detectors of each brainstem hemisphere together are believed to comprise one channel (or population). The relative excitation in the two channels from the two hemispheres is assumed to be read out as a correlate of ITD and thus as azimuthal sound source location (review in
Palmer 2004).
Experimental evidence for both types of models of ITD coding exists. As has been reviewed by many authors (e.g.
Konishi 2003), all of the characteristics of the Jeffress model appear fulfilled in the relevant brainstem nucleus (Nucleus laminaris, NL) of the barn owl, at least within the frequency range that has been extensively studied (above 3 kHz;
Carr and Konishi 1990;
Pena et al. 1996). Experimental data from the equivalent brainstem nucleus (medial superior olive, MSO) in the gerbil provide the clearest support for the two-channel model (
Brand et al. 2002). In addition, a likely neural mechanism has been revealed in the gerbil for creating the unique distribution of best IPDs. It relies on additional phase-locked inhibitory inputs to the coincidence detector (MSO) neurons and does not require input delay lines (reviewed in
Grothe 2003). However, data from different mammalian species are often ambiguous and their interpretation in support for the Jeffress model on the one hand or the two-channel model on the other is intensely controversial (recent summaries in
Joris and Yin 2007;
McAlpine 2005;
Palmer 2004).
A virtue of the optimal coding scheme suggested by
Harper and McAlpine (2004) is that it makes clear predictions for specific examples of head sizes and frequencies about which coding strategy should be optimal and thus allows for experimental testing. As a general rule, a Jeffress-like code and homogeneous representation of best IPDs is optimal at frequencies high enough so that the head’s ITD range exceeds ±0.5 cycles, while one or two channels with discrete populations of best IPD are optimal at frequencies below that. The barn owl and the gerbil were put forward as examples where experimental data clearly fit those predictions, however, this has recently been challenged for the low-frequency range of the owl (
Wagner et al. 2007).
The key prediction of Jeffress’ model, a topographic map of best ITD in the MSO or NL, has not been experimentally addressed recently. In 1990, Carr and Konishi used physiological and anatomical techniques to show that axonal delay lines form maps of ITD in the NL of the barn owl. In the cat, two studies provided anatomical evidence for axonal delay lines in the contralateral afferents (
Beckius et al. 1999;
Smith et al. 1993), while
Yin and Chan (1990) showed a correlation between best delay and rostrocaudal position in the MSO. However, the owl has been challenged as a highly specialized and potentially untypical case (e.g.
McAlpine 2005) and the evidence in the cat is not conclusive (
Joris and Yin 2007).
We have therefore examined this key prediction in the chicken, an unspecialized bird with a small range of physiological ITDs (
Hyson et al. 1994) and a relatively low range of frequencies of phase-locking (
Salvi et al. 1992), both similar to the values in the gerbil.
Harper and McAlpine’s (2004) optimal coding scheme predicts ITD coding in discrete channels for frequencies up to 3 kHz, i.e., up to the limit of phase-locking. However, anatomical studies show that the chicken Nucleus magnocellularis (NM) projects in a delay-line pattern to NL (
Parks and Rubel 1975;
Young and Rubel 1983) and appropriate conduction delays have been measured in brain-slice preparations of this circuit (
Overholt et al. 1992). This suggests a map-like representation of a range of IPDs, inconsistent with the prediction of a uniform population of neurons on each side of the brainstem. However, it is unknown whether those delay lines determine the responses of NL neurons in the mature chicken in vivo and if so, what range of IPDs they cover. We have carried out in vivo recordings of NL activity, combined with histological verification of recording sites. We show that the NL contains a systematic, gradual representation of the animal’s ITD range. This and a host of monaural and binaural response properties investigated are entirely consistent with the Jeffress model.