Related Articles
Interaural time differences (ITDs) can be used to localize sounds in the horizontal plane. ITDs can be extracted from either the fine structure of low-frequency sounds or from the envelopes of high-frequency sounds. Studies of the latter have included stimuli with periodic envelopes like amplitude-modulated tones or transposed stimuli, and high-pass filtered Gaussian noises. Here, four experiments are presented investigating the perceptual relevance of ITD cues in synthetic and recorded “rustling” sounds. Both share the broad long-term power spectrum with Gaussian noise but provide more pronounced envelope fluctuations than Gaussian noise, quantified by an increased waveform fourth moment, W. The current data show that the JNDs in ITD for band-pass rustling sounds tended to improve with increasing W and with increasing bandwidth when the sounds were band limited. In contrast, no influence of W on JND was observed for broadband sounds, apparently because of listeners' sensitivity to ITD in low-frequency fine structure, present in the broadband sounds. Second, it is shown that for high-frequency rustling sounds ITD JNDs can be as low as 30 μs. The third result was that the amount of dominance for ITD extraction of low frequencies decreases systematically with increasing amount of envelope fluctuations. Finally, it is shown that despite the exceptionally good envelope ITD sensitivity evident with high-frequency rustling sounds, minimum audible angles of both synthetic and recorded high-frequency rustling sounds in virtual acoustic space are still best when the angular information is mediated by interaural level differences.
doi:10.1007/s10162-011-0303-2
PMCID: PMC3254714
PMID: 22124890
binaural hearing; envelope; roughness; duplex theory; dominance region
Bilateral cochlear implantation seeks to improve hearing by taking advantage of the binaural processing of the central auditory system. Cochlear implants typically encode sound in each spectral channel by amplitude modulating (AM) a fixed-rate pulse train, thus interaural time differences (ITD) are only delivered in the envelope. We investigated the ITD sensitivity of inferior colliculus (IC) neurons with sinusoidally AM pulse trains. ITD was introduced independently to the AM and/or carrier pulses to measure the relative efficacy of envelope and fine structure for delivering ITD information. We found that many IC cells are sensitive to ITD in both the envelope (ITDenv) and fine structure (ITDfs) for appropriate modulation frequencies and carrier rates. ITDenv sensitivity was generally similar to that seen in normal-hearing animals with AM tones. ITDenv tuning generally improved with increasing modulation frequency up to the maximum modulation frequency that elicited a sustained response in a neuron (tested ≤Hz). ITDfs sensitivity was present in about half the neurons for 1,000 pulse/s (pps) carriers and was nonexistent at 5,000 pps. The neurons that were sensitive to ITDfs at 1,000 pps were those that showed the best ITD sensitivity to low-rate pulse trains. Overall, the best ITD sensitivity was found for ITD contained in the fine structure of a moderate rate AM pulse train (1,000 pps). These results suggest that the interaural timing of current pulses should be accurately controlled in a bilateral cochlear implant processing strategy that provides salient ITD cues.
doi:10.1152/jn.00751.2007
PMCID: PMC2570106
PMID: 18287556
Background
When a second sound follows a long first sound, its location appears to be perceived away from the first one (the localization/lateralization aftereffect). This aftereffect has often been considered to reflect an efficient neural coding of sound locations in the auditory system. To understand determinants of the localization aftereffect, the current study examined whether it is induced by an interaural temporal difference (ITD) in the amplitude envelope of high frequency transposed tones (over 2 kHz), which is known to function as a sound localization cue.
Methodology/Principal Findings
In Experiment 1, participants were required to adjust the position of a pointer to the perceived location of test stimuli before and after adaptation. Test and adapter stimuli were amplitude modulated (AM) sounds presented at high frequencies and their positional differences were manipulated solely by the envelope ITD. Results showed that the adapter's ITD systematically affected the perceived position of test sounds to the directions expected from the localization/lateralization aftereffect when the adapter was presented at ±600 µs ITD; a corresponding significant effect was not observed for a 0 µs ITD adapter. In Experiment 2, the observed adapter effect was confirmed using a forced-choice task. It was also found that adaptation to the AM sounds at high frequencies did not significantly change the perceived position of pure-tone test stimuli in the low frequency region (128 and 256 Hz).
Conclusions/Significance
The findings in the current study indicate that ITD in the envelope at high frequencies induces the localization aftereffect. This suggests that ITD in the high frequency region is involved in adaptive plasticity of auditory localization processing.
doi:10.1371/journal.pone.0041328
PMCID: PMC3407190
PMID: 22848464
In order to examine the effect of inhibition on processing auditory temporal information, responses of single neurons in the inferior colliculus of the chinchilla to sinusoidally amplitude-modulated (SAM) tones alone and the presence of a steady-state tone were obtained. The carrier frequency of the SAM tone was either the characteristic frequency (CF) or a frequency in the inhibitory response area of a studied neuron. When the carrier frequency was set to the neuron’s CF, neurons responded in synchrony to the SAM-tone envelope, as expected. When the carrier frequency was set to a frequency at which pure tones produced inhibition, SAM tones elicited little or no response, also as expected. However, when the same SAM tone was paired with a pure tone whose frequency was set to the neuron’s CF, responses synchronized to the SAM tone envelope were obtained. These modulated responses were typically one-half cycle out-of-phase with the response to the SAM tone at CF, suggesting that they arose from cyclic inhibition and release from inhibition by the SAM tone. The results demonstrate that the representation of temporal information by inferior colliculus neurons is influenced by temporally-patterned inhibition arising from locations remote from CF.
doi:10.1016/j.heares.2006.07.012
PMCID: PMC1592138
PMID: 16945495
amplitude modulation; inhibition; inferior colliculus; physiology; chinchilla; CAP compound action potential; CF characteristic frequency; CMR comodulation masking release; dB SPL deciBels sound pressure level; DNLL dorsal nucleus of the lateral lemniscus; Fc carrier frequency; Fm modulation frequency; GABA gamma aminobutyric acid; Hz Hertz; IC inferior colliculus; kHz kilohertz; MTF modulation transfer function; PST peri-stimulus time; RA response area; RLF rate-level function; SAM sinusoidally amplitude modulated; VS vector strength
Dreyer and Oxenham (2008) reported that spectrally-flanking noise increased threshold-ITDs conveyed by high-frequency transposed tones but rendered them indiscriminable when they were conveyed by high-frequency sinusoidally amplitude-modulated (SAM) tones [Dreyer and Oxenham, J. Acoust. Soc. Am. 123, EL1-EL7 (2008)]. This study extends those observations and evaluates the role of “off-frequency listening.” Threshold-ITDs were measured using 4 kHz-centered transposed or SAM tonal “targets.” In “baseline” conditions, targets were presented without spectrally-flanking noise. Additionally, targets were presented along with continuous diotic broadband Gaussian noise spectrally “notched” between 3.6 and 4.4 kHz. In another condition, only the high-pass segment of the notched noise was continuously present. In the final condition, only the low-pass segment was continuously present. Results indicate: 1) relative to baseline, adding notched noise resulted in similar relative increases of threshold-ITDs for both SAM and transposed targets; 2) the presence of the high-pass segment of the notched noise resulted in greater relative increases in threshold-ITDs over those obtained in baseline conditions for SAM tones as compared to transposed tones; 3) comparisons among all of the data were consistent with the interpretation that both on-frequency and off-frequency processing of envelope-based ITDs can be disrupted by the presence of a notched noise.
doi:10.1121/1.2980523
PMCID: PMC2647747
PMID: 19045794
Interaural time difference (ITD) is a cue to the location of sounds containing low frequencies and is represented in the inferior colliculus (IC) by cells that respond maximally at a particular best delay (BD). Previous studies have demonstrated that single ITD-sensitive cells contain sufficient information in their discharge patterns to account for ITD acuity on the midline (ITD = 0). If ITD discrimination were based on the activity of the most sensitive cell available (“lower envelope hypothesis”), then ITD acuity should be relatively constant as a function of ITD. In response to broadband noise, however, the ITD acuity of human listeners degrades as ITD increases. To account for these results, we hypothesize that pooling of information across neurons is an essential component of ITD discrimination. This report describes a neural pooling model of ITD discrimination based on the response properties of ITD-sensitive cells in the IC of anesthetized cats.
Rate versus ITD curves were fit with a cross-correlation model of ITD sensitivity, and the parameters were used to constrain a population model of ITD discrimination. The model accurately predicts ITD acuity as a function of ITD for broadband noise stimuli when responses are pooled across best frequency (BF). Furthermore, ITD tuning based solely on a system of internal delays is not sufficient to predict ITD acuity in response to 500 Hz tones, suggesting that acuity is likely refined by additional mechanisms. The physiological data confirms evidence from the guinea pig that BD varies systematically with BF, generalizing the observation across species.
doi:10.1523/JNEUROSCI.0762-04.2004
PMCID: PMC2041891
PMID: 15306644
auditory; binaural; hearing; inferior colliculus; localization; psychophysics
The encoding of sound level is fundamental to auditory signal processing, and the temporal information present in amplitude modulation is crucial to the complex signals used for communication sounds, including human speech. The modulation transfer function, which measures the minimum detectable modulation depth across modulation frequency, has been shown to predict speech intelligibility performance in a range of adverse listening conditions and hearing impairments, and even for users of cochlear implants. We presented sinusoidally amplitude modulated (SAM) tones of varying modulation depths to awake macaque monkeys while measuring the responses of neurons in the auditory core. Using spike train classification methods, we found that thresholds for modulation depth detection and discrimination in the most sensitive units are comparable to psychophysical thresholds when precise temporal discharge patterns rather than average firing rates are considered. Moreover, spike timing information was also superior to average rate information when discriminating static pure tones varying in level but with similar envelopes. The limited utility of average firing rate information in many units also limited the utility of standard measures of sound level tuning, such as the rate level function (RLF), in predicting cortical responses to dynamic signals like SAM. Response modulation typically exceeded that predicted by the slope of the RLF by large factors. The decoupling of the cortical encoding of SAM and static tones indicates that enhancing the representation of acoustic contrast is a cardinal feature of the ascending auditory pathway.
doi:10.1523/JNEUROSCI.4170-09.2010
PMCID: PMC3551278
PMID: 20071542
auditory; cortex; temporal coding; contrast; synchrony; spike trains; speech
The lateral superior olive (LSO) is a brainstem nucleus that is classically understood to encode binaural information in high-frequency sounds. Previous studies have shown that LSO cells are sensitive to envelope interaural time difference in sinusoidally amplitude-modulated (SAM) tones (Joris and Yin, J Neurophysiol 73:1043–1062, 1995; Joris, J Neurophysiol 76:2137–2156, 1996) and that a subpopulation of LSO neurons exhibit low-threshold potassium currents mediated by Kv1 channels (Barnes-Davies et al., Eur J Neurosci 19:325–333, 2004). It has also been shown that in many LSO cells the average response rate to ipsilateral SAM tones decreases with modulation frequency above a few hundred Hertz (Joris and Yin, J Neurophysiol 79:253–269, 1998). This low-pass feature is not directly inherited from the inputs to the LSO since the response rate of these input neurons changes little with increasing modulation frequency. In the current study, an LSO cell model is developed to investigate mechanisms consistent with the responses described above, notably the emergent rate decrease with increasing frequency. The mechanisms explored included the effects of after-hyperpolarization (AHP) channels, the dynamics of low-threshold potassium channels (KLT), and the effects of background inhibition. In the model, AHP channels alone were not sufficient to induce the observed rate decrease at high modulation frequencies. The model also suggests that the background inhibition alone, possibly from the medial nucleus of the trapezoid body, can account for the small rate decrease seen in some LSO neurons, but could not explain the large rate decrease seen in other LSO neurons at high modulation frequencies. In contrast, both the small and large rate decreases were replicated when KLT channels were included in the LSO neuron model. These results support the conclusion that KLT channels may play a major role in the large rate decreases seen in some units and that background inhibition may be a contributing factor, a factor that could be adequate for small decreases.
doi:10.1007/s10162-011-0300-5
PMCID: PMC3298618
PMID: 22160752
LSO; low-threshold potassium channel; auditory brainstem; cochlear nucleus bushy cells; envelope processing
The lateral superior olive (LSO) is a brainstem nucleus that is classically understood to encode binaural information in high-frequency sounds. Previous studies have shown that LSO cells are sensitive to envelope interaural time difference in sinusoidally amplitude-modulated (SAM) tones (Joris and Yin, J Neurophysiol 73:1043–1062, 1995; Joris, J Neurophysiol 76:2137–2156, 1996) and that a subpopulation of LSO neurons exhibit low-threshold potassium currents mediated by Kv1 channels (Barnes-Davies et al., Eur J Neurosci 19:325–333, 2004). It has also been shown that in many LSO cells the average response rate to ipsilateral SAM tones decreases with modulation frequency above a few hundred Hertz (Joris and Yin, J Neurophysiol 79:253–269, 1998). This low-pass feature is not directly inherited from the inputs to the LSO since the response rate of these input neurons changes little with increasing modulation frequency. In the current study, an LSO cell model is developed to investigate mechanisms consistent with the responses described above, notably the emergent rate decrease with increasing frequency. The mechanisms explored included the effects of after-hyperpolarization (AHP) channels, the dynamics of low-threshold potassium channels (KLT), and the effects of background inhibition. In the model, AHP channels alone were not sufficient to induce the observed rate decrease at high modulation frequencies. The model also suggests that the background inhibition alone, possibly from the medial nucleus of the trapezoid body, can account for the small rate decrease seen in some LSO neurons, but could not explain the large rate decrease seen in other LSO neurons at high modulation frequencies. In contrast, both the small and large rate decreases were replicated when KLT channels were included in the LSO neuron model. These results support the conclusion that KLT channels may play a major role in the large rate decreases seen in some units and that background inhibition may be a contributing factor, a factor that could be adequate for small decreases.
doi:10.1007/s10162-011-0300-5
PMCID: PMC3298618
PMID: 22160752
LSO; low-threshold potassium channel; auditory brainstem; cochlear nucleus bushy cells; envelope processing
The directionality of hair cell stimulation combined with the vibration of the basilar membrane causes the auditory nerve fiber action potentials, in response to low-frequency stimuli, to occur at a particular phase of the stimulus waveform. Because direct mechanical measurements at the cochlear apex are difficult, such phase locking has often been used to indirectly infer the basilar membrane motion. Here, we confirm and extend earlier data from mammals using sine wave stimulation over a wide range of sound levels (up to 90 dB sound pressure level). We recorded phase-locked responses to pure tones over a wide range of frequencies and sound levels of a large population of auditory nerve fibers in the anesthetized guinea pig. The results indicate that, for a constant frequency of stimulation, the phase lag decreases with increases in the characteristic frequency (CF) of the nerve fiber. The phase lag decreases up to a CF above the stimulation frequency, beyond which it decreases at a much slower rate. Such phase changes are consistent with known basal cochlear mechanics. Measurements from individual fibers showed smaller but systematic variations in phase with sound level, confirming previous reports. We found a “null” stimulation frequency at which little variation in phase occurred with sound level. This null frequency was often not at the CF. At stimulation frequencies below the null, there was a progressive lag with sound level and a progressive lead for stimulation frequencies above the null. This was maximally 0.2 cycles.
doi:10.1007/s10162-008-0151-x
PMCID: PMC2674197
PMID: 19093151
auditory nerve; phase response; basilar membrane; guinea pig
The brainstem auditory pathway is obligatory for all aural information. Brainstem auditory neurons must encode the level and timing of sounds, as well as their time-dependent spectral properties, the fine structure and envelope, which are essential for sound discrimination. This study focused on envelope coding in the two cochlear nuclei of the barn owl, nucleus angularis (NA) and nucleus magnocellularis (NM). NA and NM receive input from bifurcating auditory nerve fibers and initiate processing pathways specialized in encoding interaural time (ITD) and level (ILD) differences, respectively. We found that NA neurons, though unable to accurately encode stimulus phase, lock more strongly to the stimulus envelope than NM units. The spectrotemporal receptive fields (STRFs) of NA neurons exhibit a pre-excitatory suppressive field. Using multilinear regression analysis and computational modeling, we show that this feature of STRFs can account for enhanced across-trial response reliability, by locking spikes to the stimulus envelope. Our findings indicate a dichotomy in envelope coding between the time and intensity processing pathways as early as at the level of the cochlear nuclei. This allows the ILD processing pathway to encode envelope information with greater fidelity than the ITD processing pathway. Furthermore, we demonstrate that the properties of the neurons’ STRFs can be quantitatively related to spike timing reliability.
doi:10.1523/JNEUROSCI.5422-10.2011
PMCID: PMC3059808
PMID: 21368035
Nucleus angularis; STRF; spectrotemporal tuning; cochlear nuclei; barn owl; response reliability
Responses of single neurons in the inferior colliculus of the chinchilla to amplitude-modulated tones were obtained. In one condition, the modulating waveform was a low-frequency sinusoid (SAM tone). In the other, the modulator was a trapezoid with fixed parameters, used to create trains of brief tone bursts presented at various repetition rates (TRAM tone). Modulation frequency (or repetition rate) was varied over the range from 10 to 200 Hz. Many individual neurons exhibited strong selectivity for modulator type. Neurons with pauser discharge patterns to steady-state tones usually exhibited greater responsiveness to SAM tones than to TRAM. In contrast, neurons that responded transiently to steady-state tones usually exhibited greater responsiveness to TRAM tones than to SAM. Neurons with sustained responses to steady-state tones responded strongly to both types of modulated tones. The selectivity for modulator type suggests that transient neurons may play a different functional role in the representation of envelopes than do other types of neurons.
doi:10.1007/s101620020026
PMCID: PMC3202442
PMID: 12486595
Bilateral cochlear implantation is intended to provide the advantages of binaural hearing, including sound localization and better speech recognition in noise. In most modern implants, temporal information is carried by the envelope of pulsatile stimulation, and thresholds to interaural time differences (ITDs) are generally high compared to those obtained in normal hearing observers. One factor thought to influence ITD sensitivity is the overlap of neural populations stimulated on each side. The present study investigated the effects of acoustically stimulating bilaterally mismatched neural populations in two related paradigms: rabbit neural recordings and human psychophysical testing. The neural coding of interaural envelope timing information was measured in recordings from neurons in the inferior colliculus of the unanesthetized rabbit. Binaural beat stimuli with a 1-Hz difference in modulation frequency were presented at the best modulation frequency and intensity as the carrier frequencies at each ear were varied. Some neurons encoded envelope ITDs with carrier frequency mismatches as great as several octaves. The synchronization strength was typically nonmonotonically related to intensity. Psychophysical data showed that human listeners could also make use of binaural envelope cues for carrier mismatches of up to 2–3 octaves. Thus, the physiological and psychophysical data were broadly consistent, and suggest that bilateral cochlear implants should provide information sufficient to detect envelope ITDs even in the face of bilateral mismatch in the neural populations responding to stimulation. However, the strongly nonmonotonic synchronization to envelope ITDs suggests that the limited dynamic range with electrical stimulation may be an important consideration for ITD encoding.
doi:10.1007/s10162-007-0088-5
PMCID: PMC2538436
PMID: 17657543
sound localization; binaural; inferior colliculus; psychophysics
A simple, biophysically specified cell model is used to predict responses of binaurally sensitive neurons to patterns of input spikes that represent stimulation by acoustic and electric waveforms. Specifically, the effects of changes in parameters of input spike trains on model responses to interaural time difference (ITD) were studied for low-frequency periodic stimuli, with or without amplitude modulation. Simulations were limited to purely excitatory, bilaterally driven cell models with basic ionic currents and multiple input fibers. Parameters explored include average firing rate, synchrony index, modulation frequency, and latency dispersion of the input trains as well as the excitatory conductance and time constant of individual synapses in the cell model. Results are compared to physiological recordings from the inferior colliculus (IC) and discussed in terms of ITD-discrimination abilities of listeners with cochlear implants. Several empirically observed aspects of ITD sensitivity were simulated without evoking complex neural processing. Specifically, our results show saturation effects in rate–ITD curves, the absence of sustained responses to high-rate unmodulated pulse trains, the renewal of sensitivity to ITD in high-rate trains when inputs are amplitude-modulated, and interactions between envelope and fine-structure delays for some modulation frequencies.
doi:10.1007/s10162-008-0141-z
PMCID: PMC2644392
PMID: 18941838
binaural hearing; auditory brainstem model; electric hearing; interaural time delay; cochlear implant
The quality of temporal coding of sound waveforms in the monaural afferents that converge on binaural neurons in the brainstem limits the sensitivity to temporal differences at the two ears. The anteroventral cochlear nucleus (AVCN) houses the cells that project to the binaural nuclei, which are known to have enhanced temporal coding of low-frequency sounds relative to auditory nerve (AN) fibers. We applied a coincidence analysis within the framework of detection theory to investigate the extent to which AVCN processing affects interaural time delay (ITD) sensitivity. Using monaural spike trains to a 1-s broadband or narrowband noise token, we emulated the binaural task of ITD discrimination and calculated just noticeable differences (jnds). The ITD jnds derived from AVCN neurons were lower than those derived from AN fibers, showing that the enhanced temporal coding in the AVCN improves binaural sensitivity to ITDs. AVCN processing also increased the dynamic range of ITD sensitivity and changed the shape of the frequency dependence of ITD sensitivity. Bandwidth dependence of ITD jnds from AN as well as AVCN fibers agreed with psychophysical data. These findings demonstrate that monaural preprocessing in the AVCN improves the temporal code in a way that is beneficial for binaural processing and may be crucial in achieving the exquisite sensitivity to ITDs observed in binaural pathways.
doi:10.1007/s10162-011-0268-1
PMCID: PMC3123442
PMID: 21567250
coincidence detection; interaural time difference; discrimination; binaural; sound localization
The quality of temporal coding of sound waveforms in the monaural afferents that converge on binaural neurons in the brainstem limits the sensitivity to temporal differences at the two ears. The anteroventral cochlear nucleus (AVCN) houses the cells that project to the binaural nuclei, which are known to have enhanced temporal coding of low-frequency sounds relative to auditory nerve (AN) fibers. We applied a coincidence analysis within the framework of detection theory to investigate the extent to which AVCN processing affects interaural time delay (ITD) sensitivity. Using monaural spike trains to a 1-s broadband or narrowband noise token, we emulated the binaural task of ITD discrimination and calculated just noticeable differences (jnds). The ITD jnds derived from AVCN neurons were lower than those derived from AN fibers, showing that the enhanced temporal coding in the AVCN improves binaural sensitivity to ITDs. AVCN processing also increased the dynamic range of ITD sensitivity and changed the shape of the frequency dependence of ITD sensitivity. Bandwidth dependence of ITD jnds from AN as well as AVCN fibers agreed with psychophysical data. These findings demonstrate that monaural preprocessing in the AVCN improves the temporal code in a way that is beneficial for binaural processing and may be crucial in achieving the exquisite sensitivity to ITDs observed in binaural pathways.
doi:10.1007/s10162-011-0268-1
PMCID: PMC3123442
PMID: 21567250
coincidence detection; interaural time difference; discrimination; binaural; sound localization
In reverberant environments, acoustic reflections interfere with the direct sound arriving at a listener’s ears, distorting the binaural cues for sound localization. We investigated the effects of reverberation on the directional sensitivity of single neurons in the inferior colliculus (IC) of unanesthetized rabbits. We find that reverberation degrades the directional sensitivity of single neurons, although the amount of degradation depends on the characteristic frequency (CF) and the type of binaural cues available. When interaural time differences (ITD) are the only available directional cue, low-CF cells sensitive to ITD in the waveform fine time structure maintain better directional sensitivity in reverberation than high-CF cells sensitive to ITD in the envelope induced by cochlear filtering. On the other hand, when both ITD and interaural level difference (ILD) cues are available, directional sensitivity in reverberation is comparable throughout the tonotopic axis of the IC. This result suggests that, at high frequencies, ILDs provide better directional information than envelope ITDs, emphasizing the importance of the ILD-processing pathway for sound localization in reverberation.
doi:10.1523/JNEUROSCI.5517-09.2010
PMCID: PMC2896784
PMID: 20534831
directional sensitivity; interaural level difference; inferior colliculus; interaural time difference; reverberation; sound localization
The auditory system can segregate sounds that overlap in time and frequency, if the sounds differ in acoustic properties such as fundamental frequency (f0). However, the neural mechanisms that underlie this ability are poorly understood. Responses of neurons in the inferior colliculus (IC) of the anesthetized chinchilla were measured. The stimuli were harmonic tones, presented alone (single harmonic tones) and in the presence of a second harmonic tone with a different f0 (double harmonic tones). Responses to single harmonic tones exhibited no stimulus-related temporal pattern, or in some cases, a simple envelope modulated at f0. Responses to double harmonic tones exhibited complex slowly modulated discharge patterns. The discharge pattern varied with the difference in f0 and with characteristic frequency. The discharge pattern also varied with the relative levels of the two tones; complex temporal patterns were observed when levels were equal, but as the level difference increased, the discharge pattern reverted to that associated with single harmonic tones. The results indicated that IC neurons convey information about simultaneous sounds in their temporal discharge patterns and that the patterns are produced by interactions between adjacent components in the spectrum. The representation is “low-resolution,” in that it does not convey information about single resolved components from either individual sound.
doi:10.1152/jn.00516.2007
PMCID: PMC2649952
PMID: 17913991
An auditory neuron can preserve the temporal fine structure of a low-frequency tone by phase-locking its response to the stimulus. Apart from sound localization, however, much about the role of this temporal information for signal processing in the brain remains unknown. Through psychoacoustic studies we provide direct evidence that humans employ temporal fine structure to discriminate between frequencies. To this end we construct tones that are based on a single frequency but in which, through the concatenation of wavelets, the phase changes randomly every few cycles. We then test the frequency discrimination of these phase-changing tones, of control tones without phase changes, and of short tones that consist of a single wavelet. For carrier frequencies below a few kilohertz we find that phase changes systematically worsen frequency discrimination. No such effect appears for higher carrier frequencies at which temporal information is not available in the central auditory system.
doi:10.1371/journal.pone.0045579
PMCID: PMC3446936
PMID: 23029113
When a pure tone or low-numbered harmonic is presented to a listener, the resulting travelling wave in the cochlea slows down at the portion of the basilar membrane (BM) tuned to the input frequency due to the filtering properties of the BM. This slowing is reflected in the phase of the response of neurons across the auditory nerve (AN) array. It has been suggested that the auditory system exploits these across-channel timing differences to encode the pitch of both pure tones and resolved harmonics in complex tones. Here, we report a quantitative analysis of previously published data on the response of guinea pig AN fibres, of a range of characteristic frequencies, to pure tones of different frequencies and levels. We conclude that although the use of across-channel timing cues provides an a priori attractive and plausible means of encoding pitch, many of the most obvious metrics for using that cue produce pitch estimates that are strongly influenced by the overall level and therefore are unlikely to provide a straightforward means for encoding the pitch of pure tones.
doi:10.1007/s10162-011-0305-0
PMCID: PMC3298616
PMID: 22160791
auditory nerve; pitch; frequency; pure tone; phase transitions
When a pure tone or low-numbered harmonic is presented to a listener, the resulting travelling wave in the cochlea slows down at the portion of the basilar membrane (BM) tuned to the input frequency due to the filtering properties of the BM. This slowing is reflected in the phase of the response of neurons across the auditory nerve (AN) array. It has been suggested that the auditory system exploits these across-channel timing differences to encode the pitch of both pure tones and resolved harmonics in complex tones. Here, we report a quantitative analysis of previously published data on the response of guinea pig AN fibres, of a range of characteristic frequencies, to pure tones of different frequencies and levels. We conclude that although the use of across-channel timing cues provides an a priori attractive and plausible means of encoding pitch, many of the most obvious metrics for using that cue produce pitch estimates that are strongly influenced by the overall level and therefore are unlikely to provide a straightforward means for encoding the pitch of pure tones.
doi:10.1007/s10162-011-0305-0
PMCID: PMC3298616
PMID: 22160791
auditory nerve; pitch; frequency; pure tone; phase transitions
Performing sound recognition is a task that requires an encoding of the time-varying spectral structure of the auditory stimulus. Similarly, computation of the interaural time difference (ITD) requires knowledge of the precise timing of the stimulus. Consistent with this, low-level nuclei of birds and mammals implicated in ITD processing encode the ongoing phase of a stimulus. However, the brain areas that follow the binaural convergence for the computation of ITD show a reduced capacity for phase locking. In addition, we have shown that in the barn owl there is a pooling of ITD-responsive neurons to improve the reliability of ITD coding. Here we demonstrate that despite two stages of convergence and an effective loss of phase information, the auditory system of the anesthetized barn owl displays a graceful transition to an envelope coding that preserves the spectrotemporal information throughout the ITD pathway to the neurons of the core of the central nucleus of the inferior colliculus.
doi:10.1152/jn.01162.2006
PMCID: PMC2532515
PMID: 17314241
Widely divergent vertebrates share a common central temporal mechanism for representing periodicities of acoustic waveform events. In the auditory nerve, periodicities corresponding to frequencies or rates from about 10 Hz to over 1,000 Hz are extracted from pure tones, from low-frequency complex sounds (e.g., 1st harmonic in bullfrog calls), from mid-frequency sounds with low-frequency modulations (e.g., amplitude modulation rates in cat vocalizations), and from time intervals between high-frequency transients (e.g., pulse-echo delay in bat sonar). Time locking of neuronal responses to periodicities from about 50 ms down to 4 ms or less (about 20–300 Hz) is preserved in the auditory midbrain, where responses are dispersed across many neurons with different onset latencies from 4–5 to 20–50 ms. Midbrain latency distributions are wide enough to encompass two or more repetitions of successive acoustic events, so that responses to multiple, successive periods are ongoing simultaneously in different midbrain neurons. These latencies have a previously unnoticed periodic temporal pattern that determines the specific times for the dispersed on-responses.
doi:10.1007/s00359-010-0607-4
PMCID: PMC3257830
PMID: 21072522
Periodic sounds; Fundamental pitch; Auditory midbrain; Pulse rate; Response synchrony
In this study, we analyze the processing of low-frequency sounds in the cochlear apex through responses of auditory nerve fibers (ANFs) that innervate the apex. Single tones and irregularly spaced tone complexes were used to evoke ANF responses in Mongolian gerbil. The spike arrival times were analyzed in terms of phase locking, peripheral frequency selectivity, group delays, and the nonlinear effects of sound pressure level (SPL). Phase locking to single tones was similar to that in cat. Vector strength was maximal for stimulus frequencies around 500 Hz, decreased above 1 kHz, and became insignificant above 4 to 5 kHz. We used the responses to tone complexes to determine amplitude and phase curves of ANFs having a characteristic frequency (CF) below 5 kHz. With increasing CF, amplitude curves gradually changed from broadly tuned and asymmetric with a steep low-frequency flank to more sharply tuned and asymmetric with a steep high-frequency flank. Over the same CF range, phase curves gradually changed from a concave-upward shape to a concave-downward shape. Phase curves consisted of two or three approximately straight segments. Group delay was analyzed separately for these segments. Generally, the largest group delay was observed near CF. With increasing SPL, most amplitude curves broadened, sometimes accompanied by a downward shift of best frequency, and group delay changed along the entire range of stimulus frequencies. We observed considerable across-ANF variation in the effects of SPL on both amplitude and phase. Overall, our data suggest that mechanical responses in the apex of the cochlea are considerably nonlinear and that these nonlinearities are of a different character than those known from the base of the cochlea.
doi:10.1007/s10162-010-0255-y
PMCID: PMC3085685
PMID: 21213012
cochlear mechanics; cochlear apex; phase locking; Meriones unguiculatus
In this study, we analyze the processing of low-frequency sounds in the cochlear apex through responses of auditory nerve fibers (ANFs) that innervate the apex. Single tones and irregularly spaced tone complexes were used to evoke ANF responses in Mongolian gerbil. The spike arrival times were analyzed in terms of phase locking, peripheral frequency selectivity, group delays, and the nonlinear effects of sound pressure level (SPL). Phase locking to single tones was similar to that in cat. Vector strength was maximal for stimulus frequencies around 500 Hz, decreased above 1 kHz, and became insignificant above 4 to 5 kHz. We used the responses to tone complexes to determine amplitude and phase curves of ANFs having a characteristic frequency (CF) below 5 kHz. With increasing CF, amplitude curves gradually changed from broadly tuned and asymmetric with a steep low-frequency flank to more sharply tuned and asymmetric with a steep high-frequency flank. Over the same CF range, phase curves gradually changed from a concave-upward shape to a concave-downward shape. Phase curves consisted of two or three approximately straight segments. Group delay was analyzed separately for these segments. Generally, the largest group delay was observed near CF. With increasing SPL, most amplitude curves broadened, sometimes accompanied by a downward shift of best frequency, and group delay changed along the entire range of stimulus frequencies. We observed considerable across-ANF variation in the effects of SPL on both amplitude and phase. Overall, our data suggest that mechanical responses in the apex of the cochlea are considerably nonlinear and that these nonlinearities are of a different character than those known from the base of the cochlea.
doi:10.1007/s10162-010-0255-y
PMCID: PMC3085685
PMID: 21213012
cochlear mechanics; cochlear apex; phase locking; Meriones unguiculatus