Surgery and Animal Preparation
Experiments were conducted on three hemispheres (two left and one right hemispheres) of two adult female cats. All protocols were approved by the University of California San Francisco Committee on Animal Research in accordance with federal guidelines for care and use of animals in research. Animals were sedated by intramuscular injections of a mixture of ketamine (22 mg/kg) and acepromazine (0.11 mg/kg). After venous cannulation, sodium pentobarbital (15–30 mg/kg) was administered and supplemented as needed throughout the surgical procedure. Following tracheotomy, a craniotomy was performed to expose the ectosylvian gyrus. The dura mater was partially removed, and the cortical surface was covered with viscous silicone oil. Before commencing the electrophysiological recordings, sodium pentobarbital anesthesia was replaced with a continuous intravenous infusion of a mixture of ketamine (2–10 mg/kg/h) and diazepam (0.05–0.2 mg/kg/h) in lactated Ringers (1–3 ml/kg/h). To prevent edema and mucus secretion, dexamethasone (1.2 mg/kg, S.C.) and atropine sulfate (0.04 mg/kg, S.C.) were administered every 12 hours. Since recordings lasted for three to four days, an antibiotic (cephalosporin, 11 mg/kg, I.V.) was administrated to prevent wound infection. Body temperature was monitored and maintained by a water heating pad at 37±1°C. Electrocardiogram and respiration rate were monitored continuously during surgery and recording procedures.
Acoustic Stimulus and Extracellular Recordings
Experiments were conducted in a double-walled, anechoic chamber (Industrial Acoustics, Bronx, NY). Stimuli were delivered by a STAX-54 headphone through a sealed tube into the acoustic meatus contralateral to the studied hemisphere. The system frequency transfer function was flat (±6 dB) up to 14 kHz and rolled off 10 dB/octave at higher frequencies.
Two different stimuli, pure tone bursts and click trains, were presented for measuring frequency response areas (FRAs) and RRTFs, respectively. Pure tone stimuli of 50ms duration (including 3-ms linear rise and fall time) were generated at intervals of 400–750 ms by a microprocessor (TMS32010, 16 bits resolution and 120 kHz digital-to-analog sampling rate). FRAs were mapped by presenting 675 pseudo-randomized tone bursts at 45 different frequencies (3–5 octave range) and 15 sound levels (70 dB range in 5 dB steps). For RRTFs, click trains (monopolar, rectangular pulses of 200 µs duration; 500 ms train duration) were systematically presented 15 times for repetition rates from 1 to 38 Hz (1, 6, 10, 14, 18, 22, 26, 30, 34, and 38 Hz) at sound levels of 82–102 dB SPL (peak equivalent). The relatively high levels were applied to enhance synchrony among the multi-unit responses. For sites with non-monotonic rate-level functions values at the lower end of the range were used. For some recording sites, higher repetition rates were presented (up to 250 Hz).
Parylene- or epoxylite-coated tungsten microelectrodes (Micro Probes, Potomac, MD or Frederic Haer & Co., Bowdoinham, ME) with 0.5–4 MΩ impedance at 1 kHz were used for multi-unit recordings. Single or double microelectrodes were advanced perpendicular to the cortical surface with a hydraulic microdrive (David Kopf Instruments, Tujunga, CA). A video picture of the cortical surface was captured and digitized with a CCD digital camera (Cohu, San Diego, CA). Each recording site was marked on the digitized picture using Canvas software (Deneva, Miami, FL). The marked sites were used to reconstruct tessellation maps of the recording area (see below). Neuronal activity was obtained in thalamocortical recipient layers
[69]. Action potentials were amplified and band-pass filtered (0.3–10 kHz; World Precision Instruments, Sarasota, FL, and Axon Instruments, Union City, CA), fed to an oscilloscope, and isolated from background noise with a time/amplitude window discriminator (BAK Electronic, Mount Airy, MD). For FRAs and RRTFs, spikes occurring in the first 50 ms or 550ms, respectively, after stimulus onset were recorded at 10 or 100 µs resolution for the analyses.
Data Analysis
Data were analyzed using the MATLAB (Mathwork, Natick, MA) platform. StatView (SAS Institute, Cary, NC) was used for statistical analysis.
Spectral receptive field parameters such as CF, minimum threshold, quality factors, and response latency were measured
[30]. Threshold was defined as minimum excitatory SPL, and estimated at 5 dB resolution. CF was defined as the frequency at which a single neuron or neuron cluster produced sound-evoked spikes at threshold sound level. Spectral bandwidths were calculated as CF divided by excitatory bandwidth at 40 dB (Q40) above threshold; the higher the Q-value, the more sharply tuned are the neurons. Latency was determined as the minimum value in the averaged latency-level function at CF and the two adjacent test-frequencies (CF 1/15 to 1/9 octaves). Results for spectral receptive field parameter distributions in AAF were presented elsewhere
[30].
For RRTFs, spike occurrence to the first click was discarded except for the 1 Hz stimulus since it does not contribute to repetition information. Spikes were counted from the second click onset to 550 ms after the first click onset (for 1 Hz stimulus, spikes occurring between the first click onset and 550 ms were used). VS and FR were used to measure temporal following activity
[6],
[50]. VS measures how well spikes are synchronized to the clicks relative to the duration of the repetition period:
where
n is the total number of spikes,
t is time of spike occurrence, and
T is the inter-click interval
[70]. Significance of synchronization was examined by a Rayleigh test (
p<0.001). Bin width was 1 ms. Repetition rate tuning curves were constructed without smoothing across different repetition rates. Best repetition rate was defined as that a repetition rate that evoked the largest response strength for VS or FR. RRTF tuning curves for VS and FR were classified into three filter types. A band-pass filter was assigned when the response peak was flanked by troughs in which the responses drop <75% of the peak
[71]. If one of the response troughs did not reach the criterion, then RRTFs were considered to be either a low- or high-pass filter. Although most recordings were made from multi-units, past studies have shown that single- and multi-unit recordings share similar RRTFs or modulation transfer functions
[50],
[72],
[73].
ISIs between two consecutive spikes were measured in the time window of 550 ms with a bin width of 1 ms for each trial and accumulated across all 15 trials. Spike train irregularity in ISIs was estimate based on the CV that was defined as the standard deviation of ISIs divided by the mean of ISIs.
Voronoi-Dirichlet Tessellation Map
To reconstruct the spatial distribution of receptive field or temporal parameters across the cortical surface, tessellation maps were calculated by Voronoi-Dirichlet tessellation
[74]. The polygon surrounding each electrode penetration in the tessellation map characterizes the area assigned to the functional parameter at the recording site. Borders between neighboring polygons were determined from the midpoints of a straight line between adjacent recording points. The value of each receptive field or temporal parameter in the cortical surface map is illustrated by color code.
Mutual Information Analysis
The MI of the repetition rate carried in the FR was computed based on 15 presentations of the same set of repetition rates. MI analyses were limited to six different repetition rates (1, 6, 10, 14, 22, and 30 Hz for which we obtained data sets for a majority of recording sites). MI between repetition rate
f and firing rate
fr is given by

. In our case, all repetition rates
f were presented the same number of times, so that
where
Nf
=

6 was the number of different repetition rates. To account for the fact that MI is positively biased
[75],
[76], the values were linearly extrapolated to infinite dataset limit (i.e., number of repetitions; not to the limit of infinite word length). Extrapolation was done by removing different sets of one, two, three, or four presentations at a time. The final value and its standard deviation was obtained as a result of a linear fit in
1/Nf, each repeated 15 times for different combinations of dropped presentations.
MI between repetition rate and VS was evaluated similarly. VS values were calculated for each stimulus presentation to form distributions of VS values associated with each stimulus periodicity. The MI conveyed by the VS code quantifies how well these distributions (and thus stimulus repetition rates) can be distinguished from each other. Non-significant VS measures were assigned a MI of zero bits/stimulus (Rayleigh test, p>0.001). In the case of information carried by ISIs, the distribution of ISIs P(isi|f) was computed for each stimulus repetition rate f and averaged across repeated stimulus presentations. These information values were then also extrapolated to the infinite dataset size, according to procedures described above.
Additive information values (Code(x) + Code(y) in ) represent the sum of information values computed for each pattern of neural responses separately, with separate extrapolation to infinite dataset size. Joint information values (Code(x) × Code(y) in ) were computed based on joint probabilities of two measures of neural responses, such as VS and FR (, white bars); extrapolation to infinite dataset size in this case was based on recomputation of these joint probabilities from fractions of the data, and then using a linear extrapolation with respect to the inverse of the dataset size to find the value for infinite dataset size.
Spatial Organization Analysis
The existence of spatial organization for experimental variables was established using two complementary approaches. Spatial autocorrelation, a measure of redundancy, was used to estimate global spatial organization by calculating Geary's C coefficient
[77]. C values are based on value differences between pairs of observations and can vary between 0, indicating perfect positive spatial correlation (high spatial uniformity, maximal neighbor similarity), and 2, indicating negative spatial correlation (maximal dispersion, high value contrast between neighbors). Random spatial distribution (the null-hypothesis) results in a C value of 1. In a Monte-Carlo analysis, the statistical significance of the experimental C value was derived from the C-value distribution of 10,000 randomly redistributed map versions.
Local spatial organization was assessed through the value similarity between each polygon and its nearest neighbors. Statistically significant similarity between a polygon and its direct neighbors was determined by comparison with 10,000 redistributions of the neighboring polygon values. The number of significant polygons in a given experimental map was compared to the number of significant polygons in 1,000 randomized maps. The number of significant polygons estimates the proportion of local parameter clusters. Neither of the two tests takes into account where in the map local or global similarities are situated. However, the larger the number of local clusters, the higher is the probability of a confluence of them, increasing global organization and, thus, spatial autocorrelation.