Alzheimer’s disease (AD) research has recently witnessed a great deal of activity focused on developing new statistical learning tools for automated inference using imaging data. The workhorse for many of these techniques is the Support Vector Machine (SVM) framework (or more generally kernel based methods). Most of these require, as a first step, specification of a kernel matrix
between input examples (i.e., images). The inner product between images Ii and Ij in a feature space can generally be written in closed form, and so it is convenient to treat
as “given”. However, in certain neuroimaging applications such an assumption becomes problematic. As an example, it is rather challenging to provide a scalar measure of similarity between two instances of highly attributed data such as cortical thickness measures on cortical surfaces. Note that cortical thickness is known to be discriminative for neurological disorders, so leveraging such information in an inference framework, especially within a multi-modal method, is potentially advantageous. But despite being clinically meaningful, relatively few works have successfully exploited this measure for classification or regression. Motivated by these applications, our paper presents novel techniques to compute similarity matrices for such topologically-based attributed data. Our ideas leverage recent developments to characterize signals (e.g., cortical thickness) motivated by the persistence of their topological features, leading to a scheme for simple constructions of kernel matrices. As a proof of principle, on a dataset of 356 subjects from the ADNI study, we report good performance on several statistical inference tasks without any feature selection, dimensionality reduction, or parameter tuning.
between input examples (i.e., images). The inner product between images Ii and Ij in a feature space can generally be written in closed form, and so it is convenient to treat
as “given”. However, in certain neuroimaging applications such an assumption becomes problematic. As an example, it is rather challenging to provide a scalar measure of similarity between two instances of highly attributed data such as cortical thickness measures on cortical surfaces. Note that cortical thickness is known to be discriminative for neurological disorders, so leveraging such information in an inference framework, especially within a multi-modal method, is potentially advantageous. But despite being clinically meaningful, relatively few works have successfully exploited this measure for classification or regression. Motivated by these applications, our paper presents novel techniques to compute similarity matrices for such topologically-based attributed data. Our ideas leverage recent developments to characterize signals (e.g., cortical thickness) motivated by the persistence of their topological features, leading to a scheme for simple constructions of kernel matrices. As a proof of principle, on a dataset of 356 subjects from the ADNI study, we report good performance on several statistical inference tasks without any feature selection, dimensionality reduction, or parameter tuning.Index Terms: Cortical thickness based kernels, topological persistence, Alzheimer’s disease, ADNI



The publisher's final edited version of this article is available at
{+1, −1} (or yi
) in a classification (or regression) setting. X comprises the cortical data of both AD and control groups. The original signal defined on the cortical surface is noisy and it is necessary to estimate the unknown signal to increase signal-to-noise ratio and smoothness of the signal for subsequent analysis. This is achieved via image smoothing over
. Making use of the mean signal is not ideal for constructing similarity matrices which promote separability, due to signal attenuation. Such a process will make it rather difficult to identify subtle, but clinically relevant variations. Further, substantial overlap in the class distributions confounds inference. In contrast, our proposed method is motivated by the topological characterization of the signal which makes no assumption on point-wise correspondences of cortical measure on a naturally formed spherical atlas. Hence, no assumption is made regarding a specific pre-processing tool for extraction, allowing the proposed framework to handle disparities among spherical atlases of individual subjects (generated by some pre-processing tool of choice). As a result, the collection of critical points on the mean signal is adequate to characterize the topology of the signal. By pairing the critical points in a nonlinear fashion, we construct scatter plot for each image, which encodes the topological properties of the input signal. The concentration maps of such scatter plots are then used to learn the disease patterns by employing off-the-shelf ML toolbox. We now present details of the algorithm in the following subsections.
) provides the heat kernel Kσ as
is written as a convolution:
f, Ylm
is the Fourier coefficient. This is a more flexible spectral approach that explicitly represents the solution to the diffusion equation analytically.
0 for these neuropsychological scores. In other words, the estimated values of individual cognitive scores were discriminative for clinically different groups. This also helps explain the fact that gray matter atrophy, and thus topological features lead to cognitive decay.
(zenith and azimuthal respectively) are associated with spherical atlas. (Top) Differences in means at each vertex, (AD vs. control for left and right hemispheres