|Home | About | Journals | Submit | Contact Us | Français|
Face recognition technology has evolved over years with the Principal Component Analysis (PCA) method being the benchmark for recognition efficiency. The face recognition techniques take care of variation of illumination, pose and other features of the face in the image. We envisage an application of these face recognition techniques for classification of medical images. The motivating factor being, given a condition of an organ it is represented by some typical features. In this paper we report the use of the face recognition techniques to classify the type of Esophagitis, a condition of inflammation of the esophagus. The image of the esophagus is captured in the process of endoscopy. We test PCA, Fisher Face method and Independent Component Analysis techniques to classify the images of the esophagus. Esophagitis is classified into four categories. The results of classification for each method are reported and the results are compared.
Medical Decision Support Systems have been evolving with the advances in the technologies in the related fields. The Decision is based on various forms of data like text, image, video and signals. The analysis and feature determination with the various forms of data have allowed the Medical Diagnosis field to incorporate them for efficient diagnosis.
Face Recognition field has evolved since the reporting of the PCA method . Literature reports a number of algorithms based on PCA, which form the basis of numerous studies in the psychological and algorithmic approaches [2-6]. The face recognition algorithms have been focusing on accurate recognition of the face under various conditions like lighting, face pose, tilt and possible changes in features of the face due to aging . In Medical Imaging field, the process of extracting the features is a general approach. The major task is to design a classifier with the determined features. The classifier performance greatly depends on the type of features and the input dimensions of features. There are numbers of methods [8-9] suggested to reduce the input dimensions of the classifier with an attempt to improve the classification efficiency. One of the statistical methods in reduction of dimension is PCA method which greatly reduces the dimensionality of the data set while retaining the majority of the variation present in the data set. The combination of PCA and Linear Discriemnant tries to group the data for better classification. The Independent Compo-nent method approach tries to get better classification by modeling data as Independent components.
In this paper we report the use of Face Recognition Techniques for classification of Esophagitis. Esophagitis is a condition of inflammation of the Esophagus. The condition is detected in the process of endoscopy of the Upper GastroIntestinal (GI) tract. The Upper GI tract consists of three organs namely the Esophagus, Stomach and the Duodenum . The condition of Esophagitis occurs due to inflammation and subsequent erosion of the esophageal mucosa. On the onset of Esophagitis the subject complains of heart-burn during intake of food . The most common cause is gasteroesophageal reflux (GRED) . The onset maybe due to laxity of the lower esophageal sphincter (LES) that separates the stomach and the esophagus (Fig. 11) wherein the contents of the stomach at a pH level of 2-4 escape into the esophagus area with a pH level of 6-7 or due some drug induced inflammation. Some studies investigate lifestyle traits to study the onset of Esophagitis . The guidelines for the most commonly occurring Esophagitis is listed in . The process of endoscopy facilitates observation of the internal mucosa of the GI tract. The observation helps the expert to arrive on the diagnosis .
The process of endoscopy generates a lot of images which form as good source to test the algorithms. There are two popular methods of Classification – 1) Savary-Miller Method  and 2) Los Angeles Classification method . The Savary- Miller method classifies Esophagitis in four grades Grade I, II, III and IV based on the inflammatory lesions. The severity of Esophagitis increases with the grade. The Los Angeles method is based on the extent of mucosal breaks and the spread on the esophageal circumference. The method classifies Esophagitis in four grades namely Grade A, B, C and D. The mucosal break and its spread over the circumference of the esophagus increases from Grade A to Grade D. We will be adopting the Savary-Miller Method. Results on the following three popular algorithms are reported namely PCA (Eigen face method), Fisherface (PCA-LDA) method  and Independent Component Analysis (ICA)  techniques to classify the images of the esophagus. The results of classification for each method are reported and the results are compared.
The rest of the paper is organized as follows : the Materials and Methodology section will explain the process of data collection, data labeling, details of the algorithms and their implementation. In the Results section we report the classification results of each of the methods. In the Discussion section we compare the results by these methods. In the Conclusion section we conclude by commenting on the technologies and its efficacy for the domain of application.
The images of esophagus are captured in the process of endoscopy. The images are categorized into four categories namely in four grades Grade I, II, III and IV based on the inflammatory lesions of esophagus. The Grade IV comes under the category of severe Esophagitis involving bleeding. We consider only Grade I, II and III, the fourth category being the Normal Esophagus (Fig. 22). The images are segregated and each Grade image is treated as image of a person with different pose and feature variation. Therefore we have four faces to recognize/classify.
Seventy images per category are saved and they are subjected to the process of face recognition algorithms with varying train and test ratios. The Algorithms tested are PCA (Eigen Face) method, PCA- Linear Discriminate Analysis (Fisher Faces) method and ICA method. The details of the algorithms follows.
PCA is one of the earliest methods suggested in Face Recognition techniques. The method also serves as a benchmark for performance on any new techniques proposed. It is popularly known as eigen face method. PCA is statistical method used for reduction of dimensions of a given dataset. The essence of the dataset is represented by principal components. In face recognition PCA reduces the dimensions by exploiting the similarities between images. The process of PCA is discussed below
The efficiency of classification and the confusion matrix for this method is reported in the Results section.
Developed in 1997 by P. Belhumeur et al., The method is based on Fisher’s Linear Discriminant Analysis is faster than eigenfaces, in some cases, has lower error rates, works well even if different illumination, works well even if different facial express. LDA seeks directions that are efficient for discrimination between the data. Fisher method tries to shape the scatter to have efficient classification. The method maximizes the ratio of the between-class scatter and the within-class scatter.
The procedure for the fisherface method is discussed below
The Fisherface method creates a set of projection vectors by using these scatter matrices to maximize between-class measure while simultaneously minimizing within-class scatter, thus yielding superior results. This is illustrated in the results section.
Independent component analysis (ICA) is a method for finding underlying factors or components from multivariate (multi-dimensional) statistical data. What distinguishes ICA from other methods is that it looks for components that are both statistically independent, and nonGaussian. While PCA seeks directions that represent data best in a sense, ICA seeks such directions that are most independent from each other.
ICA does not require the orthonormalization of vectors, which allows higher-order dependencies in image pixels. ICA removes the first and second order statistics by sphering the data, the process makes the data uncorrelated. Each image is stored as a row vector in X, which is multiplied by the whitening matrix Wz =2cov(X)1/2. Whitening is a process that transform the observed vector X linearly so that we obtain a new vector which is white, i.e. its components are uncorrelated and their variances equal unity. In other words, the covariance matrix of equals the identity matrix:
ICA finds statistically independent images, represented by the rows in the matrix A, that are mixed together with matrix W such that A = WX. The rows are analogous to eigen faces and columns of W-1 are weights of each image. The amount of computation makes the ICA process slow compared to the PCA and Fisherface method.
Based on the expert opinion, the images of esophagus were segregated in four classes namely Grade I, Grade II, Grade III Esophagitis and Normal Esophagus. We have seventy images in each class. The dataset is split into train:test ratio. The first phase is training and the second phase is testing. After each run the classification efficiency is determined. The results for the mentioned algorithm are illustrated in the Table 11. The classification efficiency is based on the confusion matrix determined after each test run. The confusion matrix is a visualization tool predictive analysis . A typical confusion matrix for two class classifier is shown in (Fig. 33). The following parameters can be determined from the confusion matrix –
From the figure the parameters are as follows -True Positive (TP) is classifier reporting the same label as known label, False Negative (FN) is classifier reporting negative even if the sample is labeled positive, False Positive (FP) is classifier reporting as positive even though the sample is negative, True Negative (TN) confirming that the label does not belong to that class. The parameters that can be determined by the confusion matrix are as follows-
The specificity and sensitivity parameter for each method is shown in Table 22.
From the results it is observed that the PCA-LDA method has the highest efficiency and it improves with the increasing ratio of training. The amount of time required for classification is largest in ICA method and significantly low in PCA and PCA-LDA method. The efficiency shown in Table 11 is for the overall efficiency of the classification. It is illustrated that the Specificity (Sp) and Sensitivity (Se) values are high for Grade II and Grade III type for Esophagitis and the values are highest for PCA-LDA method, thus justifying the result for maximizing the between-class measure while simultaneously minimizing within-class scatter.
The motivation for the study is based on the paper on the study conducted for classification of White Blood Cells using Eigen method . The paper reports classification percentage of various blood cells in the range 90% to 100%. The comparative study of various methods as applied to face recognition with application to face book is presented in . The Face Evaluator a package implemented using MATLAB has been extensively used for this paper. Literature reports use of PCA for semantic based image retrieval system  that can discriminate between images differing only through subtle, domain specific cues, which is a characteristic feature of many medical images. The efficient retrieval of images also reflects on the goodness of the feature set selected. These experiments help to get better feature sets which can be used to train either a Neural Network or a Support Vector Machine to get better classification. The classification using the statistical method is also dependent on the distance measure used in the classification .
Based on the results it can be seen that with proper choice of train and test ratios, the method is suitable for classification. This method can be modified to display the images to which the test image is nearest. This feature will include the explanation for the classification. This is a requirement for the Decision Support System. The disadvantage being that with the dataset increasing the amount of memory required increases. This can be taken care by periodically, updating the database by discarding the least significant images.
A complete system with a User Interface can be proposed with tools to manage the training set, with a facility to do away with the images which do not significantly help the process of classification, effectively reducing the dataset required for efficient classification.