­Department of Electrical & Control Instrumentation, Kaunas University of Technology, Lithuania * Intelligent Systems Laboratory, Halmstad University,

  • View
    215

  • Download
    0

Embed Size (px)

Text of ­Department of Electrical & Control Instrumentation, Kaunas University of Technology,...

  • Categorizing Sequences of Laryngeal Datafor Decision SupportDepartment of Electrical & Control Instrumentation, Kaunas University of Technology, Lithuania* Intelligent Systems Laboratory, Halmstad University, Sweden Department of Otolaryngology, Kaunas University of Medicine, LithuaniaA. Gelinis, E. Vaiiukynas, E. Kelertas, M. Baauskien, A. Verikas*, V. Uloza, A. VegienThe 4th International Conference onElectrical & Control Technologies

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • COST Action 2103Can we be more dedicated 2 basic human needs?eHealth investigates modern best practices in the use of Information and Communication Technologies (ICT) as tools for enhancing health promotion, health protection, quality of service, accessibility and efficiency in all aspects of health care delivery (to meet all health needs of society).COST Action 2103 Advanced Voice Function Assessment combine previously unexploited techniques with new theoretical developments to improve voice production models and analysis algorithms with a view to voice quality & diseases.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Decision Support SystemIn clinical practice, decision making is quite often based on subjective evaluation of patient data.Long-term goal of our work is decision support system for automated diagnostics of laryngeal diseases, using such information sources:voice signal of patient (recorded with microphone);images of vocal folds (laryngeal videostroboscopy);questionnaire data.In this experiment we explore kernel-based correlative learning techniques on information extracted from videostroboscopy (sequences of images) for categorizing laryngeal disorders.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Laryngeal VideostroboscopyWell-established technique for measuring the glottal gap, glottic closure, glottic angle and the angular velocities of vocal fold abduction and adduction (with single-ash-timing).Multiple fundamental frequencies may be recorded in the case of some diseases, such as polyps, nodules, and cysts (with multiple-ash-timing).Duration = 20 80 secondsResolution = 768 576 pixelsFrame rate = 25 frames / secondData used is from Department of Otolaryngology, Kaunas University of Medicine, Lithuania.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Categories of Laryngeal DataMass lesions of vocal folds = [ healthy | diffuse | nodular ]

    Nodular lesions (localized thickenings):nodulespolypscysts

    Diffuse lesions:papillomatahyperplasticlaryngitis w/ keratosiscarcinoma

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Healthy | Diffuse | Nodular

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Color Space & FeaturesSame incident light - different RGB values (by different acquisition equipment). Transform space!Similarity of two colors from their distance in the space is evaluated by using L*a*b color space.Multi-scale retinex theory-based color image en- hancement mitigates capturing condition variation.Color distribution is expressed by 3D color histogram of N bins as a random N-vector, which consists of N = 4096 (161616) bins.Compact description of histogram is obtained by removing not so important bins from all images.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Nonlinear Features ExtractionKernel Principal Component Analysis (Scholkopf)orthogonal transformation of coordinate system of datapowerful method of extracting nonlinear features for classication (finding nonlinear relationships in data)nonlinear generalization of PCA via the kernel tricklinear PCA in the feature space = KPCA in the inputhelps avoid overtting the distributions in the possibly innite-dimensional feature space F (or RKHS)only the rst few important eigenvectors are kept, effectively regularizing the resulting distributionsfirst principal component = most of variability in the dataSolving the eigenvalue problem of kernel PCA is to diagonalize the kernel (data covariance) matrix.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Kernel Methods for Classificationkernelpatternanalysisalgorithmidentified pattern (data class) functiondatasubspaceK(X,Z)LS-SVMf(x)=i(xi,x)Kernel trick: K(X, Z) = K(,X), K(,Z) = (X), (Z)

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Support Vector Machine Classifier

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Kernel Trick in SVMPrimal representation (inputs)

    Dual representation (features)

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Primal & Dual Problems in SVMPrimal problem (margin maximization)

    Dual problem (quadratic programming)

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • SVM Solution Support VectorsLagrangian with Lagrange multipliers :

    Conditions for optimality (saddle point):

    Primal representation / Dual representation

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Least Squares SVM (LS-SVM)Primal problem (margin maximization)

    Dual problem (quadratic programming)

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • LS-SVM Solution Support VectorsLagrangian with Lagrange multipliers :

    Conditions for optimality (saddle point):

    Primal representation / Dual representation

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Ensemble-based MatchingEnsemble matching methods generally consider the task of obtaining a similarity function which operates on pairs of sets of feature vectors, or pairs of ensembles.Video is considered as an ensemble of feature vectors, where each vector is extracted from corresponding video frame image.The alignment of the two subspaces spanned by the elements of the two ensembles is used as a measure of similarity.The kernel principal angle is the angle between the principal subspaces of two matrices, each matrix composed of feature vectors as columns.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Kernel Principal AnglesPrincipal angles are angles between a pair of vector sets in two linear subspaces (also relates to principal correlation). If = 0, the cosine is 1 and vector sets are parallel.If = /2 , the cosine is 0 and vector sets are orthogonal.

    Quantities cos(k) are often referred to as principal correlations or canonical correlations of the matrix pair.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Sequence Kernel Function (KPA)Kernelized Modified Gram-Schmidt Orthogonalization used Kernel trick was used to compute principal angles in the feature space induced by a minor Gaussian (RBF) kernel: K(x, z) = (x), (z) = kminor(x,z)

    Instead of dealing with the set-elements x in input space, we work with their mappings in feature space (x), which are accessible only as scalar products.

    Using a linear minor kernel (polynomial degree 1) is equivalent to computing principal angles in input space. Positive-definite kernel (similarity metric) for our LS-SVM:K (Ai, Aj) f (Ai, Aj) = k=1..n cos2(k)

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Experimental SetupAvailable data from 30 patients has:10 patients evaluated as belonging to the nodular class;12 patients evaluated as belonging to the diffuse class;8 patients evaluated as belonging to the healthy class.There were 100 image frames in each 4 s image sequence, but only 10 from the middle were used.Data normalized to 0 mean and variance 1.Polynomial kernel of degree q = 1, 2, 3 in KPCA.Gaussian kernel as a minor kernel to estimate the KPA kernel (measure over a pair of matrices).Due to the small number of data points (image sequences) leave-one-out test was chosen for %.

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Results Number of KPC

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Results Number of KPC & Gamma

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Results Gamma & Sigma

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • Summary of ExperimentLeast squares support vector machine (LS-SVM) was constructed in Matlab for classification into 3 classes, namely healthy | nodular | diffuse.Kernel function (KPA kernel) employed by LS-SVM was designed to be used over a pair of matrices, rather than over a pair of vectors, to kernelize principal angles between subspaces.Different number of kernel principal components was extracted from N-vector histogram & tested.Disease recognition is done on the premises, that:different disorders generate different subspacesprincipal angles between subspaces can be measured

    Categorizing Sequences of Laryngeal Data for Desicion Support

  • ConclusionsIn this paper, we have investigated an approach to an automated analysis of sets of vocal cord images aiming to categorize laryngeal disorders.Sequences of color laryngeal images were categorized into the healthy, nodular and diffuse classes using an LS-SVM with a kernel dened over a pair of matrices.Bearing in mind the high similarity of the decision classes, encouraging classication performance was obtained when testing the developed tools on data recorded during routine videostroboscopy.Relatively high classication accuracy obtained (85,7%) encourages further studies in this area.

    Categorizing Sequences of Laryngeal Data for Desicion Support

    European Cooperation in the field of Scientific & Technical Research (COST)health so