Department of Computer Engineering
MS THESIS PRESENTATION
Identification of Cancer Patient Subgroups via Pathway Based Multi-View Graph Kernel Clustering
Ali Burak Ünal
(Supervisor: Asst. Prof. Dr. Öznur Taştan Okan)
Computer Engineering Department
Characterizing patient genomic alterations through next-generation sequencing technologies opens up new opportunities for refining cancer subtypes. Different omics data provide different views into the molecular biology of the tumors. How- ever, tumor cells exhibit high levels of heterogeneity, and different patients harbor different combinations of molecular alterations. On the other hand, different alterations may perturb the same biological pathways. In this work, we propose a novel clustering procedure that quantifies the similarities of patients from their alteration profiles on pathways via a novel graph kernel. For each pathway and patient pair, a vertex labeled undirected graph is constructed based on the patient molecular alterations and the pathway interactions. The proposed smoothed shortest path graph kernel (smSPK) assesses similarities of pair of patients with respect to a pathway by comparing their vertex labeled graphs. Our clustering procedure involves two steps. In the first step, the smSPK kernel matrices for each pathway and data type are computed for patients pairs to construct multiple kernel matrices and in the ensuing step, these kernel matrices are input to a multi-view kernel clustering algorithm to stratify patients. We apply our methodology on 361 renal cell carcinoma patients, using somatic mutations, gene and protein expressions data. This approach yields subgroup of patients that differ significantly in their survival times (p-value ? 0.005). The proposed methodology allows integrating other type of omics data and provides insight into disrupted pathways in each patient subgroup.
DATE: 31 July 2017, Monday @ 13:30