A Clustering Approach to Integrative Analysis of Multiomic Cancer Data

02/10/2022
by   Dongyan Yan, et al.
0

Rapid technological advances have allowed for molecular profiling across multiple omics domains from a single sample for clinical decision making in many diseases, especially cancer. As tumor development and progression are dynamic biological processes involving composite genomic aberrations, key challenges are to effectively assimilate information from these domains to identify genomic signatures and biological entities that are druggable, develop accurate risk prediction profiles for future patients, and identify novel patient subgroups for tailored therapy and monitoring. We propose integrative probabilistic frameworks for high-dimensional multiple-domain cancer data that coherently incorporate dependence within and between domains to accurately detect tumor subtypes, thus providing a catalogue of genomic aberrations associated with cancer taxonomy. We propose an innovative, flexible and scalable Bayesian nonparametric framework for simultaneous clustering of both tumor samples and genomic probes. We describe an efficient variable selection procedure to identify relevant genomic aberrations that can potentially reveal underlying drivers of a disease. Although the work is motivated by several investigations related to lung cancer, the proposed methods are broadly applicable in a variety of contexts involving high-dimensional data. The success of the methodology is demonstrated using artificial data and lung cancer omics profiles publicly available from The Cancer Genome Atlas.

READ FULL TEXT
research
02/20/2022

Pairwise Nonlinear Dependence Analysis of Genomic Data

In The Cancer Genome Atlas (TCGA) dataset, there are many interesting no...
research
04/26/2017

Network-based coverage of mutational profiles reveals cancer genes

A central goal in cancer genomics is to identify the somatic alterations...
research
12/29/2022

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Genomic Data

Rapid advancements in collection and dissemination of multi-platform mol...
research
02/28/2013

Bayesian Consensus Clustering

The task of clustering a set of objects based on multiple sources of dat...
research
05/21/2020

Using the "Hidden" Genome to Improve Classification of Cancer Types

It is increasingly common clinically for cancer specimens to be examined...
research
03/11/2018

A pathway-based kernel boosting method for sample classification using genomic data

The analysis of cancer genomic data has long suffered "the curse of dime...

Please sign up or login with your details

Forgot password? Click here to reset