Cancer-inspired Genomics Mapper Model for the Generation of Synthetic DNA Sequences with Desired Genomics Signatures

05/01/2023
by   Teddy Lazebnik, et al.
0

Genome data are crucial in modern medicine, offering significant potential for diagnosis and treatment. Thanks to technological advancements, many millions of healthy and diseased genomes have already been sequenced; however, obtaining the most suitable data for a specific study, and specifically for validation studies, remains challenging with respect to scale and access. Therefore, in silico genomics sequence generators have been proposed as a possible solution. However, the current generators produce inferior data using mostly shallow (stochastic) connections, detected with limited computational complexity in the training data. This means they do not take the appropriate biological relations and constraints, that originally caused the observed connections, into consideration. To address this issue, we propose cancer-inspired genomics mapper model (CGMM), that combines genetic algorithm (GA) and deep learning (DL) methods to tackle this challenge. CGMM mimics processes that generate genetic variations and mutations to transform readily available control genomes into genomes with the desired phenotypes. We demonstrate that CGMM can generate synthetic genomes of selected phenotypes such as ancestry and cancer that are indistinguishable from real genomes of such phenotypes, based on unsupervised clustering. Our results show that CGMM outperforms four current state-of-the-art genomics generators on two different tasks, suggesting that CGMM will be suitable for a wide range of purposes in genomic medicine, especially for much-needed validation studies.

READ FULL TEXT
research
12/16/2021

A molecular generative model with genetic algorithm and tree search for cancer samples

Personalized medicine is expected to maximize the intended drug effects ...
research
07/22/2022

Deep Learning Hyperparameter Optimization for Breast Mass Detection in Mammograms

Accurate breast cancer diagnosis through mammography has the potential t...
research
07/27/2019

Genetic Deep Learning for Lung Cancer Screening

Convolutional neural networks (CNNs) have shown great promise in improvi...
research
07/09/2020

Identifying efficient controls of complex interaction networks using genetic algorithms

Control theory has seen recently impactful applications in network scien...
research
10/22/2018

Bayesian multi-domain learning for cancer subtype discovery from next-generation sequencing count data

Precision medicine aims for personalized prognosis and therapeutics by u...
research
11/22/2020

Topological Data Analysis of copy number alterations in cancer

Identifying subgroups and properties of cancer biopsy samples is a cruci...
research
09/06/2023

EvoCLINICAL: Evolving Cyber-Cyber Digital Twin with Active Transfer Learning for Automated Cancer Registry System

The Cancer Registry of Norway (CRN) collects information on cancer patie...

Please sign up or login with your details

Forgot password? Click here to reset