Genetic Analysis of Prostate Cancer with Computer Science Methods

03/28/2023
by Yuxuan Li, et al.

Metastatic prostate cancer is one of the most common cancers in men. In the advanced stages of prostate cancer, tumours can metastasise to other tissues in the body, which is fatal. In this thesis, we perform a genetic analysis of prostate cancer tumours at different metastatic sites using data science, machine learning and topological network analysis methods. We present a general procedure for pre-processing gene expression datasets and pre-filtering significant genes by analytical methods. We then use machine learning models for further key gene filtering and for classifying tumours by secondary site. Finally, we perform gene co-expression network analysis and community detection on samples from different prostate cancer secondary site types. In this work, 13 of the 14,379 genes were selected as the genes most related to metastatic prostate cancer, achieving approximately 92% accuracy under cross-validation. In addition, we provide preliminary insights into the co-expression patterns of genes in gene co-expression networks. Project code is available at https://github.com/zcablii/Master_cancer_project.
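
To make the final step of the abstract more concrete, the following is a minimal sketch of how a gene co-expression network can be built and split into groups. It is not the authors' implementation: the use of Pearson correlation, the 0.9 threshold, the gene names, and the expression values are all assumptions chosen for illustration, and connected components are used here only as a simplified stand-in for a real community detection algorithm.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    # A constant expression profile has no defined correlation; treat it as 0.
    return cov / (sx * sy) if sx and sy else 0.0


def coexpression_edges(expr, threshold=0.9):
    """Link two genes when |r| across samples meets the (assumed) threshold."""
    return [
        (g, h)
        for g, h in combinations(sorted(expr), 2)
        if abs(pearson(expr[g], expr[h])) >= threshold
    ]


def components(genes, edges):
    """Group genes into connected components of the co-expression graph.

    This is a simplified stand-in for community detection, not a
    modularity-based method.
    """
    adjacency = {g: set() for g in genes}
    for g, h in edges:
        adjacency[g].add(h)
        adjacency[h].add(g)
    seen, groups = set(), []
    for start in sorted(genes):
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(adjacency[node] - group)
        seen |= group
        groups.append(sorted(group))
    return groups


# Hypothetical toy data: gene name -> expression value per sample.
toy_expr = {
    "GENE_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "GENE_B": [2.1, 3.9, 6.2, 8.0, 9.9],
    "GENE_C": [5.0, 1.0, 4.0, 2.0, 3.0],
    "GENE_D": [4.9, 1.2, 4.1, 1.8, 3.1],
}
edges = coexpression_edges(toy_expr, threshold=0.9)
modules = components(toy_expr, edges)
print(edges)
print(modules)
```

On this toy input, GENE_A and GENE_B rise together and GENE_C and GENE_D share a second pattern, so the graph splits into two groups. On real data, the threshold choice and the grouping algorithm would both need to be justified against the dataset.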


