Bacteriophage classification for assembled contigs using Graph Convolutional Network

02/07/2021
by Jiayu Shang, et al.

Motivation: Bacteriophages (aka phages), which mainly infect bacteria, play key roles in the biology of microbes. Phages are the most abundant biological entities on the planet, and those discovered so far are only the tip of the iceberg. Recently, many new phages have been revealed using high-throughput sequencing, particularly metagenomic sequencing. Taxonomic classification of phages lags seriously behind the fast accumulation of phage-like sequences. The high diversity and abundance of phages, together with the limited number of known phages, pose great challenges for taxonomic analysis. In particular, alignment-based tools have difficulty classifying the fast-accumulating contigs assembled from metagenomic data.

Results: In this work, we present a novel semi-supervised learning model, named PhaGCN, to conduct taxonomic classification for phage contigs. In this learning model, we construct a knowledge graph by combining the DNA sequence features learned by a convolutional neural network (CNN) and the protein sequence similarity obtained from a gene-sharing network. We then apply a graph convolutional network (GCN) that uses both the labeled and unlabeled samples in training to enhance the learning ability. We tested PhaGCN on both simulated and real sequencing data. The results show that our method competes favorably against available phage classification tools.
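To make the semi-supervised idea concrete, the following is a minimal sketch of a two-layer graph convolution with a loss computed only on labeled nodes. It is not the PhaGCN implementation. It assumes the widely used symmetric-normalized propagation rule (adjacency with self-loops, scaled by node degree), which the abstract does not specify, and every value in it is invented for illustration: the four-node graph stands in for contigs linked in the knowledge graph, `features` stands in for CNN-learned sequence features, and `w1`/`w2` are arbitrary untrained weights.

```python
import math

def normalize_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2 for a symmetric adjacency matrix."""
    n = len(adj)
    # Add self-loops so each node keeps its own features during propagation.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def relu(v):
    return max(0.0, v)

def gcn_layer(a_norm, features, weights, activation=None):
    """One graph convolution: aggregate neighbor features, then project."""
    out = matmul(matmul(a_norm, features), weights)
    if activation is not None:
        out = [[activation(v) for v in row] for row in out]
    return out

def softmax(row):
    m = max(row)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def masked_cross_entropy(logits, labels):
    """Average cross-entropy over labeled nodes; None marks an unlabeled node."""
    losses = [-math.log(softmax(row)[y])
              for row, y in zip(logits, labels) if y is not None]
    return sum(losses) / len(losses)

# Hypothetical toy graph: 4 contigs connected in a chain 0-1-2-3.
adj = [[0, 1, 0, 0],
       [1, 0, 1, 0],
       [0, 1, 0, 1],
       [0, 0, 1, 0]]
# Hypothetical 2-dimensional node features (placeholders for learned features).
features = [[1.0, 0.0], [0.8, 0.2], [0.2, 0.8], [0.0, 1.0]]
# Arbitrary, untrained weights for the two layers.
w1 = [[0.5, -0.3], [-0.2, 0.7]]
w2 = [[1.0, -1.0], [-1.0, 1.0]]
# Only nodes 0 and 3 carry a taxonomic label; nodes 1 and 2 are unlabeled.
labels = [0, None, None, 1]

a_norm = normalize_adjacency(adj)
hidden = gcn_layer(a_norm, features, w1, activation=relu)
logits = gcn_layer(a_norm, hidden, w2)
loss = masked_cross_entropy(logits, labels)
predictions = [max(range(len(row)), key=row.__getitem__) for row in logits]
```

Although the loss is computed on nodes 0 and 3 only, their logits depend on the features of the unlabeled neighbors through `a_norm`, which is how unlabeled samples can contribute to training in this kind of model. In practice a library implementation with trainable weights and an optimizer would replace the hand-written layers.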
