A Linear Algebra Approach to Fast DNA Mixture Analysis Using GPUs

by Siddharth Samsi, et al.

Analysis of DNA samples is an important step in forensics, and the speed of analysis can impact investigations. Comparison of DNA sequences is based on the analysis of short tandem repeats (STRs), which are short DNA sequences of 2-5 base pairs. Current forensics approaches use 20 STR loci for analysis. Single nucleotide polymorphisms (SNPs) are useful for the analysis of complex DNA mixtures. However, using tens of thousands of SNP loci poses significant computational challenges, because the cost of the forensic analysis scales with the product of the loci count and the number of DNA samples to be analyzed. In this paper, we discuss the implementation of a DNA sequence comparison algorithm by re-casting the algorithm in terms of linear algebra primitives. By developing an overloaded matrix multiplication approach to DNA comparisons, we can leverage advances in GPU hardware and algorithms for Dense Generalized Matrix-Multiply (DGEMM) to speed up DNA sample comparisons. We show that it is possible to compare 2048 unknown DNA samples with 20 million known samples in under 6 seconds using an NVIDIA K80 GPU.
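
The idea of an overloaded matrix multiplication can be illustrated with a small sketch. The abstract does not spell out the encoding or the operators used, so everything here is an assumption made for illustration: each profile is a list of 0/1 flags (one per SNP locus), the function name `generalized_matmul` is hypothetical, and XOR is chosen as the replacement for multiplication so that each output entry counts the loci at which two profiles differ. It is plain Python on the CPU, not the GPU implementation the paper describes.

```python
from operator import add, mul, xor


def generalized_matmul(A, B, combine=mul, accumulate=add, zero=0):
    """Return C with C[i][j] = accumulate over k of combine(A[i][k], B[j][k]).

    Rows of A and rows of B are profiles over the same loci, so this is
    A times B-transposed with (combine, accumulate) replacing (*, +).
    """
    C = []
    for a in A:
        row = []
        for b in B:
            if len(a) != len(b):
                raise ValueError("profiles must cover the same number of loci")
            acc = zero
            for x, y in zip(a, b):
                acc = accumulate(acc, combine(x, y))
            row.append(acc)
        C.append(row)
    return C


# Hypothetical toy data: 2 unknown and 3 known profiles over 6 loci.
unknown = [
    [1, 0, 1, 1, 0, 0],
    [0, 0, 1, 0, 1, 0],
]
known = [
    [1, 0, 1, 1, 0, 0],
    [0, 1, 1, 0, 1, 1],
    [0, 0, 0, 0, 0, 0],
]

# "Overloaded" product: XOR in place of multiply counts differing loci.
mismatches = generalized_matmul(unknown, known, combine=xor)

# Ordinary product: counts loci where both profiles carry the flag.
shared = generalized_matmul(unknown, known)

print(mismatches)  # [[0, 5, 3], [3, 2, 2]]
print(shared)      # [[3, 1, 0], [1, 2, 0]]
```

Each output matrix has one entry per (unknown, known) pair, and each entry touches every locus, which is the product scaling the abstract mentions. At the reported problem size, 2048 unknown samples against 20 million known samples is 2048 × 20,000,000 ≈ 4.1 × 10^10 pairwise comparisons.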



Applications and Challenges of Real-time Mobile DNA Analysis

The DNA sequencing is the process of identifying the exact order of nucl...

Correlation over Decomposed Signals: A Non-Linear Approach to Fast and Effective Sequences Comparison

A novel non-linear approach to fast and effective comparison of sequence...

LRDB: LSTM Raw data DNA Base-caller based on long-short term models in an active learning environment

The first important step in extracting DNA characters is using the outpu...

Inference about complex relationships using peak height data from DNA mixtures

In both criminal cases and civil cases there is an increasing demand for...

Machine Learning Prediction of DNA Charge Transport

First-principles calculations of charge transfer in DNA molecules are co...

Analysis of a DNA mixture case involving Romani reference populations

Here we present an Italian criminal case that shows how statistical meth...

Using Deep Learning to Detect Digitally Encoded DNA Trigger for Trojan Malware in Bio-Cyber Attacks

This article uses Deep Learning technologies to safeguard DNA sequencing...
