A novel framework to quantify uncertainty in peptide-tandem mass spectrum matches with application to nanobody peptide identification

10/15/2021
by   Chris McKennan, et al.
0

Nanobodies are small antibody fragments derived from camelids that selectively bind to antigens. These proteins have marked physicochemical properties that support advanced therapeutics, including treatments for SARS-CoV-2. To realize their potential, bottom-up proteomics via liquid chromatography-tandem mass spectrometry (LC-MS/MS) has been proposed to identify antigen-specific nanobodies at the proteome scale, where a critical component of this pipeline is matching nanobody peptides to their begotten tandem mass spectra. While peptide-spectrum matching is a well-studied problem, we show the sequence similarity between nanobody peptides violates key assumptions necessary to infer nanobody peptide-spectrum matches (PSMs) with the standard target-decoy paradigm, and prove these violations beget inflated error rates. To address these issues, we then develop a novel framework and method that treats peptide-spectrum matching as a Bayesian model selection problem with an incomplete model space, which are, to our knowledge, the first to account for all sources of PSM error without relying on the aforementioned assumptions. In addition to illustrating our method's improved performance on simulated and real nanobody data, our work demonstrates how to leverage novel retention time and spectrum prediction tools to develop accurate and discriminating data-generating models, and, to our knowledge, provides the first rigorous description of MS/MS spectrum noise.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2023

De-novo Identification of Small Molecules from Their GC-EI-MS Spectra

Identification of experimentally acquired mass spectra of unknown compou...
research
02/03/2019

GA-Novo: De Novo Peptide Sequencing via Tandem Mass Spectrometry using Genetic Algorithm

Proteomics is the large-scale analysis of the proteins. The common metho...
research
03/23/2022

DPST: De Novo Peptide Sequencing with Amino-Acid-Aware Transformers

De novo peptide sequencing aims to recover amino acid sequences of a pep...
research
11/08/2021

MassFormer: Tandem Mass Spectrum Prediction with Graph Transformers

Mass spectrometry is a key tool in the study of small molecules, playing...
research
01/26/2023

Efficiently predicting high resolution mass spectra with graph neural networks

Identifying a small molecule from its mass spectrum is the primary open ...
research
10/29/2014

Faster graphical model identification of tandem mass spectra using peptide word lattices

Liquid chromatography coupled with tandem mass spectrometry, also known ...

Please sign up or login with your details

Forgot password? Click here to reset