Exponential family measurement error models for single-cell CRISPR screens

01/06/2022
by   Timothy Barry, et al.
0

CRISPR genome engineering and single-cell RNA sequencing have transformed biological discovery. Single-cell CRISPR screens unite these two technologies, linking genetic perturbations in individual cells to changes in gene expression and illuminating regulatory networks underlying diseases. Despite their promise, single-cell CRISPR screens present substantial statistical challenges. We demonstrate through theoretical and real data analyses that a standard method for estimation and inference in single-cell CRISPR screens – "thresholded regression" – exhibits attenuation bias and a bias-variance tradeoff as a function of an intrinsic, challenging-to-select tuning parameter. To overcome these difficulties, we introduce GLM-EIV ("GLM-based errors-in-variables"), a new method for single-cell CRISPR screen analysis. GLM-EIV extends the classical errors-in-variables model to responses and noisy predictors that are exponential family-distributed and potentially impacted by the same set of confounding variables. We develop a computational infrastructure to deploy GLM-EIV across tens or hundreds of nodes on clouds (e.g., Microsoft Azure) and high-performance clusters. Leveraging this infrastructure, we apply GLM-EIV to analyze two recent, large-scale, single-cell CRISPR screen datasets, demonstrating improved performance in challenging problem settings.

READ FULL TEXT

page 7

page 12

research
02/13/2022

Robust Statistical Inference for Cell Type Deconvolution

Cell type deconvolution is a computational approach to infer proportions...
research
10/31/2022

CausalBench: A Large-scale Benchmark for Network Inference from Single-cell Perturbation Data

Mapping biological mechanisms in cellular systems is a fundamental step ...
research
04/04/2021

SimCD: Simultaneous Clustering and Differential expression analysis for single-cell transcriptomic data

Single-Cell RNA sequencing (scRNA-seq) measurements have facilitated gen...
research
05/06/2020

Cell Type Identification from Single-Cell Transcriptomic Data via Semi-supervised Learning

Cell type identification from single-cell transcriptomic data is a commo...
research
05/04/2020

Computational modelling in single-cell cancer genomics: methods and future directions

Single-cell technologies have revolutionized biomedical research by enab...
research
04/28/2022

Predicting single-cell perturbation responses for unseen drugs

Single-cell transcriptomics enabled the study of cellular heterogeneity ...
research
09/26/2016

Connecting the dots across time: Reconstruction of single cell signaling trajectories using time-stamped data

Single cell responses are shaped by the geometry of signaling kinetic tr...

Please sign up or login with your details

Forgot password? Click here to reset