An examination of the generalised pooled binomial distribution and its information properties

08/10/2021
by Ben O'Neill, et al.

This paper examines the statistical properties of a distributional form that arises from pooled testing for the prevalence of a binary outcome. Our base distribution is a two-parameter distribution with a prevalence parameter and an excess intensity parameter; the latter allows for a dilution or intensification effect in larger pools. We also examine a generalised form of the distribution in which pools have covariate information that affects the prevalence through a linked linear form. We study the generalised pooled binomial distribution both in its own right and as a special case of broader binomial GLMs that use the complementary log-log link function. We examine the information function and show the information content of individual sample items. We demonstrate that pooling reduces the information content of sample units, and we give simple heuristics for choosing an "optimal" pool size for testing. We derive the log-likelihood function and its derivatives and give results for maximum likelihood estimation. We also discuss diagnostic testing of the positive pool probabilities, including testing for intensification or dilution in the testing mechanism. We illustrate the use of this distribution by applying it to pooled testing data on virus prevalence in a mosquito population.
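To make the connection between pooled testing and the complementary log-log link concrete, the following minimal sketch covers only the simplest case: a perfect test, independent items, equal pool sizes, and no dilution or intensification effect. It does not reproduce the paper's two-parameter form, and all function names are hypothetical. Under these assumptions a pool of size n is positive with probability 1 - (1 - p)^n, so the complementary log-log of that probability equals log(n) plus the complementary log-log of the prevalence p.

```python
import math


def positive_pool_probability(prevalence, pool_size):
    """Probability that a pool tests positive.

    Assumes a perfect test, independent items, and no dilution or
    intensification effect (an illustrative simplification).
    """
    return 1.0 - (1.0 - prevalence) ** pool_size


def cloglog(prob):
    """Complementary log-log transform: log(-log(1 - prob))."""
    return math.log(-math.log(1.0 - prob))


def prevalence_mle(positive_pools, total_pools, pool_size):
    """Maximum likelihood estimate of prevalence for equal-sized pools.

    Obtained by inverting the positive pool probability at the observed
    proportion of positive pools, under the same assumptions as above.
    """
    pool_rate = positive_pools / total_pools
    return 1.0 - (1.0 - pool_rate) ** (1.0 / pool_size)


if __name__ == "__main__":
    p, n = 0.02, 10
    pool_prob = positive_pool_probability(p, n)
    # The pool size enters the cloglog scale as an additive offset log(n).
    print(cloglog(pool_prob), math.log(n) + cloglog(p))
    # Hypothetical data: 3 positive pools out of 20 pools of size 5.
    print(prevalence_mle(3, 20, 5))
```

Because the pool size enters as the additive offset log(n) on the complementary log-log scale, this simple case fits naturally inside a binomial GLM with that link.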


