Analysis of a Two-Stage Negative Binomial Group Testing Model for Estimating the Prevalence of a Rare Trait ()
1. Introduction
The standard method of screening individuals for the presence or absence of a rare trait of interest (e.g. disease) is uneconomical, especially when the target population is large, and the prevalence is low. A feasible strategy is to pool individuals into groups which are then tested as units. Group testing suggests a considerable amount of savings in efficiency, and the number of tests performed compared to individual testing if rightly applied. The idea dates back to World War II and was applied to screen for syphilis antigen among army inductees [1] .
In statistical literature, group testing literature splits into two distinct areas. The first one is classification whose objective is to identify the positive units having a trait of interest. The first documented work in classification was applied to identify syphilitic men that were called for army induction during World War II [1] . The pooling strategy achieved a significant amount of savings by reducing the number of tests needed by testing groups of blood samples as opposed to testing individual blood samples of the army recruits. Since its inception, different authors have extended and generalized the procedure. Hierarchical testing schemes were shown to achieve greater savings [2] [3] . The procedure has been applied in a variety of fields such as in drug discovery [4] , quality control processes [5] , and concealing the identity of individuals when screening for HIV antibodies [6] .
This paper will focus on the second objective which is on estimation of the prevalence of a rare trait. Research in public health shows that when examining diseases transmitted by insect vectors even if disease identification is the main focus, it is equally important to estimate the prevalence (p) of a pathogen to curb and prevent an outbreak [7] . The first work in estimation was developed by [8] to estimate the proportion of insect vectors that are capable of transmitting the aster-yellow virus. In recent studies, different authors have extended [8] work by considering multi-stage pool testing schemes [9] [10] [11] . Retesting of pools when imperfect diagnostic tests are used has been examined [9] [12] [13] and sample size procedures for estimating the adventitious presence (AP) of transgenic plants in a population have been examined [14] [15] . Their research has largely developed under the binomial model which presumes a fixed number of groups to be tested for a trait of interest. In situations where an unknown number of pools are to be tested until a predetermined number of positive pools are observed, calls for the negative binomial model to be considered.
A procedure called inverse sampling is of great importance when sampling biological samples. It entails continuous sampling until a fixed number of samples having a trait of interest are observed. Inverse sampling was first used to estimate the frequencies of an attribute in a population [16] . A combination of inverse sampling and group testing was suggested to be viable in determining the infection rate of diseases transmitted by insect vectors [17] . It was noted that the sources of errors in group testing designs were attributed to the sampling process and not the pooling process. Thus, if sampling and pooling are done correctly, more desirable results can be achieved. The procedure was established to be suitable in situations that prompt immediate responses like disease outbreaks and natural disasters. Early detection of infectious diseases and quick countermeasures are important to mitigate the effect of an outbreak [18] . Inverse (negative) binomial group testing was applied to assess the transmission of the parasitic worm Onchocerciasis volvulus by blackflies which is responsible for causing blindness and skin diseases [19] . Several point estimators that minimized the bias of the estimator were considered. The results showed the potential applicability of the model in the screening of West Nile Virus (WNV) and Foot and Mouth Disease (FMD) [20] .
The foundation in estimation under the inverse binomial group testing model was established under the assumption that perfect diagnostic tests and pools of equal sizes are used [20] . The MLE of the proportion p was established to be positively biased, and an almost unbiased estimator within a region of interest was developed by applying a suitable correction method [7] . A comparison of group sizes was considered [21] and applied in the detection of the AP of transgenic plants in a population. In their subsequent work [22] examined the sample size procedures for estimating AP by considering the dilution effect. The MLE was applied and sample sizes that guaranteed a narrow confidence width were obtained. The optimal group sizes that minimized the asymptotic variance of the estimator when imperfect tests are used have also been examined [23] . Sequential estimation under the negative binomial group testing model has been considered and the results established that an unbiased estimator does not exist [24] .
The negative binomial group testing model has largely developed under the postulation that perfect tests are used (i.e., pools are not misclassified). However, misclassification errors can occur in an experimental design when the diagnostic tests used are imperfect (i.e. the sensitivity and the specificity of the diagnostic test are less than 1). The focus has been to examine efficient estimators and to determine optimal group sizes. Different authors have examined the retesting of pools under the binomial model in group testing, and have established that retesting improves the efficiency of an estimator, and it recovers the sensitivity of the diagnostic tests during estimation [9] [12] [13] . It is on this background that this paper is developed.
In this paper, we construct and analyze a testing scheme developed under the negative binomial model that incorporates retesting of pools when imperfect tests are used. A pool that tests positive in the initial test is sequentially given a retest. The properties of the estimator are discussed and the efficiency of the constructed estimator relative to the one-stage negative binomial group testing model with misclassification was analyzed.
2. Simulation
A generalized Monte-Carlo simulation algorithm used in this study involved the following:
Step 1: Set fixed values of n, k, and p at known sensitivity and specificity values.
Step 2: Generate N independent data sets from negative binomial
that is
for
.
Step 3: Compute the numerical value of the test statistics T for each data set
.
Step 4: If N is large enough, summary statistics
should be a good approximation to the true sampling properties of the test condition under the conditions of interest.
The study simulated different data sets of pool sizes,
. The number of positive groups that tested positive on retest was fixed at
. The sensitivity and specificity values of the tests are assumed to remain constant throughout the entire study and are set at 99%, 98%, 95%, and 90%.
3. Point Estimation
The study assumes that the outcome of individual disease statuses are independently and identically distributed (i.i.d) Bernoulli random variable. We also assume that imperfect diagnostic tests with known sensitivity and specificity defined as
and
respectively were used. The testing process is performed sequentially until the nth positive pool that tests positive on retesting is observed.
Suppose that several pools of size k are sequentially tested for the presence of a rare trait in a population until the predetermined nth pool that tests positive on a retest is observed. The total number of pools tested is denoted by
where
is the number of groups sequentially tested until the lth positive group that tests positive on retesting is observed on a retest. Let
be 1 if the lth group tests positive and 0 otherwise for
. Let
be 1 if the lth group is truly positive and 0 otherwise for
. If the proportion of the rare trait in a given population is denoted by p, using the theory of probability and indicator function, then the probability that a pool tests positive on retesting is
(1)
based on the sampling scheme, T follows the negative binomial distribution with parameters n and
. Then the Likelihood function is
(2)
and the Maximum likelihood Estimator (MLE) of proportion p is
(3)
The table below shows the MLE of p for different values of p, n, and k when the sensitivity and specificity of the tests are known to be 99%.
From Table 1, it can be observed that for any fixed values of n and k, the MLE of p increases with an increase in the prevalence. Also, for any fixed values of k and p, the MLE of p decreases as the waiting parameter n increases. A close approximation of the prevalence is observed at low values of p when both k and n are large, but high values of p when both k and n are large are sufficient to overestimate the prevalence level. Thus, scrutiny of the table reveals that it is possible to get a combination of n and k values that gives a close approximation of the prevalence.
Table 1. The MLE of p for
;
.
Next, we plot the relationship between the MLE and the prevalence p for different values of k and n when the assays used are 99% accurate.
We investigate the relationship between
and p when the sensitivity and specificity of the test are 99%. A linear relationship exists between
and p when the group size
. However, when
, it can be observed in Figure 1 that the relationship is monotonic at any predetermined value of the waiting parameter n. A striking feature to note is that the relationship is sensitive to both k and n when the sensitivity and specificity of the diagnostic test are known.
Figure 1. The relationship between
and p for k = 5, 15, 30 and n = 1, 5, 10 and 15.
4. Properties of the Estimator
In this section, the properties of the estimation such as the Biasedness, and the Mean Squared Error (MSE) are discussed.
4.1. Biasedness of the Estimator
The goal in estimation is to find an estimate of the proportion p in a population of interest. The bias of an estimator measures the average error incurred when using the estimate of a parameter. It was pointed out that bias is particularly useful in evaluating point estimates [8] . The exact bias of an estimator is given by:
(4)
Because
where
. The exact bias and the MSE can be expressed as an infinite sum. It was noted that the sums do not reduce to anything tractable [20] . Thus, the bias of an estimator
can be approximated as follows:
(5)
where
for v small. The value of
was taken throughout, making these approximations very close to the true values of bias and MSE.
Table 2 shows that for any fixed values of the group size and the waiting parameter n, the absolute bias of the estimator increases to a maximum as the optimal value of the prevalence is attained, and afterward decreases as the prevalence increases. Conversely, for a given group size and prevalence level, the absolute bias of the estimator decreases as the waiting parameter n increases. The results show that it is possible to estimate the bias of the estimator at a given level of p for a set of n and k values. For example, when
, the minimum bias can be observed at
and
.
Scrutiny of Table 3 shows that when the diagnostic tests are 95% accurate, the absolute bias of the estimator increases to a maximum as the optimal value of the prevalence is attained and afterward decreases for fixed group sizes and waiting
Table 2. Bias of
for various values of p with n = 1, 3, 5, 10, 15 and k = 5, 10, 30, 50 when the sensitivity and specificity of the tests are set at 99%.
Table 3. Bias of
for various values of p with n = 1, 3, 5, 10, 15 and k = 5, 10, 30, 50 when the sensitivity and specificity of the tests are set at 95%.
parameter n. Secondly, when the group size and the prevalence levels are fixed, the absolute bias decreases to a minimum value as the optimal value of the waiting parameter n is attained, and afterward, it increases. Lastly, the results show that one can estimate the prevalence level at a predetermined optimal value of the waiting parameter n that would register the least bias when the group size is known. For example, when
and
the least bias is observed at
.
4.2. Mean Squared Error of the Estimator
The Mean squared error of an estimator is the average squared deviation derived from the true value of the parameter, which incorporates measures of both accuracy (bias) and the precision (variance) of the estimator. It is used as a measure for the goodness of an estimator that is given by:
(6)
The variance of
as annexed in Appendix section can easily be shown to be
(7)
The MSE can also be expressed as an infinite sum, which was shown that they do not reduce to anything tractable [20] . Thus, the MSE of an estimator
can be approximated as
(8)
where
for v small. The value of
was taken throughout, making these approximations close to the true values of bias and MSE.
Since the sums do not reduce to anything tractable, we use Monte Carlo simulations to generate the MSE tables as outlined in Section 2.
Scrutiny of Table 4 shows that when the group size and the waiting parameter n are fixed, the MSE of
increases to a maximum value as the optimal value of the prevalence is attained before subsequently decreasing. It can also be noted that for any fixed values of the prevalence and group size, the MSE of the estimator decreases as the waiting parameter, n increases. Therefore, it is possible to get the minimum MSE of the prevalence from a combination of n and k when the sensitivity and specificity of the tests are 99%. For instance, when p = 0.005 the minimum value of MSE was obtained at n = 15 and k = 10.
When the assay used is 95% accurate, it can be seen from Table 5 that for any fixed values of the group size and the waiting parameter n, the MSE of the estimator increases to a maximum as the optimal value of the prevalence is attained and afterward decreases. Secondly, when both the group size and the prevalence rate are fixed, the MSE of the estimator decreased to a minimum as the optimal
Table 4. MSE of
for various values of p with n = 1, 3, 5, 10, 15 and k = 5, 10, 30, 50, and when the sensitivity and specificity of the tests are set at 99%.
Table 5. MSE of
for various values of p with n = 1, 3, 5, 10, 15 and k = 5, 10, 30, 50, and when the sensitivity and specificity of the tests are set at 95%.
value of the waiting parameter, n is attained and afterward increases. Finally, the results show that one can obtain a combination of n and k values that would yield a minimum approximation of the MSE at a given prevalence. For instance, the minimum value of the MSE is obtained at
when
and
.
The behavior of the MSE is investigated by plotting
against p for different k and n when the tests used are 99% accurate as shown in the figures below.
Next, the relationship between
and the prevalence is examined for different k and n when the tests used are accurate as presented below.
Figure 2 shows the MSE of the estimator plotted against the prevalence for different values of the waiting parameter n, and group sizes obtained by simulation. A striking feature to note is that the MSE is sensitive to the group size and the waiting parameter n. Also, the maximum value of the MSE increases with an increase in the group size at any predetermined waiting parameter n.
Figure 2. Plot of
as a function of p for different k = 5, 15, 30 and n = 1, 5, 10, 15 with sensitivity and specificity fixed at 0.99 when the sensitivity and specificity of the tests are set at 99%.
5. Model Comparison
In this section, the efficiency of the constructed estimator relative to other estimators in pool testing scheme was determined by computing the ARE and the RMSE and the results are discussed.
5.1. Asymptotic Relative Efficiency (ARE)
The sample size based on the one-stage negative binomial group testing scheme that has considered misclassification was examined by [23] . The variance of the estimator was computed as follows:
(9)
If the estimator of the one-stage negative binomial group testing scheme with misclassification is denoted by
, and the computed estimator is denoted
then
(10)
Therefore,
implies that the proposed model is more efficient than the one-stage negative binomial group testing model with misclassification.
Table 6. ARE of the proposed model relative to the one-stage negative binomial group testing model with misclassification for k = 5, 10, 15, 30, 50 with sensitivity and specificity of the tests set at 99%, 98%, and 95%.
Scrutiny of Table 6 shows that the sequential retesting of a pool that tests positive in the initial test infers that the two-stage negative binomial model is more efficient, especially at low prevalence. This is indicated by
It can also be observed that at a given prevalence of a rare trait, the efficiency of the proposed model decreases with an increase in group size. Similarly, for a fixed group size, the efficiency of the model decreases with an increase in prevalence. Finally, it can be observed that even when the accuracy of the diagnostic test is relatively low, retesting improves the efficiency of the model in a low-prevalence population. Thus, in cases where group testing is used to screen for a rare trait in a low prevalence population, retesting of pools is more desirable as it yields more accurate results when the accuracy of the diagnostic tests is less than 100%.
5.2. Relative Mean Squared Error (RMSE)
To compare the efficiency of a model, a convenient way is to compare the MSE of estimates of the same p with other existing models obtained using different experimental procedures. The MSE of the proposed model is compared with the MSE of the one-stage negative binomial group testing model with misclassification that is denoted by
. The RMSE is calculated as follows:
(11)
Table 7. Relative mean squared error of the estimator (RMSE) with sensitivity and specificity value set at 99%.
Scrutiny of Table 7 shows that for small group sizes, the proposed model outperforms the one-stage negative binomial group testing model with misclassification when the prevalence is low. This is indicated by the value of
. For instance, when
, and
, the two-stage negative binomial group testing model is observed to be 38.6 times more efficient than the one-stage negative binomial group testing model at
. In general, the proposed model is more efficient in a low-prevalence population when the size of the group is fairly small. Lastly, it can be observed that as the group size and the waiting parameter n increases, the efficiency of the model increases in a low-prevalence population.
Table 8. Relative mean squared error of the estimator (RMSE) at 95% sensitivity and specificity value.
Table 8 shows that the two-stage negative binomial group testing model outperforms the one-stage negative binomial group testing model with misclassification when the prevalence of a rare trait is low. It can be observed that the efficiency of our model increases with an increase in group size and the waiting parameter n, especially in low prevalence. This indicates that the asymptotic properties of the model are taking hold. Thus, it is possible to get a combination of k and n values where the model would be more efficient for a given prevalence. For instance, when
and
, the model is approximately 7446 more efficient for
.
6. Application of the Model to Real Data
The testing scheme was also applied to an investigation that involved the surveillance of West Nile Virus (WNV) conducted in Jefferson County by [25] in Florida following the 2001 outbreak of WNV transmitted by the North American mosquito, Culex nigripalpus. The authors documented the first field study on the mosquito transmission rate of WNV which was used by [20] to illustrate their procedure. A total of 11948 mosquitos were captured in the surveillance program and tested in various pool sizes using reverse-transcription polymerase chain reaction assays. A total of 14 mosquito pools tested positive for WNV, and by the end of the outbreak, 12 human cases were reported to have West Nile Meningoencephalitis and 483 documented cases among the horses.
The investigation by [25] did not consider inverse sampling when imperfect tests are used. However, this investigation was used as a basis to illustrate our testing procedures when imperfect tests with known sensitivity and specificity values are used. Based on the field study by [25] that focused on pools that tested positive, the estimated prevalence was reported to be approximately 0.005. Hence it was presumed that 0.005 was the true value of the prevalence. We have simulated data sets to illustrate our procedure for the waiting parameters n = 1, 5, 10, and 15 at equal group sizes k = 5, 15, 30, and 50 similar to [25] when the diagnostic tests are 99%, 95%, and 90% accurate.
Table 1 presents the results of the MLE of the population proportion p based on 10,000 Monte Carlo data sets. The performance of the constructed estimator overestimated the population proportion p as the prevalence increased. The MLE of the constructed estimator reaffirms the discovery that group testing is useful in a low-prevalence population. When both the group size k and the waiting parameter n are large, close estimates of the MLE were observed at
which is consistent with the estimated prevalence level reported by [25] . When the waiting parameter is small say
, the MLE was observed to be exorbitantly positively biased as the prevalence level increased as observed in Table 2 and Table 3. Alternative estimators have been developed by different scholars to reduce the bias when the waiting parameter n is small [7] [20] . Gart’s bias correction to the MLE was recommended by [26] as an effective estimator in reducing the bias when perfect tests are used. A striking feature to note is that the MSE is sensitive to the group size and the waiting parameter n as shown in Figure 2.
To access the efficiency of the testing scheme, the performance of the constructed estimator was compared to other estimators by computing the ARE and RMSE. The results showed that in lower values of p, the two-stage negative binomial group testing model was superior to the one-stage negative binomial group testing model with misclassification suggested by [23] . This can be observed in Table 6 where
and in Table 7 and Table 8 where
. Moreover, the proposed model was established to be efficient even in situations where the accuracy of the diagnostic tests are slightly low. Thus, retesting of positive pools improved the efficiency of the constructed estimator, which agrees with what other authors have already established [9] [12] [27] .
7. Conclusion
A two-stage negative binomial group testing procedure for estimating the prevalence of a rare trait has been constructed and analyzed. From the discussions, the two-stage negative binomial group testing model is superior to the one-stage negative binomial group testing model with misclassification. The constructed estimator performed better at a low prevalence level. Also, it performed better in slightly low values of sensitivity and specificity (95%) than at higher values of the diagnostic tests used that are 99% accurate. Thus, we recommend the retesting of pools in negative binomial group testing designs when imperfect diagnostics tests with known sensitivity and sensitivity are used during estimation.
Appendix
The log-likelihood function to base 10 is
The first-order derivative of the log maximum Likelihood is
The second-order derivative is
Based on the testing process, T follows a negative binomial distribution with parameters n and
. Therefore, the expectation of T is
, then
Since
, we have
.