Accurate interval estimation for the risk difference in an incomplete correlated 2 × 2 table: Calf immunity analysis

Hezhi Lu; Fengjing Cai; Yuan Li; Xionghui Ou

doi:10.1371/journal.pone.0272007

Abstract

Interval estimation with accurate coverage for risk difference (RD) in a correlated 2 × 2 table with structural zero is a fundamental and important problem in biostatistics. The score test-based and Bayesian tail-based confidence intervals (CIs) have good coverage performance among the existing methods. However, as approximation approaches, they have coverage probabilities lower than the nominal confidence level for finite and moderate sample sizes. In this paper, we propose three new CIs for RD based on the fiducial, inferential model (IM) and modified IM (MIM) methods. The IM interval is proven to be valid. Moreover, simulation studies show that the CIs of fiducial and MIM methods can guarantee the preset coverage rate even for small sample sizes. More importantly, in terms of coverage probability and expected length, the MIM interval outperforms other intervals. Finally, a real example illustrates the application of the proposed methods.

Citation: Lu H, Cai F, Li Y, Ou X (2022) Accurate interval estimation for the risk difference in an incomplete correlated 2 × 2 table: Calf immunity analysis. PLoS ONE 17(7): e0272007. https://doi.org/10.1371/journal.pone.0272007

Editor: Yasunori Sato, Keio University School of Medicine, JAPAN

Received: March 31, 2022; Accepted: July 11, 2022; Published: July 22, 2022

Copyright: © 2022 Lu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data are available from the book: Agresti A. Categorical Data Analysis. New York: Wiley; 1990.R codes are available in the appendix.

Funding: This work was supported by the China Postdoctoral Science Foundation (2021M690774) and National Natural Science Foundation of China (11731015).

Competing interests: The authors have declared that no competing interests exist.

Introduction

An incomplete correlated 2 × 2 table exists in various biological studies, clinical trials and epidemiological studies [1] used this data structure to evaluate penicillin allergy [2,3] studied tuberculosis skin tests in a two-step procedural design. A typical example is given by [4] involving calf immunity data. Calves were first classified according to whether they had a primary pneumonia infection and then reclassified according to whether they developed a secondary infection within a certain time period after the first infection cleared up [5]. Since the subject cannot have secondary infection when it is not infected at the first stage, the cells corresponding to secondary infection without primary infection will not appear. Table 1 lists the observed data and related probabilities.

Download:

Table 1. Data and probability for an incomplete 2×2 table.

https://doi.org/10.1371/journal.pone.0272007.t001

Suppose there is a sample of n subjects, let X₁₁ be the number of subjects infected in both stages, X₁₂ be the number of subjects who have a primary infection but do not have a secondary infection, and X₂₂ be the number of subjects who are not infected in both stages, so that X₁₊ = X₁₁+X₁₂ and X₁₊+X₂₂ = n. Here, p₁₁, p₁₂ and p₂₂ denote the probabilities of the corresponding cells, with p₁₊ = p₁₁+p₁₂ and p₁₊+p₂₂ = 1. After the first infection clears up, one wants to infer the likelihood of developing a secondary infection due to the effect of the primary infection. One common comparative measure of interest is the risk difference (RD) between the primary infection and the secondary infection, given the primary infection. The RD δ is defined by

From the frequentist point of view [6], considered confidence intervals (CIs) for the RD based on Wald’s test statistic, the likelihood ratio test and the basic principle of Fieller’s theorem. Although these three approaches behave well in many practical problems, the CIs derived from Wald’s test statistic, the likelihood ratio test and Fieller’s theorem fail to reach the preset confidence level threshold even in moderate sample sizes. As an alternative [2], proposed a score test-based CI and, although the score test statistic is undefined in one scenario, the score CI outperforms all other frequentist intervals in terms of the coverage probability. More literature on risk factors and some potential methods can be found in [6–15].

If a prior distribution is available for the unknown parameter, the Bayesian posterior distribution provides a meaningful summary of the uncertainty about the parameter. To date, the Bayesian approach has been widely used in the interval estimation of the proportions in correlated tables [16] investigated the performance of Bayesian intervals using different priors, and they found that the Jeffreys prior is comparable to the score test-based CI [17] used a Bayesian estimation of the false-negative rate in a clinical trial of a sentinel node biopsy. Moreover [5], used Dirichlet priors to construct the tail-based interval for the RD. Simulation studies showed that the Bayesian CI at the Jeffreys prior has a shorter expected length than the score test-based CI.

The Bayesian choice of priors is a powerful tool for inferring purposes, but different priors can lead to different posterior distributions. To eliminate the influence of the prior distribution on the inference result, the generalized fiducial inference [18,19] is asymptotically correct and works well in applications [20,21]. However, the determination of the fiducial distribution of parameters remains a problem. Unlike the fiducial method, CIs derived from inferential models (IMs) [22–24] can always guarantee nominal coverage for all sample sizes. The main difference between the IM and the fiducial inference is that the IM method always carries out probability calculation in the auxiliary variable space, which can ensure that its inference is strict and correct. Moreover, the IM theory of precise inference needs some improvements. For example [25–29], constructed a randomized IM inference for discrete proportions.

The frequentist methods can be undefined in some cases. Moreover, as two representative CIs, the score and Bayesian intervals cannot guarantee the nominal coverage probability for small to moderate sample sizes. The aim of this paper is to construct three new CIs with accurate coverage for the RD mainly based on the fiducial, IM and randomized IM approaches.

Existing methods

Score test-based CI

From Table 1, the observed vector (X₁₁,X₁₂) comes from the trinomial distribution model (1)

Let be the maximum likelihood estimate of p₁₊; then, the score test statistic [2] for hypothesis H₀:δ = δ₀ is given by

It is well-known that T_S(X₁₁,X₁₂,X₂₂,δ₀) is an asymptotically standard normal distribution under H₀. Given the significance level α, the two-sided 1−α approximate confidence limits (δ_L and δ_U) for δ can be computed by solving equations The score test is comparable to that of the likelihood ratio test [6], but the likelihood ratio test can be undefined in many scenarios while the score test statistic is undefined when X₁₁ = X₁₂ = 0.

Bayesian tail-based CI

The conditional probability function of (x₁₁,x₁₂) given (p₁₁,p₁₂) (p₁₁+p₁₂<1) is the trinomial distribution with probability function where x₁₁+x₁₂≤n. Recalling Bayes’ rule, if one chooses a prior that is conjugate to the likelihood, then the posterior will have the same form as the conjugate prior distribution. Clearly, the Dirichlet D₃(k₁,k₂,k₃) prior for the parameter (p₁₁,p₁₂) leads to the posterior being a Dirichlet-type distribution D₃(x₁₁+k₁,x₁₂+k₂,n−x₁₁−x₁₂+k₃) with the posterior density function

Shi, Sun and Bai [5] studied the symmetric Dirichlet prior k_i = k, i = 1,2,3 for the RD. They recommended the Jeffreys prior D₃(1/2,1/2,1/2). Moreover, the posterior distribution function of δ given (x₁₁,x₁₂) has the following expression (2) where and for m_i>0, i = 1,2,3,4 is defined by (3) where . For the tail-based CI with a significance level α, the lower and upper limits (L,U) are obtained by minimizing the following function (4) Note that mathematical calculations of the integral formulae (3) and function (4) require some specific algorithms, such as the Nelder–Mead algorithm.

New confidence intervals

The small sample properties for approximate score and Bayesian methods may not be calibrated for meaningful probabilistic inference. These two CIs have lower coverage probability in some cases. To date, fiducial and IM-based methods have been used in other inference problems where classic approaches cannot lead to valid inference. Therefore, we consider fiducial and IM solutions for the RD.

Fiducial CI

Fiducial inference is based entirely on the fiducial distribution of the parameters. We first review the fiducial framework [18]. Suppose that X is a random variable indexed by a parameter θ which is generated using the structural equation given by where a is a measurable function and U is an auxiliary variable with a known distribution. Note that the distribution of U is free of θ.

Let x and u be realization of X and U, respectively. The information provided by x and u about θ is encapsulated in the set Clearly, x = a(θ,u) is equivalent to Θ(x,u)≠∅. The fiducial argument involves replacing u with U’ to obtain the random set Θ(x,U’), where U’ is an independent copy of U. Since the fiducial distribution of θ is given by Then, we can easily derive a fiducial CI for θ.

From Table 1, if the condition is based on the number (x₁₁+x₁₂), such that all marginal totals are fixed, the probability of observing x₁₁ follows a binomial distribution. By derivation, letting T = X₁₁+X₁₂ and X = X₁₁, the probability function of (x₁₁,x₁₂) in (1) can be simplified as the product of two independent binomial distributions as follows: (5) where θ = p₁₁+p₁₂ and φ = p₁₁/(p₁₁+p₁₂).

Naturally, we have T~Bin(n,θ) and X|T = t~Bin(t,φ). Moreover, if we define the value of the random variable to be 1 if a trial results in success, and 0 otherwise, then the structural equation can be given by where I_A(⋅) is the indicator function and U_i, i = 1,…,n, and V_j, j = 1,…,t, are independent uniform(0,1) random variables.

Due to the discrete nature, event is equivalent to event . Similarly, event is equivalent to event . Given T = t and X = x, we have Moreover, the fiducial quantity for the RD is given by (6) Since we can easily obtain a Monte Carlo approximation for the distribution of the two endpoints, constructing a fiducial CI for δ is not problematic.

Remark 1. Compared to Bayesian inference, the fiducial idea may be more attractive because no prior distributions are needed. Moreover, the fiducial idea has been shown to be asymptotically correct for a single discrete population. However, there is little research on risk factors between two independent binomial distributions. As an extension, our research can fill this gap and is easily applied to other multiparameter inference problems. Furthermore, our simulation studies will show that the fiducial method can guarantee nominal coverage even for small sample sizes.

IM-based CI

Martin and Liu [22] proposed an IM framework for valid probabilistic inference. The IM starts with an association between data, parameters and auxiliary variables. Using the optimal predictive random set to predict the auxiliary variables, the IM produces a postdata probabilistic measure function of uncertainty about the unknown parameter. To summarize, the IM has the following three steps:

A-step: From an appropriate mapping a: x = a(θ,U), the IM associates the parameter θ with (x,u) for each possible pair to obtain a collection of sets Θ_x(u) of candidate values.

P-step: Given data x , suppose θ* is the true value of θ, there exists a u*, such that x = a(θ*,u*). Moreover, the true value u* is predicted with a valid predictive random set S(u). The validity condition ensures that S(u) will hit u* with large probability.

C-step: The A-step and P-step are combined to obtain a final random set of θ, that is, Θ_x(S(u)) = ∪_u∈S(u)Θ_x(u). Then, for any assertion A about the parameter of interest θ, the probabilities that bel_x(A) = P{Θ_x(S(u))⊂A} and pl_x(A) = P{Θ_x(S(u))⊄A^c} are computed as two measure functions of the available evidence in x supporting A.

The belief function bel_x(A) and the plausibility function pl_x(A) are known as the minimum and maximum probabilities that support the truth of assertion A. It is more convenient to report the plausibility function, which can easily be used to create frequentist procedures. To test the assertion A = {δ:δ = δ₀}, we reject H₀:δ = δ₀ if pl_x(A)≤α for a significance level α, and this plausibility function yields a two-sided 1−α IM CI {δ: pl_x(A)>α}.

Theorem 1 [22]. Suppose X~P_X|θ and let A be an assertion of interest, the IM with the plausibility function pl_X(A) is valid for assertion A if, for each α∈(0,1),

The resulting IM CI can guarantee the nominal coverage probability when the plausibility function pl_X(A) is said to be valid. Moreover, if "≤α" can be replaced by "= α", then the IM CI controls the coverage probability exactly at the confidence level 1−α.

Here, we construct a new IM CI for the RD. Let F_n,θ(⋅) denote the distribution function of T~Bin(n,θ). Martin and Liu [22] gave an association linking t, θ, and an auxiliary variable u~P_u as follows: (7)

Moreover, the association model (7) can be simplified as (8)

Similarly, the association for x, given φ, may be written as (9)

To derive an initial IM’s association for δ = θ−φ, we will take advantage of the well-known relationship between the binomial and beta distribution functions, that is, , where G_a,b(⋅) is the beta (a, b) distribution function. Furthermore, we can rewrite joint associations (8) and (9) as follows: where u and v are i.i.d. uniform(0,1) random variables. Hence, the A-step of the IM for δ is (10)

A-step: Let H_t−1,x(⋅) and H_t,x−1(⋅) be the distribution functions of the two endpoints in (10). The association step of IM for the RD is

P-step: We are interested in two-sided CIs. Following [22], for a singleton assertion A = {δ}, the default predictive random set for the auxiliary variable w, ,

C-step: Combine Θ_t,x(w) and S to obtain a random set for δ. Since or H_t−1,x(δ)<inf S,

we find that the corresponding plausibility function for a singleton assertion A = {δ} is where the ‘‘+” superscript denotes the positive part.

Theorem 2. According to Theorem 1, the plausibility function pl_t,x(A) of our IM method is valid for assertion A if, for each α∈(0,1),

Proof. Given any α∈(0,1), we have

Then, Hence, the proof is complete.

For any α∈(0,1), if pl_t,x(A)≤α, then the assertion A is wrong. Moreover, this plausibility function yields an IM 1−α CI {δ: pl_t,x(A)>α}, such that δ∈[δ_L,δ_U], where δ_L and δ_U satisfy

Remark 2. The IM CI has the same form as the generalized fiducial CI. However, these two intervals are obtained under different theoretical derivations, fiducial approaches and IM theories, respectively. While the general fiducial distributions in (6) may not be calibrated for meaningful probabilistic inference, IM provides meaningful probabilistic summaries of the information in data concerning the quantity of interest. Moreover, the IM CI derived from a valid plausibility function of IM can guarantee the nominal coverage probability for all sample sizes.

Modified IM CI

In general, the association model (10) of IM is an interval, which will result in a conservative CI. Some adjustments are needed to handle this discreteness. Inspired by the randomized IM idea [25], we consider a modified IM (MIM) approach to modify in Eq (10) to an accurate equation so that we can improve the accuracy of the candidate value of δ.

Theorem 3 [25]. Suppose Y~Bin(m,ϕ), let ω be uniformly distributed in (0,1), and Y and ω are independent. Then

For association (8), since the auxiliary variable u is in the interval , there exists a weight ω₁ such that (11) where ω₁ follows a uniform(0,1) distribution and is independent of t. Note that is a strictly decreasing function of θ, for every u,ω₁∈(0,1), we can obtain a unique solution from Eq (11). Moreover, let its distribution function be G_t(⋅); then, the association model for θ can be rewritten as follows: (12)

Similarly, for inequality (9), if ω₂~Unif(0,1) and is independent of x, then we obtain (13) Given x, for every v,ω₂∈(0,1), Eq (12) gives a unique solution . Let its distribution function be H_x(⋅); then, the association model for φ is (14)

A’-step: Based on (12) and (14), our new association step of the MIM for the RD δ = θ−φ is where K_t,x(⋅) is the distribution function of .

P’-step: For a singleton assertion A, the default predictive random set for the auxiliary variable w is

C’-step: To obtain a final random set for δ, and S are combined as where W~Unif(0,1). Then, we can compute the plausibility function of A, that is, .

Theorem 4. Let S~P_S be a valid predictive random set for W~Unif(0,1), that is, P_S(w∈S)≥_stUnif(0,1), where "≥_st" means “stochastically no smaller than”. If K_t,x(δ)~Unif(0,1) for (t,x)~P_(t,x)|δ for all δ, then the MIM method is valid.

Proof. Given any α∈(0,1), since K_t,x(δ)~Unif(0,1) for (t,x)~P_(t,x)|δ for all δ, and

Moreover, the predictive random set S~P_S is valid, that is, P_S(w∈S)≥_stUnif(0,1).

Hence,

The MIM inference is valid, by Theorem 1. Hence, the proof is complete.

The plausibility function of our MIM method yields a new CI , where δ_L and δ_U satisfy K_t,x(δ_L) = α/2 and K_t,x(δ_U) = 1−α/2.

Remark 3. Similar to the classical approaches, the Monte Carlo method is also an approximate solution. The main difference between our MIM approach and other approximations is that the accuracy of MIM depends on the repetition times N, but accuracies of other approximations depend on the sample size n. We recommend N = 1,000,000 in practical applications to assure that there is a greater than 95% probability of the absolute error being less than 0.001.

Simulation results

The fiducial and IM CIs have the same form. The score, Bayesian, fiducial and MIM approaches are approximations. We conduct some Monte Carlo simulations to assess the performance of fiducial and MIM intervals, and compare them to the score and Bayesian intervals. Since it is often difficult to obtain explicit expressions of the fiducial and MIM CIs, we suggest approximating these two CIs using the following Monte Carlo algorithms. R codes are available in the S1 Appendix.

Table 2 lists four 95% CIs for various combinations including some special cases of zero cells. We see that the Bayesian, fiducial and MIM intervals are well defined for all cases. However, when X₁₁ = X₁₂ = 0, the score interval does not exist and the fiducial and MIM CIs have shorter widths than the score and Bayesian CIs. Note that the expected lengths of the score, Bayesian and MIM intervals are almost the same when the sample size increases.

Algorithm 1: Fiducial CI (δ_α/2 , δ_1−α/2 )

Step 1. For the given sample (x₁₁,x₁₂) from the incomplete correlated 2 × 2 table,

t = x₁₁+x₁₂ and x = x₁₁ are calculated;

Step 2. Then, u₁,u₂,…,u_t+1 and v₁,v₂,…,v_x+1 are generated from a uniform(0,1) distribution,

and δ is calculated using ;

Step 3. Step 2 is repeated N times (1,000,000 for example) to obtain N realizations of δ;

Step 4. The α/2 quantile of and the 1−α/2 quantile of

are calculated to approximate δ_α/2 and δ_1−α/2, respectively.

Algorithm 2: MIM CI (δ_α/2, δ_1−α/2)

Step 1. For the given sample (x₁₁,x₁₂), t = x₁₁+x₁₂ and x = x₁₁ are calculated;

Step 2. Then ω₁, ω₂, u and v are randomly sampled from the uniform(0, 1) distribution, and

equations and are solved to obtain the unique solution (θ,φ). Then, δ = θ−φ is calculated;

Step 3. Step 2 is repeated N times (1,000,000 for example) to obtain N realizations of δ;

Step 4. The α/2 and 1−α/2 quantiles of δ are calculated to approximate δ_L and δ_U,

respectively.

Download:

Table 2. The four 95% confidence intervals and widths for the selected combinations of (n,X₁₁,X₁₂).

https://doi.org/10.1371/journal.pone.0272007.t002

For comparison, simulation studies are conducted to examine the performances of the score, Bayesian, fiducial and MIM CIs under different sample sizes. The parameters of the comparison are mainly the coverage probability and expected length. Refer to [2,5] for the parameter settings. We consider p₁₊ = 0.3, 0.5 and 0.8; δ = −0.3 (0.1) 0.3; and n = 20, 50, 100 and 300. In each simulation, we first resample the observed value (X₁₁,X₁₂) 10,000 times from the trinomial distribution, calculate the four different CIs accordingly, and compute the corresponding frequencies that cover δ. We regard the coverage frequency as the coverage probability. According to the central limit theorem, the coverage frequency of the nominal 95% confidence level tends to fall in the interval (0.9457, 0.9543) for 10,000 experimental repetitions. Moreover, the expected length is where δ_U and δ_L are the upper and lower limits of the interval, respectively.

We report the simulation results in Tables 3 and 4. Cases in which the coverage probability is less than 0.9457 appear in bold underlined. Clearly, the score and Bayesian CIs cannot guarantee the nominal coverage probability for small to moderate samples, except for the fiducial and MIM CIs. Moreover, the score and MIM intervals have similar coverage in most cases, but the expected lengths of score CIs are longer than those of MIM CIs. For example, when n = 20, p₁₊ = 0.3 and δ = 0.2, the coverage rates of the score and MIM CIs are 0.9647 and 0.9679, respectively. In this case, the expected lengths of the score CI (0.93) are significantly longer than those of the MIM CI (0.70). Furthermore, it seems that the score CI has a more accurate coverage probability than the MIM CI in some cases such as n = 50, p₁₊ = 0.3 and δ = 0.1, but our MIM CI also has a shorter length than the score CI. Note that our MIM method uses a shorter interval to obtain higher coverage, which shows that the MIM CI outperforms the score CI.

Download:

Table 3. The coverage probability (CP) and the expected length (EL) of various 95% CIs.

https://doi.org/10.1371/journal.pone.0272007.t003

Download:

Table 4. The coverage probability (CP) and the expected length (EL) of various 95% CIs.

https://doi.org/10.1371/journal.pone.0272007.t004

From Table 4, when the sample size increases, the coverage probabilities of the score, Bayesian and modified IM CIs all fall in the interval (0.9457, 0.9543). Although the fiducial CI has a slightly larger coverage rate than other intervals, the expected lengths of the four CIs are almost the same. In this sense, the fiducial method is not inferior to existing methods. Moreover, according to the central limit theorem, the large sample properties indicate that the theoretical results of the score, Bayesian, fiducial and modified IM methods will tend to be consistent. In summary, the fiducial and MIM methods can improve the poor coverage probabilities of the score and Bayesian approaches for small to moderate sample sizes. For larger samples, the CIs of score, Bayesian and MIM are the same, and the fiducial interval is not inferior to the other three intervals. Compared with the fiducial interval, the MIM interval exhibits more accurate coverage with a shorter expected length for all sample sizes. Hence, in terms of coverage probability and expected length, the MIM CI is the best of all.

To obtain a better understanding of the different performances of various intervals for small sample sizes. Let n = 30. Figs 1–3 give plots of the coverage probabilities and expected lengths of four CIs versus p₁₂ for fixed p₁₁ = 0.1, 0.2 and 0.5, respectively. Here, we draw a dashed line (y = 0.9457) as the maximum lower bound of the coverage probability. Clearly, the score and Bayesian intervals have coverage probabilities lower than 0.9457 in some cases, especially when p₁₂ is very close to zero. In contrast, the fiducial approach can always guarantee nominal coverage for all cases, and the MIM CI improves the conservative coverage of the fiducial CI. For expected length, the fiducial and MIM methods have shorter expected lengths than score and Bayesian approaches when p₁₂ is close to zero. Moreover, although the score and Bayesian CIs have shorter expected lengths in most cases, differences between various expected lengths are small when p₁₂ becomes larger. Note that the score and Bayesian CIs cannot guarantee the preset coverage probability; hence, the fiducial and MIM intervals are superior to the score and Bayesian intervals.

Download:

Fig 1. Coverage probabilities and expected lengths of the Score, Bayes, fiducial and MIM confidence intervals versus P₁₂ for fixed P₁₁, where n = 30 and P₁₁ = 0.1.

https://doi.org/10.1371/journal.pone.0272007.g001

Download:

Fig 2. Coverage probabilities and expected lengths of the Score, Bayes, fiducial and MIM confidence intervals versus P₁₂ for fixed P₁₁, where n = 30 and P₁₁ = 0.2.

https://doi.org/10.1371/journal.pone.0272007.g002

Download:

Fig 3. Coverage probabilities and expected lengths of the Score, Bayes, fiducial and MIM confidence intervals versus P₁₂ for fixed P₁₁, where n = 30 and P₁₁ = 0.5.

https://doi.org/10.1371/journal.pone.0272007.g003

Following [30], when the observed data (t,x) cannot be separated from the auxiliary variables (u’,v’), the validity condition K_t,x(δ)~Unif(0,1) may not be automatic. However, different from other approaches, we can check the good coverage performance of the MIM method for different parameter settings, p₁₁ = 0.1 (0.2) 0.5; p₁₂ = 0.1 (0.1) 0.5; n = 20 and 50. In each simulation, we take 10,000 samples. The corresponding Monte Carlo estimators of the distribution function of K_t,x(δ) in Figs 4–9 show that the approximate is valid, that is, , where W~Unif(0,1). In particular, from Fig 7–9, the distribution function of K_t,x(δ) is very close to that of Unif(0, 1) for a moderate sample size, hence the MIM method has accurate coverage.

Download:

Fig 4. Distribution functions of K_t,x(δ) (black) compared with that of Unif(0, 1) (gray) based on Monte Carlo samples from the trinomial distribution versus p₁₂ for fixed n = 20 and p₁₁ = 0.1. Panel (a): p₁₂ = 0.1. Panel (b): p₁₂ = 0.2. Panel (c): p₁₂ = 0.3. Panel (d): p₁₂ = 0.5.

https://doi.org/10.1371/journal.pone.0272007.g004

Download:

Fig 5. Distribution functions of K_t,x(δ) (black) compared with that of Unif(0, 1) (gray) based on Monte Carlo samples from the trinomial distribution versus p₁₂ for fixed n = 20 and p₁₁ = 0.3. Panel (a): p₁₂ = 0.1. Panel (b): p₁₂ = 0.2. Panel (c): p₁₂ = 0.3. Panel (d): p₁₂ = 0.5.

https://doi.org/10.1371/journal.pone.0272007.g005

Download:

Fig 6. Distribution functions of K_t,x(δ) (black) compared with that of Unif(0, 1) (gray) based on Monte Carlo samples from the trinomial distribution versus p₁₂ for fixed n = 20 and p₁₁ = 0.5. Panel (a): p₁₂ = 0.1. Panel (b): p₁₂ = 0.2. Panel (c): p₁₂ = 0.3. Panel (d): p₁₂ = 0.5.

https://doi.org/10.1371/journal.pone.0272007.g006

Download:

Fig 7. Distribution functions of K_t,x(δ) (black) compared with that of Unif(0, 1) (gray) based on Monte Carlo samples from the trinomial distribution versus p₁₂ for fixed n = 50 and p₁₁ = 0.1. Panel (a): p₁₂ = 0.1. Panel (b): p₁₂ = 0.2. Panel (c): p₁₂ = 0.3. Panel (d): p₁₂ = 0.5.

https://doi.org/10.1371/journal.pone.0272007.g007

Download:

Fig 8. Distribution functions of K_t,x(δ) (black) compared with that of Unif(0, 1) (gray) based on Monte Carlo samples from the trinomial distribution versus p₁₂ for fixed n = 50 and p₁₁ = 0.3. Panel (a): p₁₂ = 0.1. Panel (b): p₁₂ = 0.2. Panel (c): p₁₂ = 0.3. Panel (d): p₁₂ = 0.5.

https://doi.org/10.1371/journal.pone.0272007.g008

Download:

Fig 9. Distribution functions of K_t,x(δ) (black) compared with that of Unif(0, 1) (gray) based on Monte Carlo samples from the trinomial distribution versus p₁₂ for fixed n = 50 and p₁₁ = 0.5. Panel (a): p₁₂ = 0.1. Panel (b): p₁₂ = 0.2. Panel (c): p₁₂ = 0.3. Panel (d): p₁₂ = 0.4.

https://doi.org/10.1371/journal.pone.0272007.g009

A real data analysis

We illustrate the application of the proposed methods with a real example. A sample of 156 dairy calves born in Florida, were classified according to whether they had pneumonia within 60 days after birth [4]. Calves that got a pneumonia infection were also classified according to whether they got a secondary infection within two weeks after the first infection cleared up. Table 5 shows the data. Calves that did not get a primary infection could not get a secondary infection, so no observations can fall in the cell for “no” primary infection and “yes” secondary infection. The goal of this study was to test whether the probability of primary infection was the same as the conditional probability of secondary infection, given that the calf got the primary infection.

Download:

Table 5. Primary and secondary pneumonia infections of calves.

https://doi.org/10.1371/journal.pone.0272007.t005

Here we used the RD to study the effect of primary infection on the likelihood of developing secondary infection. Under the 95% confidence level, the score, Bayesian, fiducial and MIM CIs for δ are (0.15, 0.39), (0.15, 0.39), (0.14, 0.40) and (0.15, 0.39), respectively. Clearly, the lower bounds of the four intervals are all larger than 0. It is suggested that the primary infection of pneumonia should stimulate a natural immunity to reduce the likelihood of secondary infection. Hence, the fiducial and MIM methods work well with calf immunity data. More importantly, the MIM method also provides probabilistic summaries of the information in data concerning the quantity of interest. To be more informative, we plot the plausibility function , as a function of δ in Fig 10. By locating α = 0.05 on the vertical axis, we can easily find that the lower bound (0.15) and the upper bound (0.39) are in the MIM CI. Furthermore, the plausibility function shows that each point δ in the MIM interval is individually sufficiently plausible. Clearly, no frequentist or Bayesian interval can assign such a meaning to the individual elements it contains. In this sense, the proposed MIM interval is recommended for practical use.

Download:

Fig 10. MIM’s Plausibility function of δ in the calf immunity data.

https://doi.org/10.1371/journal.pone.0272007.g010

Discussion

The RD is a comparative measure between the probability of the primary infection and the conditional probability of the secondary infection, given the primary infection. The confidence intervals of the score and Bayesian methods have poor coverage performance for small to moderate sample sizes. In this paper, we propose three valid CIs based on the fiducial, IM and MIM approaches for the RD. The fiducial and IM-based CIs have more accurate coverage performance than the score and Bayesian CIs. Compared with the fiducial approach, IM-based approaches can provide meaningful probabilistic summaries of the information in data concerning the quantity of interest. Moreover, the MIM method uses a randomized IM idea to modify the two inequation associations of IM to an accurate equation model. A real data example shows that the proposed methods work well for the calf immunity data.

Different from other approximate approaches, our MIM solution has the advantage that its approximation precision, which only depends on the simulation times rather than the sample size, may tend to be as high as possible whether the sample size is large or small. Moreover, the IM’s output is posterior-probabilistic in nature and, therefore, has a meaningful interpretation within and not just across experiments. Moreover, our fiducial and IM-based methods are general ideas that can be applied to infer other risk factors, such as the risk ratio. Finally, since the effect of the observed data cannot be separated from the auxiliary variables, there could be interest in the simultaneous prediction of several auxiliary variables. The best choice of predictive random set needs further study.

Supporting information

S1 Appendix.

https://doi.org/10.1371/journal.pone.0272007.s001

(DOCX)

Acknowledgments

We are grateful to the academic-editor and two anonymous referees whose comments helped to significantly improve our manuscript. We thank Prof. JH and Prof. LC for their helpful suggestions about the manuscript.

References

1. Garcia JJ, Blanca M, Moreno F, et al. Determination of IgE antibodies to the benzylpenicilloyl determinant: a comparison of the sensitivity and specificity of three radio allegro Sorbent test methods. Journal of Clinical Laboratory Analysis. 1997; 11:251–257. pmid:9292392
- View Article
- PubMed/NCBI
- Google Scholar
2. Tang NS, Tang ML. Statistical inference for risk difference in an incomplete correlated 2×2 table. Biometrical Journal. 2003; 45:34–46.
- View Article
- Google Scholar
3. Toyota M, Kudo K, Sumiya M, Kobori O. High frequency of individuals with strong reaction to tuberculosis among clinical trainees. Japanese Journal of Infectious Diseases. 1999; 52:128–129. pmid:10507995
- View Article
- PubMed/NCBI
- Google Scholar
4. Agresti A. Categorical Data Analysis. New York: Wiley; 1990.
5. Shi L, Sun HY, Bai P. Bayesian confidence interval for difference of the proportions in a 2×2 table with structural zero. Journal of Applied Statistics. 2009; 36:483–394.
- View Article
- Google Scholar
6. Lui KJ. Confidence interval of the simple difference between the proportions of a primary infection and a secondary infection, given the primary infection. Biomedical Journal. 2000; 42:59–69.
- View Article
- Google Scholar
7. Aslam M. A new method to analyze rock joint roughness coefficient based on neutrosophic statistics. Measurement. 2019; 146:65–71.
- View Article
- Google Scholar
8. Aslam M, Rao GS, Khan N. Single-stage and two-stage total failure-based group-sampling plans for the Weibull distribution under neutrosophic statistics. Complex and intelligent systems. 2021; 7: 891–900.
- View Article
- Google Scholar
9. Afzal U, Ahmad N, Zafar Q, et al. Fabrication of a surface type humidity sensor based on methyl green thin film, with the analysis of capacitance and resistance through neutrosophic statistics. RSC Advances. 2021; 11:38674–38682. pmid:35493226
- View Article
- PubMed/NCBI
- Google Scholar
10. Bai P, Gan W, Shi L. Bayesian confidence interval for the risk ratio in a correlated 2 × 2 table with structural zero. Journal of Applied Statistics. 2011; 38:2805–2817.
- View Article
- Google Scholar
11. Gupta RC, Tian S. Statistical inference for the risk ratio in 2 × 2 binomial trails with structural zero. Computational Statistics & Data Analysis. 2007; 51:3070–3084.
- View Article
- Google Scholar
12. Hwang JS, Biswas A. Odds ratio for a single 2 × 2 table with correlated binomials for two margins. Statistical Methods and Applications. 2008; 17:483–497.
- View Article
- Google Scholar
13. Stamey JD, Seaman JW, Young DM. Bayesian inference for a correlated 2 × 2 table with a structural zero. Biometrical Journal. 2006; 48:233–244. pmid:16708775
- View Article
- PubMed/NCBI
- Google Scholar
14. Tang NS, Tang ML. Exact unconditional inference for risk ratio in a correlated 2 × 2 table with structural zero. Biometrics. 2002; 58:972–980. pmid:12495152
- View Article
- PubMed/NCBI
- Google Scholar
15. Wang SF, Wang XR. Statistical inference of risk ratio in a correlated 2 × 2 table with structural zero. Computational Statistics. 2013; 28:1599–1615.
- View Article
- Google Scholar
16. Agresti A, Min YY. Frequentist performance of Bayesian confidence intervals for comparing proportions in 2 × 2 tables. Biometrics. 2005; 61:515–523. pmid:16011699
- View Article
- PubMed/NCBI
- Google Scholar
17. Newcombe RG. Bayesian estimation of false-negative rate in a clinical trial of sentinel node biopsy. Statistics in Medicine. 2007; 26:3429–3442. pmid:17133626
- View Article
- PubMed/NCBI
- Google Scholar
18. Hanning J, Iyer HK, Patterson P. Fiducial Generalized Confidence Intervals. Journal of the American Statistical Association. 2006; 101:254–269.
- View Article
- Google Scholar
19. Hannig J. On generalized fiducial inference. Statistical Since. 2009; 19:491–544.
- View Article
- Google Scholar
20. E L, Hannig J, Iyer HK. Fiducial intervals for variance components in an unbalanced two-component normal mixed linear model. Journal of the American Statistical Association. 2008; 103:854–865.
- View Article
- Google Scholar
21. Iyer HK, Wang CM, Matthew T. Models and confidence intervals for true values in interlaboratory trials. Journal of the American Statistical Association. 2004; 99:1060–1071.
- View Article
- Google Scholar
22. Martin R, Liu CH. Inferential Models: A Framework for Prior-Free Posterior Probabilistic Inference. Journal of the American Statistical Association. 2013; 108: 301–313.
- View Article
- Google Scholar
23. Martin R, Liu CH. Conditional inferential models: combining information for prior-free probabilistic inference. Journal of the Royal Statistical Society Series B-Statistical Methodology. 2015; 77:195–217.
- View Article
- Google Scholar
24. Martin R, Liu CH. Marginal inferential models: prior-free probabilistic inference on interest parameters. Journal of the American Statistical Association. 2015; 110:1621–1631.
- View Article
- Google Scholar
25. Lu HZ, Jin H, Wang ZN, Chen C, Lu Y. Prior-free probabilistic interval estimation for binomial proportion. TEST. 2019; 28:522–542.
- View Article
- Google Scholar
26. Lu H, Jin H. A new prediction interval for binomial random variable based on Inferential Models. Journal of Statistical Planning and Inference. 2020; 205:156–174.
- View Article
- Google Scholar
27. Lu HZ, Jin H, Li Y, Wang ZN. Confidence intervals for a Poisson parameter with background. Communications in Statistics—Theory and Methods. Forthcoming 2022. pmid:35399822
- View Article
- PubMed/NCBI
- Google Scholar
28. Wang ZN, Jin H, Lu HZ, Jin YL. An efficient test based on the Inferential Model for the non-inferiority of odds ratio in matched-pairs design. Statistical Methods in Medical Research. 2018; 27:2831–2841. pmid:28093963
- View Article
- PubMed/NCBI
- Google Scholar
29. Wang ZN, Jin H, Lu HZ. An IM-based efficient test for noninferiority of the odds ratio between two independent binomial proportions, Communications in Statistics—Theory and Methods. Forthcoming 2021. https://doi.org/10.1080/03610926.2021.1926507.
- View Article
- Google Scholar
30. Martin R, Lingham R. Prior-Free Probabilistic Prediction of Future Observations. Technometrics. 2016; 58:225–235.
- View Article
- Google Scholar

[ref1] 1. Garcia JJ, Blanca M, Moreno F, et al. Determination of IgE antibodies to the benzylpenicilloyl determinant: a comparison of the sensitivity and specificity of three radio allegro Sorbent test methods. Journal of Clinical Laboratory Analysis. 1997; 11:251–257. pmid:9292392
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Tang NS, Tang ML. Statistical inference for risk difference in an incomplete correlated 2×2 table. Biometrical Journal. 2003; 45:34–46.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref3] 3. Toyota M, Kudo K, Sumiya M, Kobori O. High frequency of individuals with strong reaction to tuberculosis among clinical trainees. Japanese Journal of Infectious Diseases. 1999; 52:128–129. pmid:10507995
View Article
PubMed/NCBI
Google Scholar

[9] View Article

[10] PubMed/NCBI

[11] Google Scholar

[ref4] 4. Agresti A. Categorical Data Analysis. New York: Wiley; 1990.

[ref5] 5. Shi L, Sun HY, Bai P. Bayesian confidence interval for difference of the proportions in a 2×2 table with structural zero. Journal of Applied Statistics. 2009; 36:483–394.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref6] 6. Lui KJ. Confidence interval of the simple difference between the proportions of a primary infection and a secondary infection, given the primary infection. Biomedical Journal. 2000; 42:59–69.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref7] 7. Aslam M. A new method to analyze rock joint roughness coefficient based on neutrosophic statistics. Measurement. 2019; 146:65–71.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref8] 8. Aslam M, Rao GS, Khan N. Single-stage and two-stage total failure-based group-sampling plans for the Weibull distribution under neutrosophic statistics. Complex and intelligent systems. 2021; 7: 891–900.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref9] 9. Afzal U, Ahmad N, Zafar Q, et al. Fabrication of a surface type humidity sensor based on methyl green thin film, with the analysis of capacitance and resistance through neutrosophic statistics. RSC Advances. 2021; 11:38674–38682. pmid:35493226
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref10] 10. Bai P, Gan W, Shi L. Bayesian confidence interval for the risk ratio in a correlated 2 × 2 table with structural zero. Journal of Applied Statistics. 2011; 38:2805–2817.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref11] 11. Gupta RC, Tian S. Statistical inference for the risk ratio in 2 × 2 binomial trails with structural zero. Computational Statistics & Data Analysis. 2007; 51:3070–3084.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref12] 12. Hwang JS, Biswas A. Odds ratio for a single 2 × 2 table with correlated binomials for two margins. Statistical Methods and Applications. 2008; 17:483–497.
View Article
Google Scholar

[36] View Article

[37] Google Scholar

[ref13] 13. Stamey JD, Seaman JW, Young DM. Bayesian inference for a correlated 2 × 2 table with a structural zero. Biometrical Journal. 2006; 48:233–244. pmid:16708775
View Article
PubMed/NCBI
Google Scholar

[39] View Article

[40] PubMed/NCBI

[41] Google Scholar

[ref14] 14. Tang NS, Tang ML. Exact unconditional inference for risk ratio in a correlated 2 × 2 table with structural zero. Biometrics. 2002; 58:972–980. pmid:12495152
View Article
PubMed/NCBI
Google Scholar

[43] View Article

[44] PubMed/NCBI

[45] Google Scholar

[ref15] 15. Wang SF, Wang XR. Statistical inference of risk ratio in a correlated 2 × 2 table with structural zero. Computational Statistics. 2013; 28:1599–1615.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref16] 16. Agresti A, Min YY. Frequentist performance of Bayesian confidence intervals for comparing proportions in 2 × 2 tables. Biometrics. 2005; 61:515–523. pmid:16011699
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref17] 17. Newcombe RG. Bayesian estimation of false-negative rate in a clinical trial of sentinel node biopsy. Statistics in Medicine. 2007; 26:3429–3442. pmid:17133626
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref18] 18. Hanning J, Iyer HK, Patterson P. Fiducial Generalized Confidence Intervals. Journal of the American Statistical Association. 2006; 101:254–269.
View Article
Google Scholar

[58] View Article

[59] Google Scholar

[ref19] 19. Hannig J. On generalized fiducial inference. Statistical Since. 2009; 19:491–544.
View Article
Google Scholar

[61] View Article

[62] Google Scholar

[ref20] 20. E L, Hannig J, Iyer HK. Fiducial intervals for variance components in an unbalanced two-component normal mixed linear model. Journal of the American Statistical Association. 2008; 103:854–865.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref21] 21. Iyer HK, Wang CM, Matthew T. Models and confidence intervals for true values in interlaboratory trials. Journal of the American Statistical Association. 2004; 99:1060–1071.
View Article
Google Scholar

[67] View Article

[68] Google Scholar

[ref22] 22. Martin R, Liu CH. Inferential Models: A Framework for Prior-Free Posterior Probabilistic Inference. Journal of the American Statistical Association. 2013; 108: 301–313.
View Article
Google Scholar

[70] View Article

[71] Google Scholar

[ref23] 23. Martin R, Liu CH. Conditional inferential models: combining information for prior-free probabilistic inference. Journal of the Royal Statistical Society Series B-Statistical Methodology. 2015; 77:195–217.
View Article
Google Scholar

[73] View Article

[74] Google Scholar

[ref24] 24. Martin R, Liu CH. Marginal inferential models: prior-free probabilistic inference on interest parameters. Journal of the American Statistical Association. 2015; 110:1621–1631.
View Article
Google Scholar

[76] View Article

[77] Google Scholar

[ref25] 25. Lu HZ, Jin H, Wang ZN, Chen C, Lu Y. Prior-free probabilistic interval estimation for binomial proportion. TEST. 2019; 28:522–542.
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref26] 26. Lu H, Jin H. A new prediction interval for binomial random variable based on Inferential Models. Journal of Statistical Planning and Inference. 2020; 205:156–174.
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref27] 27. Lu HZ, Jin H, Li Y, Wang ZN. Confidence intervals for a Poisson parameter with background. Communications in Statistics—Theory and Methods. Forthcoming 2022. pmid:35399822
View Article
PubMed/NCBI
Google Scholar

[85] View Article

[86] PubMed/NCBI

[87] Google Scholar

[ref28] 28. Wang ZN, Jin H, Lu HZ, Jin YL. An efficient test based on the Inferential Model for the non-inferiority of odds ratio in matched-pairs design. Statistical Methods in Medical Research. 2018; 27:2831–2841. pmid:28093963
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref29] 29. Wang ZN, Jin H, Lu HZ. An IM-based efficient test for noninferiority of the odds ratio between two independent binomial proportions, Communications in Statistics—Theory and Methods. Forthcoming 2021. https://doi.org/10.1080/03610926.2021.1926507.
View Article
Google Scholar

[93] View Article

[94] Google Scholar

[ref30] 30. Martin R, Lingham R. Prior-Free Probabilistic Prediction of Future Observations. Technometrics. 2016; 58:225–235.
View Article
Google Scholar

[96] View Article

[97] Google Scholar

Figures

Abstract

Introduction

Existing methods

Score test-based CI

Bayesian tail-based CI

New confidence intervals

Fiducial CI

IM-based CI

Modified IM CI

Simulation results

A real data analysis

Discussion

Supporting information

S1 Appendix.

Acknowledgments

References