Finite sample size errors in the context of multiple error sources in quantitative medical imaging: An evaluation for breast magnetic resonance diffusion-weighted imaging

Jessica V. Eberle; Sebastian Bickelhaupt; Lorenz A. Kapsner; Sabine Ohlmeyer; Evelyn Wenkel; Michael Uder; Dominika Skwierawska; Katharina Tkotz; Dominique Hadler; Tristan A. Kuder; Frederik B. Laun

doi:10.1371/journal.pone.0341201

Abstract

Background

Selecting appropriate sample sizes in magnetic resonance imaging studies is a complex process that requires to balance statistical rigor with the practical challenges of measuring a large patient population. In this Institutional Review Board approved study, we evaluate the dominant error types (“finite N” errors versus precision errors) for apparent diffusion coefficient (ADC)-based lesion characterization in diffusion-weighted magnetic resonance imaging (DWI) of the female breast in a local dataset and compare our results with current literature.

Methods

First, in a literature review including 24 published breast DWI studies, the standard error of the area under the receiver operating characteristic curve as a measure of sample size-related errors (finite N errors) was estimated for the reported ADC values and compared to the values, derived from expert readings of a university hospital’s cohort of 171 patients with suspicious breast lesions. Second, precision errors were assessed based on published analyses of the coefficient of variation of ADC values, measured in breast DWI exams.

Results

Finite N errors were dominant in the in-house study and most of the 24 reviewed studies. The median sample size at which finite N errors and precision errors were equal was determined to be n = 932.

Discussion

This analysis of dominant error types shows that the required sample sizes for the considered use case are not unreasonably large and that reducing sample sizes may not be justified based on the merits of the conducted analysis. Nonetheless, incorporating dominant error type assessments into future studies may provide valuable insights for optimizing study design and improving methodological rigor.

Citation: Eberle JV, Bickelhaupt S, Kapsner LA, Ohlmeyer S, Wenkel E, Uder M, et al. (2026) Finite sample size errors in the context of multiple error sources in quantitative medical imaging: An evaluation for breast magnetic resonance diffusion-weighted imaging. PLoS One 21(6): e0341201. https://doi.org/10.1371/journal.pone.0341201

Editor: Pascal A. T. Baltzer, Medical University of Vienna, AUSTRIA

Received: July 26, 2025; Accepted: May 15, 2026; Published: June 4, 2026

Copyright: © 2026 Eberle et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Abbreviations: ADC, apparent diffusion coefficient; AUC, area under the receiver operating characteristic curve; BI-RADS, Breast Imaging Reporting and Data System; COV, coefficient of variation; DWI, diffusion-weighted imaging; MRI, magnetic resonance imaging; PDF, probability density function; Std, standard deviation

Introduction

Choosing an adequate sample size is a key task in research, whether for planning a study, obtaining institutional review board approval, or during the publication review process. Established methods to determine an adequate sample size are often based on (estimated) effect sizes, the desired significance level, and statistical power. These methods are well established and widely used in research; however, they also have limitations. For example, effect sizes may not be known a priori, and there are no strict rules on how to choose the significance level [1–3]. A standard level for the significance threshold is 0.05, but there are also reasons to choose other values, such as 0.005 [4].

Given this uncertainty, examining established practices may provide useful guidance. In the field of magnetic resonance imaging (MRI) research, for example, Hanspach et al. and Bögerl et al. investigated the sample sizes in methodological and clinical MRI studies, with median sample sizes of n = 6 [2] and n = 74 [5], respectively. While these provided descriptive information, they did not assess the suitability of the sample sizes used. To address this limitation, the present study assesses the adequacy of sample sizes, following a methodology common in measurement science – namely, estimating individual uncertainty contributions and identifying which contributes most to the total uncertainty (see Ch. 2–3 of [6]). This can be used to identify the limiting factor in diagnostic performance and guide methodological optimization.

Uncertainty in a quantitative MRI research study may be introduced by using a finite sample size, leading to a “finite N” error. Naturally, further error sources will be present in any study. At a conceptual level, these error types may be classified into accuracy and precision errors. Precision refers to the test-retest-reproducibility, whereas accuracy refers to how close the mean measured quantitative value is to the true value. Generally, accuracy is much harder to assess in quantitative medical imaging studies, where a reliable ground truth is usually missing. As reports on precision are thus generally more readily available, we focused on the comparison of finite N errors and precision errors in the present investigation.

Such an assessment may guide study planning. For example, when the precision error dominates relative to the finite N error, further increasing the sample size may have a limited effect, and efforts may be better directed toward improving measurement precision rather than recruiting additional patients and burdening them with MRI exams.

For our analysis, we chose a use case that we deemed representative of the field – apparent diffusion coefficient (ADC)-based lesion characterization in diffusion-weighted magnetic resonance imaging (DWI) of the female breast, which is an established and relevant application field with sufficient high-quality studies for our analysis to rely on. Usually, the water ADC in malignant breast lesions is lower than in benign lesions, which enables discrimination between the two lesion types [7,8]. A standard approach to assess the clinical value of such quantitative evaluations is receiver operating characteristic (ROC) analysis, which yields the area under the curve (AUC). An AUC of 1 indicates a perfect separation of the two classes, whereas an AUC of 0.5 indicates that the classification performance is statistically not better than random chance. The AUC obtained in a study depends on the specific samples, that is, on the actually measured patients. The standard error of the AUC therefore provides a useful estimate of the finite N error arising from sampling variability. In the present analysis, this error is compared to the precision error, which is derived from studies on the test-retest-reproducibility of breast DWI.

To address this, we applied a methodological framework from physical measurement science that focuses on identifying the dominant source of error. This approach enables us to assess whether finite sample size or measurement imprecision limits diagnostic performance in current practice. Although the present review focuses on ADC-based lesion characterization in breast DWI, the question of whether diagnostic performance is limited by sample size or by measurement precision is relevant across quantitative imaging research. In many applications, diagnostic performance is assessed in the absence of a known ground truth, and characterizing the relative contributions of sampling variability and measurement uncertainty therefore provides critical context for interpreting study results in a broader quantitative imaging setting [9,10].

Methods

Data acquisition for the in-house study

This retrospective study was approved by the ethics committee of the Friedrich-Alexander Universität (FAU) Erlangen-Nürnberg, Erlangen, Germany, waiving the need for informed consent. The data for research purposes was accessed from April 1, 2021 until December 31, 2021. The authors were blinded to information that could lead to the identification of individual participants during or after data extraction. Consecutive MRI examinations of n = 359 women from October 2015 to December 2019 were included, reflecting the unbiased routine spectrum of clinically indicated breast MRI examinations.

Inclusion and exclusion criteria, details about the histopathological analysis serving as ground truth, the imaging protocol, and the statistical analysis of the in-house study, as well as its limitations, are provided as supporting information (see S1 File). The images were evaluated by a medical student (J.V.E., two years of experience in breast lesion segmentation) who was supervised by a board-certified radiologist (S.B., > 10 years experience in breast MRI). They were not aware of the histopathologic results, but were informed of the BI-RADS classification and the radiology report. Lesions were identified on T₁-weighted post-contrast subtraction images, taking the radiology report into account. Lesions were manually segmented in 3D Slicer (version 4.11.20210226) on the axial slice of the DWI b = 1500 s/mm² data where they appeared largest (see Fig 1). Boundary voxels that contained fat tissue were excluded from the segmentation.

Download:

Fig 1. Segmentation example of a malignant breast lesion (in-house study).

Example images of a 64-year-old woman with one malignant breast lesion (radiological Breast Imaging Reporting and Data System (BI-RADs) 4, histopathological B5b). a) Axial slice from the dynamic contrast-enhanced early subtraction image, an acquisition commonly used in clinical breast MRI to identify enhancing lesions. The subtraction image was generated by subtracting T₁-weighted MR images acquired before the administration of contrast agent from those acquired after the administration. The malignant breast lesion accumulated contrast agent and thus appears bright in this image. b) Corresponding DWI image (b = 1500 s/mm²) with the region of interest (ROI) placed conservatively to minimize partial volume effects. Segmentation volume 0.2 cm³, apparent diffusion coefficient = 0.79 ± 0.12 µm²/ms. The contrast in this image is not generated by a contrast agent. Instead, a diffusion-weighting is applied. The lower the diffusion coefficient, the lower is the signal loss generated by the diffusion-weighting, which leads to a hyperintense signal in this low-diffusivity lesion.

https://doi.org/10.1371/journal.pone.0341201.g001

Histopathology served as ground-truth. 3D Slicer (version 4.11.20210226) was used to calculate the lesion size and the segmentation-averaged ADC values. For the computation of the ADC values, the ADC maps provided by the scanner were used.

To improve the comparability of the ADC values derived from the different publications throughout the literature review, eliminate systematic shifts, and facilitate subsequent statistical simulations, the mean values and standard deviations were normalized according to Equations (1) and (2). The measured data points of class 1 (malignant) and data points of class 2 (benign) are labeled as and , respectively. Here, “q” stands for “quantitative (value)”, i.e., the ADC, for example. The following normalization was performed:

(1)

(2)

where and are the normalized quantitative values. This normalization simplifies the analytical analysis since the mean of becomes zero and the mean of becomes one. Consequently, the number of variables that must be tracked in the analytical analysis is reduced from four (the means and standard deviations of and ) to two (the standard deviations of and ).

An overview of the variables used in this study is provided in Table 1.

Download:

Table 1. Overview and definitions of variables used in the analysis.

https://doi.org/10.1371/journal.pone.0341201.t001

Literature search for breast diffusion-weighted imaging studies

The selected use case “DWI in breast MRI” was selected for its relevance and the availability of reported data. A literature research was performed using the PubMed database with the search term “Mamma AND DWI AND ADC” in December 2021. The retrospective search yielded n = 174 studies published between 2002 and 2021. From the resulting 174 studies (published between 2002 and 2021), 24 publications met the inclusion criteria (for further details please refer to Fig 2 and the supporting information (S1 File)). For each study, the reported mean and standard deviation of the ADCs of malignant and benign lesions were retrieved (malignant: and , benign: and ), along with the group sizes of the malignant class () and the benign class ().

Download:

Fig 2. Flowchart of the literature selection process.

From an initial 174 studies, 24 met the predefined inclusion criteria for quantitative ADC evaluation in breast DWI. Some inclusion criteria were necessary to make the included studies suitable for our analysis (e.g., data on both malignant and benign lesions was required, as well as the standard deviations of the measured ADCs). Other criteria were defined restrictively to ensure a high comparability of the included studies (e.g., due to the exclusion of studies performed with scanners from different vendors or of older scanners) and to ensure that high-quality studies were included (e.g., with sample sizes larger than 50). ADC = apparent diffusion coefficient; DWI = diffusion-weighted image; std = standard deviation.

https://doi.org/10.1371/journal.pone.0341201.g002

Similarly as in the in-house study, the mean values of the two classes were normalized to 0 and 1, respectively ( and ). The corresponding normalized standard deviations (malignant: and benign: ) for both the in-house study and the reported studies were determined as follows:

(3)

(4)

Thus, one value of , , , , , and was obtained for each of the included studies.

Literature search for reported coefficients of variation

While reports on the accuracy of MRI are available [11,12], accuracy is usually more difficult to assess than precision. Therefore, we here focused on precision errors, which can be expressed by the coefficient of variation (. A literature search was performed for reported values in breast DWI using the Pubmed database and the search term “coefficient of variation AND “breast OR mamma” AND ADC AND DWI” in December 2021, with 10 studies included. From the 10 values of the 10 studies, the mean, , was computed (see further details, including inclusion criteria, in the supporting information (S1 File)).

The was converted into a standard deviation . To obtain this standard deviation, the was multiplied with the mean ADC value. One could treat benign and malignant lesions separately and obtain two standard deviations. However, for simplicity, the overall mean ADC value of all lesions including a conversion to the normalized space was calculated here as follows:

(5)

Thus, was kept fixed. However, varied among the included breast DWI studies because their and values differed. For the in-house study, was calculated with the same formula.

Finite N error: Assessment with Monte-Carlo simulations and kernel density estimations

For the quantitative evaluation of both types of error, the AUC was used as a common measure of diagnostic discriminatory power. In the following steps, the strength with which it is influenced by sampling variability (finite N error) and how it is influenced by measurement imprecision (precision error) were analyzed. The standard error of the AUC, , was estimated by means of Monte Carlo simulations performed in Matlab (Version 2022b, MathWorks, Natick, USA). In the AUC analysis, cutoff values were used that ranged from to . In this analysis, lesions with q-values smaller than the cutoff value were classified as malignant while the remaining lesions were classified as benign. The simulation was performed in normalized space for comparability and was carried out as described in the following. random numbers were generated for the malignant class and random numbers for the benign class.

These random numbers were drawn from Gaussian distributions using the normalized means ( and ) and standard deviations ( and ) of the respective study. In pseudo-code, the random numbers were generated as follows:

(6)

(7)

Here, is the Matlab function that generates a normally distributed random number with a mean 𝜇 = 0 and a standard deviation 𝜎 = 1.

In contrast to the analysis of the literature studies, where only summary statistics (mean and standard deviation) were available and therefore a normal distribution was assumed, all individual ADC values were available for the in-house study. This allowed for the underlying probability density functions (PDFs) of the malignant and benign lesions to be determined empirically using kernel density estimation rather than prescribing a specific distribution shape. The advantage of this approach is that the complete distribution structure of the data, including possible asymmetries or multi-peakedness, is preserved. This allowed for the class separation and the finite N error to be modeled more realistically. The “kernel” parameter controlled the degree of smoothing of the empirical density function by specifying the width of the Gaussian kernel used. For the in-house study, the random numbers were generated by picking a random normalized ADC value that had been obtained in the study ( for the malignant and for the benign class) and adding a small normally distributed random variable. In pseudo-code:

(8)

(9)

Here, draws an integer from a uniform distribution between 1 and . The value was set to 0.1. The purpose of introducing the -term was to mimic a kernel density estimation of the true distribution from the available data points. The choice of is described in detail in the subsection “Additional precision errors for the internal study”.

For both in-house study and literature studies, the AUCs were calculated from these random -values. This process was repeated 10,000 times. From the thus obtained 10,000 AUC values (per study), the mean and the standard error, , were computed. Using this approach, quantified the variation of the that originated from the random sampling of -values. The precision error was not explicitly included in this calculation. However, it was contained implicitly because the published values of and were derived with an immanent precision error that contributes to the standard deviation of the -values found in the studies. The thus obtained value was the finite N error.

Figs 3a and 3b visualize the finite N error, represented by the standard deviation of the AUC, which arises from random variability in the estimated PDFs due to limited sample size. Due to limited precision, one does not measure the true quantitative value , where is the normalized ADC in the case of DWI, but an imprecise estimate . describes the difference between the ideal ADC (without measurement imprecision) and the ADC reduced by limited precision. The PDF of is .

Download:

Fig 3. Illustration of error types affecting class separation.

a) Ideal case with perfectly known PDFs for two classes. b) Finite N error leads to incorrectly estimated PDFs. Consequently, cutoff and AUC are randomly estimated, resulting in the standard error of AUC, . c) Precision error broadens PDFs due to measurement noise, reducing AUC by . AUC = area under the receiver operating characteristic curve; = sample size; PDF = probability density function.

https://doi.org/10.1371/journal.pone.0341201.g003

Precision error: Analytical assessment using normal probability density functions

The precision error was assessed with the following theoretical consideration. Let the quantitative parameter be called . Once again, represents the ADC normalized such that the mean of classes 1 and 2 equal 0 and 1, respectively. The distribution of for malignant and benign classes is described by the two PDFs and , respectively. Here, and were modeled as normal functions ; see Fig 3a:

(10)

(11)

Here, and are the variances of the two normal distributions. The means of the distributions are equal to 0 and 1, respectively. The respective AUC is given below (see supporting information (S1 File)):

(12)

where denotes the error function. This AUC value represents the ideal case of vanishing precision error and and .

As visualized in Fig 3c, the PDFs are broadened by this error. This broadening can be described with a convolution:

(13)

(14)

While may have involved functional shapes, the simplification that it is Gaussian with mean = 0 and standard deviation was made. That is, is the standard deviation that one obtains when measuring the same dataset several times. For simplicity, was assumed to be identical for both classes here. Thus,

(15)

The convolution of a Gaussian function with a Gaussian function is a Gaussian function. Hence, in the case of two Gaussian PDFs, this broadening leads to the following PDFs:

(16)

(17)

Accordingly, the variances and are additive, as are and .

The corresponding AUC is as follows:

(18)

This AUC value represents the case with limited precision () and and .

The drop in AUC due to limited precision is as follows:

(19)

is the precision error.

Additional precision error for the in-house study: Numerical assessment using probability density functions

For the in-house study, the data points of class 1 and data points of class 2 were called and , respectively. In the absence of a precision error, the PDFs for the in-house study were estimated with a kernel density estimation:

(20)

(21)

where is a normal function with zero mean and standard deviation . Fig 4 shows the reconstructed PDFs for 0.05, 0.1, 0.15, and 0.2. In the subsequent analysis, was used. This value was considered a good compromise between retaining detail and reducing spurious peaks from sampling noise. One the one hand, it smeared out most peaks visible with , which were assumed to be sampling artifacts. On the other hand, was kept as small as possible to minimize the blurring of the PDFs that accompanies larger values.

Download:

Fig 4. Estimated probability density functions for malignant and benign lesions at different kernel widths.

The range of different kernel widths presented in Figs 4a–d is = 0.05–0.2. a) 0.05, b) 0.1, c) 0.15, and d) 0.2. Each panel shows the effect of kernel size on the smoothness of the reconstructed distributions based on the in-house dataset. Red lines represent the malignant class, blue lines the benign class. () = probability density function; = normalized quantitative parameter.

https://doi.org/10.1371/journal.pone.0341201.g004

For the case with measurement imprecision, that is, for , the approach used for the literature studies was adopted to estimate the PDFs as follows:

(22)

(23)

Then, the area under the curve was computed by numerically computing these PDFs at 5000 points ranging from −2.5 to 3 and integrating numerically using the following equations. Different numbers of points were tested in preparatory evaluations, and 5000 points were found to be sufficient to ensure convergence of the numerical result.

(24)

(25)

(26)

Then, the drop in AUC due to was computed with the formula

(27)

Sample size needed for equivalence of errors

The AUC’s standard error, scales as follows:

(28)

to a good approximation, where . If for the particular value of a particular study, one obtains an equivalence of errors by increasing (or decreasing) the sample size:

(29)

was computed for all previous studies and the in-house study using the mean coefficient of variation , which had been obtained from the literature search.

Dependence of the sample size needed for equivalence of errors on the coefficient of variation.

was additionally computed for the following values: 1%, 2%, 3%, 5%, 7%, 10%, 13%, 16%, and 20% (also with 10,000 repetitions each). The mutual dependency was then evaluated with the fit of a power law relation:

(30)

For the fit, the median among all studies was used, and Eq. 31 was linearized: . Then, and were fitted with a Levenberg–Marquardt fit.

Theoretically, one would expect the following relationship. For a small (or small values), may be approximated as follows (see supporting information (S1 File)):

(31)

Thus, scales like , and scales like (see Eq. 29)

(32)

Availability of code

The programming MATLAB code for the described analyses is provided as supporting information (S2 File) as well as a MATLAB code to perform your own analysis (S3 File). Additional files to run both Matlab codes are provided (S4 File, S5 File, S6 File).

Results

Apparent diffusion coefficient values in benign and malignant breast lesions

In-house-study.

In total, 171 lesions were included in our in-house study. The mean ADCs per class (0.79 µm²/ms for malignant lesions and 1.32 µm²/ms for benign lesions) and their respective standard deviations (0.20 µm²/ms and 0.32 µm²/ms) are given in Table 2. The demographic characteristics, selection process, histopathological classification, and imaging protocol are detailed in the supporting material (S1 File). Table 3 shows the normalized in-house study values of , , , and . The values of and just equal zero and one, respectively, due to the chosen normalization.

Download:

Table 2. Mean and standard deviation of ADC values for malignant and benign breast lesions in the in-house study and 24 published studies.

https://doi.org/10.1371/journal.pone.0341201.t002

Download:

Table 3. Normalized mean and standard deviation of ADC values, and

for malignant and benign lesions in the in-house and reported studies.

https://doi.org/10.1371/journal.pone.0341201.t003

Literature search.

The literature search yielded n = 24 studies. The retrieved means and standard deviations are summarized in Table 2. The total sample size ranged from 41 to 326, with a mean value of 129. The ADC values of malignant lesions ranged from 0.79 µm²/ms to 1.20 µm²/ms, with a mean value of 0.97 µm²/ms. The ADC values of benign lesions were higher in each study and ranged from 1.10 µm²/ms to 1.99 µm²/ms, with a mean value of 1.50 µm²/ms.

The mean value of the included studies was 0.51, and the mean value was 0.71 (see Table 3). For example, study 10 [22] stood out somewhat with rather small normalized standard deviations ( 0.28 and 0.27). Studies 5 [17] and 21 [33] had the largest and values among the considered studies (study 5: 1.71 and 2.07; study 21: 1.33 and 1.62).

Literature search: Reported coefficients of variation

Table 4 summarizes the 10 considered studies on the in breast DWI. The actual study design varied between the studies regarding the segmentation procedure, the retest approach, and the considered tissue. The mean across all 10 studies, , was 7.7% ± 3.9%. The values for the 24 studies derived with this value are stated in Table 3. They ranged from 0.11 (for study 3 [15]) to 0.65 (for study 5 [17]). The mean value was 0.21.

Download:

Table 4. Reported test–retest coefficients of variation (

) for mean ADC values in benign and malignant breast lesions.

https://doi.org/10.1371/journal.pone.0341201.t004

Finite N error: Results from Monte-Carlo- and kernel-size-simulations and PDF analysis

Table 5 summarizes the results from the Monte-Carlo simulations and the PDF-based analysis. The values obtained with the Monte-Carlo simulation closely matched those obtained from the PDF analysis. The mean Monte-Carlo-simulation-derived was 0.892, whereas the mean PDF-derived was 0.891. The minimal values were obtained for study 5 [17] ( = 0.646) and study 21 [33] ( = 0.683). The maximal was obtained for study 10 [22] ( = 0.995).

Download:

Table 5. Results of Monte Carlo simulations and probability density function (PDF)–based error assessments.

https://doi.org/10.1371/journal.pone.0341201.t005

The mean value of was 3.01%. Generally, a negative correlation was seen between and : higher tended to go with lower . The minimal was obtained for study 10 ( = 0.52%). The maximal values were obtained for study 21 (= 6.16%) and study 7 [19] ( = 6.47%).

For the in-house study, both the results for normal PDFs and the kernel density derived PDFs are stated in Table 5. The difference between the two approaches is small (e.g., 875 vs 868).

Precision error: Results of the probability density function analysis

Table 5 also summarizes the values obtained from the PDF analysis. The mean value was 1.00%. The minimal was obtained for study 21 [33] ( = 0.49%). The maximal was obtained for study 9 [21] (= 2.28%).

Comparison of precision and finite N errors

Fig 5a shows and for the published studies and the in-house study as a bar plot.

Download:

Fig 5. Comparison of finite N and precision errors across all studies.

a) Bar plot of both error types, and . IH-KD = In-house study with kernel density PDF. IH-G = In-house study with Gaussian PDF approach. b) Respective scatter plot showing their relationship. Selected studies are labeled to illustrate representative positions across the range of finite N and precision errors, including our in-house study, outliers, and studies near the error-equivalence line. Square = IH-KD. Diamond = IH-G. c) Required sample size for equal contribution of both errors is plotted. , assuming a scaling of . = area under the curve; IH = in-house; std = standard deviation; = error due to finite N; = error due to imprecision.

https://doi.org/10.1371/journal.pone.0341201.g005

Fig 5b shows a scatter plot of the two errors. The finite N error is dominant (i.e., ) for all studies except studies 10 [22] and 16 [28]. Note that study 16 had the largest . Study 10 had the lowest and values that originated from the rather small standard deviations and giving rise to a very high . In this case, the reported class separation was extremely good, so the variation in due to the finite sample size became very small.

For the in-house study, 2.14% and 0.94% with the kernel density PDF approach and 2.11% and 0.94% with the normal PDF approach. Due to the similarity of these values, the two data points lie closely together in Fig 5b (the gray-filled square and diamond). For some studies, came close to . For other studies, . For example, study 7 [19] stood out, with a very large ratio between 6.47% and 0.65%.

Fig 5c shows the sample size needed for an equality of the two errors. The bars have different colors: whenever , the blue part represents . The orange part represents . Thus, the total height of the bars represents . Whenever , the green part represents and the white part represents . Thus, the total height of the bars represents .

The y-axis is cut at 2000, since some studies exhibited very large . For example, study 21 [33] stood out again, with 18,000. The median of among all studies was 932.

Dependence of the sample sized needed for equivalence of errors on the coefficient of variation

Fig 6 shows the dependency of on the . The fit is in agreement with the dependency predicted by Eq. 33. This illustrates that even small improvements in measurement precision (i.e., lower ) can lead to a disproportionately large reduction in the required sample size.

Download:

Fig 6. Dependency of the required sample size for error equivalence on the coefficient of variation.

The median of among all studies for several values (%) was fit by a power law fit. The curve shows that small changes in lead to large increases in the sample size required to balance finite N and precision errors. = coefficient of variation.

https://doi.org/10.1371/journal.pone.0341201.g006

Discussion

In this study, we investigated the size of two common error types in DWI ADC-based assessments of breast lesions: finite N errors represented by the standard error of the AUC and precision errors represented by . For the in-house study and the 24 considered published studies, we generally found with two exceptions (studies 7 [19] and 16 [28]). The median sample size of the considered studies was 109. The median sample size for which was found to be 887 under the assumption that scales like (Eq. 28).

In this scenario of a dominant finite N error, the preferred action would thus generally be to increase , if possible. In practice, the preferred action will depend on a variety of factors. Naturally, one would strive to minimize all sources of error as much as possible. However, a certain amount of error reduction may be associated with varying costs. Here, “cost” subsumes not only monetary costs but also other factors such as the burden that patients must face, for example, due to an increased scan time. Considering the sample size, the costs will usually increase linearly with . Unfortunately, the associated standard deviation decreases only with (see Eq. 28). Thus, the cost of improving will generally scale like , which can quickly become insurmountable. In addition, going beyond a certain becomes increasingly ineffective. For example, using N = 1000 for our in-house study would create a situation where the finite N error is no longer dominant. Then, investing time and effort in increasing the sample size would potentially not be as useful as spending resources to improve precision. For example, if resources are available, one could use a scanner with a higher field strength that provides higher Signal-to-Noise ratio [47], employ a receiver coil with more channels [48], use field probes to improve the image quality [49], invest in more elaborate, computationally demanding sequence and image reconstruction approaches [50,51], or allow more readers to evaluate the data. The most desirable action will depend heavily on the circumstances (e.g., availability of scanners, computation power, or availability of readers). Moreover, retrospective analyses of data available in a database, as in our in-house study, will naturally be assessed differently than prospective studies that involve the acquisition of new data. Generally, it will become increasingly difficult to reduce a certain error type, for example, the ; thus, a certain level of error may have to be accepted [38,52,53].

An evaluation such as that shown in Figs 5 and 6 can nonetheless help to guide one’s decisions and may be useful under different circumstances. For example, in the preparation phase of a prospective study, one usually performs sample size estimation and considers the effect size, the estimated measurement variability, the desired statistical power, the significance criterion, and the intended type of analysis (e.g., one- or two-tailed) [52,54–60]. Such an analysis could be supported by a joint consideration of the other errors to sharpen the judgment. For example, if one is not in the finite N error-dominated space (e.g., above the lines in Fig 5b), one might argue for reducing the sample size (which may have been inflated due to unrealistic expectations applied to its determination).

Similar to traditional power analyses, an evaluation such as that shown in Figs 5 and 6 could be used for sample size planning. This would include the following steps. First, retrieve the means and standard deviations of the two classes under consideration (e.g., benign and malignant) from the literature or from a pilot study and use them to compute the normalized standard deviations and (Eqs. 3 and 4). Second, obtain an estimate of the coefficient of variation from the literature or a pilot study and normalize it (see Eq. 5). Third, assume normality of the PDFs so that Eqs. 12 and 18 can be used to compute the decrease in AUC, (Eq. 19), that arises from limited precision. Our finding that Gaussian PDFs yielded essentially the same results as kernel-density-based PDFs (square and diamond markers in Fig 5b) supports the general use of Gaussian PDFs, although further research may be warranted. Fourth, run a Monte Carlo simulation (Eqs. 6 and 7) to determine the standard error of the AUC, , for the anticipated sample size, or for a range of anticipated sample sizes. Fifth, compare and to determine the relative magnitude of these two errors and whether the precision error or the finite-N error is dominant. If the aim is to perform sample size planning, one potential approach is to set the anticipated sample size to (Eq. 29). Such a pre-study analysis might help to avoid an insufficient sample size. We provide MATLAB code for such an analysis (see S2 File).

In a post hoc analysis, it might be worthwhile to make a judgment based on the sizes of various errors (finite N, imprecision) to better estimate the trustworthiness of the obtained results. If the finite N error is dominant, researchers might mention this as a relevant caveat in the limitations section of their study. Importantly, our finding that the finite N error was dominant in most of the literature studies does not invalidate the respective study results (nor those of other studies, for example in the field of quantitative imaging). It is rather a call for considering the finite N error and reporting the estimated .

The scaling law is intriguing. It predicts that increases quickly as the decreases. This entails the need for caution when interpreting the found in our (or any) analysis. A small change in can lead to substantially different values. This is important because the reported values differ substantially among the available publications (see Table 3). For the smallest reported of 3.2% [41], our analysis yields 28,000, a number that is hardly ever reached in clinical MRI studies [5]. For the largest reported of 20% [42], our analysis yields 19, a sample size much smaller than is used in typical MRI studies [5]. Thus, an improve in using better methodology will generally lead to much larger values (which may justify the use of larger sample sizes).

We only considered one use case, DWI of the breast involving ADC-based characterizations of the lesion type. However, considering that the size of errors is of similar magnitude in applications of DWI to other disease types [61,62], we suspect that our findings generalize to most other DWI studies and presumably also to many MRI studies in general, wherein finite N errors are likely to be dominant.

Our work has several limitations. For simplicity, we only assumed Gaussian distributions for the ADC values reported in the published studies. This limitation could be overcome if the circumstances demand it, potentially by using numerical approaches and retrieving the individual data points from a published study. Some studies could not be included because some key parameters such as standard deviations or coefficients of variation were not reported. This highlights a potential broader issue in the field, where insufficient reporting of variability may limit reproducibility and secondary analyses. Moreover, we focused on precision errors; inaccuracy errors could be treated similarly by calculating from . Our focus on precision errors is a limitation, as it ignores potential inaccuracy errors, e.g., systematic deviations between measured and true ADC values. While these were assumed negligible, future studies could explicitly model both error types, for example using phantom calibrations or multicenter datasets to assess inter-scanner bias. Another limitation of this study is that literature data were modeled assuming normal distributions. However, this reflects data availability and was chosen to preserve distributional features where individual data were accessible.

In conclusion, we found that finite N errors were generally dominant in the analyzed test case of DWI of the female breast. Based on our study, it appears worthwhile to consider the relative magnitudes of various error types when planning or evaluating studies involving quantitative parameters.

Supporting information

S1 File. Additional information.

https://doi.org/10.1371/journal.pone.0341201.s001

(DOCX)

S2 File. Matlab code used to generate Figs 4–6, and supporting figures.

https://doi.org/10.1371/journal.pone.0341201.s002

(M)

S3 File. Matlab code to perform your own analysis.

https://doi.org/10.1371/journal.pone.0341201.s003

(M)

S4 File. Function for ROC analysis.

https://doi.org/10.1371/journal.pone.0341201.s004

(M)

S5 File. ADC values of in-house study.

https://doi.org/10.1371/journal.pone.0341201.s005

(MAT)

S6 File. Function AUC from PDFs.

https://doi.org/10.1371/journal.pone.0341201.s006

(M)

Acknowledgments

This present work was performed by the first author, J.V.E., in fulfillment of the requirements for the degree “Dr. med.” at the Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU). Chat GPT-4-turbo was partly used to improve the manuscript text with the command “Improve the text: …”. The text was also reviewed by Cambridge Proofreading and Editing LLC (Editor: H. Subbaraman, PhD).

References

1. Schuler A. Designing efficient randomized trials: Power and sample size calculation when using semiparametric efficient estimators. Int J Biostat. 2021;18(1):151–71. pmid:34364314
- View Article
- PubMed/NCBI
- Google Scholar
2. Hanspach J, Nagel AM, Hensel B, Uder M, Koros L, Laun FB. Sample size estimation: Current practice and considerations for original investigations in MRI technical development studies. Magn Reson Med. 2021;85(4):2109–16. pmid:33058265
- View Article
- PubMed/NCBI
- Google Scholar
3. Zhou Y, Wang D, Dong X, Zhang H, Huo H, Zhang Y. Sample size estimation in acupuncture imaging research. Zhongguo Zhen Jiu. 2024;44(1):34–8. pmid:38191156
- View Article
- PubMed/NCBI
- Google Scholar
4. Di Leo G, Sardanelli F. Statistical significance: p value, 0.05 threshold, and applications to radiomics-reasons for a conservative approach. Eur Radiol Exp. 2020;4(1):18. pmid:32157489
- View Article
- PubMed/NCBI
- Google Scholar
5. Bögerl CM, Laun FB, Nagel AM, Bickelhaupt S, Uder M, Hanspach J. Analysis of the sample size used in clinical MRI studies. PLoS One. 2025;20(3):e0316611. pmid:40029860
- View Article
- PubMed/NCBI
- Google Scholar
6. Taylor JR. An introduction to error analysis: The study of uncertainties in physical measurements. University Science Books. 1996.
7. Partridge SC, Mullins CD, Kurland BF, Allain MD, DeMartini WB, Eby PR, et al. Apparent diffusion coefficient values for discriminating benign and malignant breast MRI lesions: Effects of lesion type and size. AJR Am J Roentgenol. 2010;194(6):1664–73. pmid:20489111
- View Article
- PubMed/NCBI
- Google Scholar
8. Partridge SC, McDonald ES. Diffusion weighted magnetic resonance imaging of the breast. Magnetic Resonance Imaging Clinics of North America. 2013;21(3):601–24.
- View Article
- Google Scholar
9. Obuchowski NA, Buckler AJ. Estimating the precision of quantitative imaging biomarkers without test-retest studies. Acad Radiol. 2022;29(4):543–9. pmid:34272163
- View Article
- PubMed/NCBI
- Google Scholar
10. Pierce TT, Sirlin CB, Fowler KJ, Buckler AJ, Hall TJ, Obuchowski NA. Understanding repeatability and reproducibility coefficients for quantitative imaging biomarkers. Radiology. 2025;316(2):e250279. pmid:40793946
- View Article
- PubMed/NCBI
- Google Scholar
11. Li H, Gatsonis C. Sample size estimation for time-dependent receiver operating characteristic. Stat Med. 2014;33(6):958–70. pmid:24123273
- View Article
- PubMed/NCBI
- Google Scholar
12. Kuhl CK, Mielcareck P, Klaschik S, Leutner C, Wardelmann E, Gieseke J, et al. Dynamic breast MR imaging: Are signal intensity time course data useful for differential diagnosis of enhancing lesions?. Radiology. 1999;211(1):101–10. pmid:10189459
- View Article
- PubMed/NCBI
- Google Scholar
13. Yadav P, Harit S, Kumar D. Efficacy of high-resolution, 3-D diffusion-weighted imaging in the detection of breast cancer compared to dynamic contrast-enhanced magnetic resonance imaging. Pol J Radiol. 2021;86:e277–86. pmid:34136045
- View Article
- PubMed/NCBI
- Google Scholar
14. Duran B, Agridag Ucpinar B. Four different apparent diffusion coefficient measurement methods in breast masses. J Coll Physicians Surg Pak. 2021;31(9):1024–9.
- View Article
- Google Scholar
15. Ohlmeyer S, Laun FB, Palm T, Janka R, Weiland E, Uder M, et al. Simultaneous multislice Echo planar imaging for accelerated diffusion-weighted imaging of malignant and benign breast lesions. Invest Radiol. 2019;54(8):524–30. pmid:30946181
- View Article
- PubMed/NCBI
- Google Scholar
16. Kul S, Metin Y, Kul M, Metin N, Eyuboglu I, Ozdemir O. Assessment of breast mass morphology with diffusion-weighted MRI: Beyond apparent diffusion coefficient. J Magn Reson Imaging. 2018;48(6):1668–77. pmid:29734493
- View Article
- PubMed/NCBI
- Google Scholar
17. Chen Y, Wu B, Liu H, Wang D, Gu Y. Feasibility study of dual parametric 2D histogram analysis of breast lesions with dynamic contrast-enhanced and diffusion-weighted MRI. J Transl Med. 2018;16(1):325. pmid:30470241
- View Article
- PubMed/NCBI
- Google Scholar
18. Zhang M, Horvat JV, Bernard-Davila B, Marino MA, Leithner D, Ochoa-Albiztegui RE, et al. Multiparametric MRI model with dynamic contrast-enhanced and diffusion-weighted imaging enables breast cancer diagnosis with high accuracy. J Magn Reson Imaging. 2019;49(3):864–74. pmid:30375702
- View Article
- PubMed/NCBI
- Google Scholar
19. Fan WX, Chen XF, Cheng FY, Cheng YB, Xu T, Zhu WB, et al. Retrospective analysis of the utility of multiparametric MRI for differentiating between benign and malignant breast lesions in women in China. Medicine (Baltimore). 2018;97(4):e9666. pmid:29369183
- View Article
- PubMed/NCBI
- Google Scholar
20. An YY, Kim SH, Kang BJ. Differentiation of malignant and benign breast lesions: Added value of the qualitative analysis of breast lesions on diffusion-weighted imaging (DWI) using readout-segmented echo-planar imaging at 3.0 T. PLoS One. 2017;12(3):e0174681. pmid:28358833
- View Article
- PubMed/NCBI
- Google Scholar
21. Liu H-L, Zong M, Wei H, Lou J-J, Wang S-Q, Zou Q-G, et al. Preoperative predicting malignancy in breast mass-like lesions: Value of adding histogram analysis of apparent diffusion coefficient maps to dynamic contrast-enhanced magnetic resonance imaging for improving confidence level. Br J Radiol. 2017;90(1079):20170394. pmid:28876982
- View Article
- PubMed/NCBI
- Google Scholar
22. Yamaguchi K, Nakazono T, Egashira R, Komori Y, Nakamura J, Noguchi T, et al. Diagnostic performance of diffusion tensor imaging with readout-segmented echo-planar imaging for invasive breast cancer: correlation of ADC and FA with pathological prognostic markers. Magn Reson Med Sci. 2017;16(3):245–52. pmid:27853053
- View Article
- PubMed/NCBI
- Google Scholar
23. Teruel JR, Goa PE, Sjøbakk TE, Østlie A, Fjøsne HE, Bathen TF. A simplified approach to measure the effect of the microvasculature in diffusion-weighted mr imaging applied to breast tumors: Preliminary results. Radiology. 2016;281(2):373–81. pmid:27128662
- View Article
- PubMed/NCBI
- Google Scholar
24. Jiang R, Zeng X, Sun S, Ma Z, Wang X. Assessing detection, discrimination, and risk of breast cancer according to anisotropy parameters of diffusion tensor imaging. Med Sci Monit. 2016;22:1318–28. pmid:27094307
- View Article
- PubMed/NCBI
- Google Scholar
25. Onaygil C, Kaya H, Ugurlu MU, Aribal E. Diagnostic performance of diffusion tensor imaging parameters in breast cancer and correlation with the prognostic factors. J Magn Reson Imaging. 2017;45(3):660–72. pmid:27661775
- View Article
- PubMed/NCBI
- Google Scholar
26. Akın Y, Uğurlu MÜ, Kaya H, Arıbal E. Diagnostic value of diffusion-weighted imaging and apparent diffusion coefficient values in the differentiation of breast lesions, histpathologic subgroups and correlatıon with prognostıc factors using 3.0 Tesla MR. J Breast Health. 2016;12(3):123–32. pmid:28331748
- View Article
- PubMed/NCBI
- Google Scholar
27. Spick C, Pinker-Domenig K, Rudas M, Helbich TH, Baltzer PA. MRI-only lesions: Application of diffusion-weighted imaging obviates unnecessary MR-guided breast biopsies. Eur Radiol. 2014;24(6):1204–10. pmid:24706105
- View Article
- PubMed/NCBI
- Google Scholar
28. Sharma U, Sah RG, Agarwal K, Parshad R, Seenu V, Mathur SR, et al. Potential of Diffusion-Weighted Imaging in the Characterization of Malignant, Benign, and Healthy Breast Tissues and Molecular Subtypes of Breast Cancer. Front Oncol. 2016;6:126. pmid:27242965
- View Article
- PubMed/NCBI
- Google Scholar
29. Ertas G, Onaygil C, Akin Y, Kaya H, Aribal E. Quantitative differentiation of breast lesions at 3T diffusion-weighted imaging (DWI) using the ratio of distributed diffusion coefficient (DDC). J Magn Reson Imaging. 2016;44(6):1633–41. pmid:27284961
- View Article
- PubMed/NCBI
- Google Scholar
30. Sun K, Chen X, Chai W, Fei X, Fu C, Yan X, et al. Breast Cancer: Diffusion Kurtosis MR Imaging-Diagnostic Accuracy and Correlation with Clinical-Pathologic Factors. Radiology. 2015;277(1):46–55. pmid:25938679
- View Article
- PubMed/NCBI
- Google Scholar
31. Teruel JR, Goa PE, Sjøbakk TE, Østlie A, Fjøsne HE, Bathen TF. Diffusion weighted imaging for the differentiation of breast tumors: From apparent diffusion coefficient to high order diffusion tensor imaging. J Magn Reson Imaging. 2016;43(5):1111–21. pmid:26494124
- View Article
- PubMed/NCBI
- Google Scholar
32. Yoo H, Shin HJ, Baek S, Cha JH, Kim H, Chae EY, et al. Diagnostic performance of apparent diffusion coefficient and quantitative kinetic parameters for predicting additional malignancy in patients with newly diagnosed breast cancer. Magn Reson Imaging. 2014;32(7):867–74. pmid:24907855
- View Article
- PubMed/NCBI
- Google Scholar
33. Satake H, Nishio A, Ikeda M, Ishigaki S, Shimamoto K, Hirano M, et al. Predictive value for malignancy of suspicious breast masses of BI-RADS categories 4 and 5 using ultrasound elastography and MR diffusion-weighted imaging. AJR Am J Roentgenol. 2011;196(1):202–9. pmid:21178068
- View Article
- PubMed/NCBI
- Google Scholar
34. Inoue K, Kozawa E, Mizukoshi W, Tanaka J, Saeki T, Sakurai T, et al. Usefulness of diffusion-weighted imaging of breast tumors: quantitative and visual assessment. Jpn J Radiol. 2011;29(6):429–36. pmid:21786099
- View Article
- PubMed/NCBI
- Google Scholar
35. Bogner W, Gruber S, Pinker K, Grabner G, Stadlbauer A, Weber M, et al. Diffusion-weighted MR for differentiation of breast lesions at 3.0 T: How does selection of diffusion protocols affect diagnosis?. Radiology. 2009;253(2):341–51. pmid:19703869
- View Article
- PubMed/NCBI
- Google Scholar
36. Tozaki M, Fukuma E. 1H MR spectroscopy and diffusion-weighted imaging of the breast: Are they useful tools for characterizing breast lesions before biopsy?. AJR Am J Roentgenol. 2009;193(3):840–9. pmid:19696300
- View Article
- PubMed/NCBI
- Google Scholar
37. Jerome NP, Vidić I, Egnell L, Sjøbakk TE, Østlie A, Fjøsne HE, et al. Understanding diffusion-weighted MRI analysis: Repeatability and performance of diffusion models in a benign breast lesion cohort. NMR Biomed. 2021;34(7):e4508. pmid:33738878
- View Article
- PubMed/NCBI
- Google Scholar
38. Newitt DC, Amouzandeh G, Partridge SC, Marques HS, Herman BA, Ross BD, et al. Repeatability and Reproducibility of ADC Histogram Metrics from the ACRIN 6698 Breast Cancer Therapy Response Trial. Tomography. 2020;6(2):177–85. pmid:32548294
- View Article
- PubMed/NCBI
- Google Scholar
39. Newitt DC, Zhang Z, Gibbs JE, Partridge SC, Chenevert TL, Rosen MA, et al. Test-retest repeatability and reproducibility of ADC measures by breast DWI: Results from the ACRIN 6698 trial. J Magn Reson Imaging. 2019;49(6):1617–28. pmid:30350329
- View Article
- PubMed/NCBI
- Google Scholar
40. de Almeida JRM, Gomes AB, Barros TP, Fahel PE, Rocha M de S. Diffusion-weighted imaging of suspicious (BI-RADS 4) breast lesions: Stratification based on histopathology. Radiol Bras. 2017;50(3):154–61. pmid:28670026
- View Article
- PubMed/NCBI
- Google Scholar
41. Spick C, Bickel H, Pinker K, Bernathova M, Kapetas P, Woitek R, et al. Diffusion-weighted MRI of breast lesions: A prospective clinical investigation of the quantitative imaging biomarker characteristics of reproducibility, repeatability, and diagnostic accuracy. NMR Biomed. 2016;29(10):1445–53. pmid:27553252
- View Article
- PubMed/NCBI
- Google Scholar
42. Aliu SO, Jones EF, Azziz A, Kornak J, Wilmes LJ, Newitt DC, et al. Repeatability of quantitative MRI measurements in normal breast tissue. Transl Oncol. 2014;7(1):130–7. pmid:24772216
- View Article
- PubMed/NCBI
- Google Scholar
43. Mürtz P, Tsesarskiy M, Kowal A, Träber F, Gieseke J, Willinek WA, et al. Diffusion-weighted magnetic resonance imaging of breast lesions: The influence of different fat-suppression techniques on quantitative measurements and their reproducibility. Eur Radiol. 2014;24(10):2540–51. pmid:24898097
- View Article
- PubMed/NCBI
- Google Scholar
44. Tagliafico A, Rescinito G, Monetti F, Villa A, Chiesa F, Fisci E, et al. Diffusion tensor magnetic resonance imaging of the normal breast: Reproducibility of DTI-derived fractional anisotropy and apparent diffusion coefficient at 3.0 T. Radiol Med. 2012;117(6):992–1003. pmid:22580812
- View Article
- PubMed/NCBI
- Google Scholar
45. Partridge SC, Murthy RS, Ziadloo A, White SW, Allison KH, Lehman CD. Diffusion tensor magnetic resonance imaging of the normal breast. Magn Reson Imaging. 2010;28(3):320–8. pmid:20061111
- View Article
- PubMed/NCBI
- Google Scholar
46. Partridge SC, McKinnon GC, Henry RG, Hylton NM. Menstrual cycle variation of apparent diffusion coefficients measured in the normal breast using MRI. J Magn Reson Imaging. 2001;14(4):433–8. pmid:11599068
- View Article
- PubMed/NCBI
- Google Scholar
47. Korteweg MA, Veldhuis WB, Visser F, Luijten PR, Mali WPTM, van Diest PJ, et al. Feasibility of 7 Tesla breast magnetic resonance imaging determination of intrinsic sensitivity and high-resolution magnetic resonance imaging, diffusion-weighted imaging, and (1)H-magnetic resonance spectroscopy of breast cancer patients receiving neoadjuvant therapy. Invest Radiol. 2011;46(6):370–6. pmid:21317792
- View Article
- PubMed/NCBI
- Google Scholar
48. Del Bosque R, Cui J, Ogier S, Cheshkov S, Dimitrov IE, Malloy C, et al. A 32-channel receive array coil for bilateral breast imaging and spectroscopy at 7T. Magn Reson Med. 2021;85(1):551–9. pmid:32820540
- View Article
- PubMed/NCBI
- Google Scholar
49. Wilm BJ, Nagy Z, Barmet C, Vannesjo SJ, Kasper L, Haeberlin M, et al. Diffusion MRI with concurrent magnetic field monitoring. Magn Reson Med. 2015;74(4):925–33. pmid:26183218
- View Article
- PubMed/NCBI
- Google Scholar
50. Zhao Y, Yi Z, Xiao L, Lau V, Liu Y, Zhang Z, et al. Joint denoising of diffusion-weighted images via structured low-rank patch matrix approximation. Magn Reson Med. 2022;88(6):2461–74. pmid:36178232
- View Article
- PubMed/NCBI
- Google Scholar
51. Pan Z, Ma X, Dai E, Auerbach EJ, Guo H, Uğurbil K, et al. Reconstruction for 7T high-resolution whole-brain diffusion MRI using two-stage N/2 ghost correction and L1-SPIRiT without single-band reference. Magn Reson Med. 2023;89(5):1915–30. pmid:36594439
- View Article
- PubMed/NCBI
- Google Scholar
52. Ge X, Quirk JD, Engelbach JA, Bretthorst GL, Li S, Shoghi KI, et al. Test-retest performance of a 1-hour multiparametric mr image acquisition pipeline with orthotopic triple-negative breast cancer patient-derived tumor xenografts. Tomography. 2019;5(3):320–31. pmid:31572793
- View Article
- PubMed/NCBI
- Google Scholar
53. Jerome NP, Miyazaki K, Collins DJ, Orton MR, d’Arcy JA, Wallace T, et al. Repeatability of derived parameters from histograms following non-Gaussian diffusion modelling of diffusion-weighted imaging in a paediatric oncological cohort. Eur Radiol. 2017;27(1):345–53. pmid:27003140
- View Article
- PubMed/NCBI
- Google Scholar
54. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143(1):29–36. pmid:7063747
- View Article
- PubMed/NCBI
- Google Scholar
55. Eng J. Sample size estimation: How many individuals should be studied?. Radiology. 2003;227(2):309–13. pmid:12732691
- View Article
- PubMed/NCBI
- Google Scholar
56. Kang H. Sample size determination and power analysis using the G*Power software. J Educ Eval Health Prof. 2021;18:17. pmid:34325496
- View Article
- PubMed/NCBI
- Google Scholar
57. Keller A, Conradi J, Weber C, Failing K, Wergin M. Efficacy of Nx4 to Reduce Plasma Cortisol and Gastrin Levels in Norwegian Sled Dogs During an Exercise Induced Stress Response: A Prospective, Randomized, Double Blinded, Placebo-Controlled Cohort Study. Front Vet Sci. 2021;8:741459. pmid:34765666
- View Article
- PubMed/NCBI
- Google Scholar
58. Loi E, Zavattari C, Tommasi A, Moi L, Canale M, Po A, et al. HOXD8 hypermethylation as a fully sensitive and specific biomarker for biliary tract cancer detectable in tissue and bile samples. Br J Cancer. 2022;126(12):1783–94. pmid:35177798
- View Article
- PubMed/NCBI
- Google Scholar
59. Kazi SA, Siddiqui M, Majid S. Stroke outcome prediction using admission nihss in anterior and posterior circulation stroke. J Ayub Med Coll Abbottabad. 2021;33(2):274–8. pmid:34137544
- View Article
- PubMed/NCBI
- Google Scholar
60. Cohn ER, Qian T, Murphy SA. Sample size considerations for micro-randomized trials with binary proximal outcomes. Stat Med. 2023;42(16):2777–96. pmid:37094566
- View Article
- PubMed/NCBI
- Google Scholar
61. Ay H, Arsava EM, Vangel M, Oner B, Zhu M, Wu O. Interexaminer difference in infarct volume measurements on MRI. Stroke. 2008;39(4):1171–6.
- View Article
- Google Scholar
62. Kim WH, Adluru N, Chung MK, Charchut S, GadElkarim JJ, Altshuler L. Multi-resolutional brain network filtering and analysis via wavelets on non-Euclidean space. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2013, 2013.
- View Article
- Google Scholar

[ref1] 1. Schuler A. Designing efficient randomized trials: Power and sample size calculation when using semiparametric efficient estimators. Int J Biostat. 2021;18(1):151–71. pmid:34364314
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Hanspach J, Nagel AM, Hensel B, Uder M, Koros L, Laun FB. Sample size estimation: Current practice and considerations for original investigations in MRI technical development studies. Magn Reson Med. 2021;85(4):2109–16. pmid:33058265
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Zhou Y, Wang D, Dong X, Zhang H, Huo H, Zhang Y. Sample size estimation in acupuncture imaging research. Zhongguo Zhen Jiu. 2024;44(1):34–8. pmid:38191156
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Di Leo G, Sardanelli F. Statistical significance: p value, 0.05 threshold, and applications to radiomics-reasons for a conservative approach. Eur Radiol Exp. 2020;4(1):18. pmid:32157489
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Bögerl CM, Laun FB, Nagel AM, Bickelhaupt S, Uder M, Hanspach J. Analysis of the sample size used in clinical MRI studies. PLoS One. 2025;20(3):e0316611. pmid:40029860
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Taylor JR. An introduction to error analysis: The study of uncertainties in physical measurements. University Science Books. 1996.

[ref7] 7. Partridge SC, Mullins CD, Kurland BF, Allain MD, DeMartini WB, Eby PR, et al. Apparent diffusion coefficient values for discriminating benign and malignant breast MRI lesions: Effects of lesion type and size. AJR Am J Roentgenol. 2010;194(6):1664–73. pmid:20489111
View Article
PubMed/NCBI
Google Scholar

[23] View Article

[24] PubMed/NCBI

[25] Google Scholar

[ref8] 8. Partridge SC, McDonald ES. Diffusion weighted magnetic resonance imaging of the breast. Magnetic Resonance Imaging Clinics of North America. 2013;21(3):601–24.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref9] 9. Obuchowski NA, Buckler AJ. Estimating the precision of quantitative imaging biomarkers without test-retest studies. Acad Radiol. 2022;29(4):543–9. pmid:34272163
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref10] 10. Pierce TT, Sirlin CB, Fowler KJ, Buckler AJ, Hall TJ, Obuchowski NA. Understanding repeatability and reproducibility coefficients for quantitative imaging biomarkers. Radiology. 2025;316(2):e250279. pmid:40793946
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref11] 11. Li H, Gatsonis C. Sample size estimation for time-dependent receiver operating characteristic. Stat Med. 2014;33(6):958–70. pmid:24123273
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref12] 12. Kuhl CK, Mielcareck P, Klaschik S, Leutner C, Wardelmann E, Gieseke J, et al. Dynamic breast MR imaging: Are signal intensity time course data useful for differential diagnosis of enhancing lesions?. Radiology. 1999;211(1):101–10. pmid:10189459
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref13] 13. Yadav P, Harit S, Kumar D. Efficacy of high-resolution, 3-D diffusion-weighted imaging in the detection of breast cancer compared to dynamic contrast-enhanced magnetic resonance imaging. Pol J Radiol. 2021;86:e277–86. pmid:34136045
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref14] 14. Duran B, Agridag Ucpinar B. Four different apparent diffusion coefficient measurement methods in breast masses. J Coll Physicians Surg Pak. 2021;31(9):1024–9.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref15] 15. Ohlmeyer S, Laun FB, Palm T, Janka R, Weiland E, Uder M, et al. Simultaneous multislice Echo planar imaging for accelerated diffusion-weighted imaging of malignant and benign breast lesions. Invest Radiol. 2019;54(8):524–30. pmid:30946181
View Article
PubMed/NCBI
Google Scholar

[53] View Article

[54] PubMed/NCBI

[55] Google Scholar

[ref16] 16. Kul S, Metin Y, Kul M, Metin N, Eyuboglu I, Ozdemir O. Assessment of breast mass morphology with diffusion-weighted MRI: Beyond apparent diffusion coefficient. J Magn Reson Imaging. 2018;48(6):1668–77. pmid:29734493
View Article
PubMed/NCBI
Google Scholar

[57] View Article

[58] PubMed/NCBI

[59] Google Scholar

[ref17] 17. Chen Y, Wu B, Liu H, Wang D, Gu Y. Feasibility study of dual parametric 2D histogram analysis of breast lesions with dynamic contrast-enhanced and diffusion-weighted MRI. J Transl Med. 2018;16(1):325. pmid:30470241
View Article
PubMed/NCBI
Google Scholar

[61] View Article

[62] PubMed/NCBI

[63] Google Scholar

[ref18] 18. Zhang M, Horvat JV, Bernard-Davila B, Marino MA, Leithner D, Ochoa-Albiztegui RE, et al. Multiparametric MRI model with dynamic contrast-enhanced and diffusion-weighted imaging enables breast cancer diagnosis with high accuracy. J Magn Reson Imaging. 2019;49(3):864–74. pmid:30375702
View Article
PubMed/NCBI
Google Scholar

[65] View Article

[66] PubMed/NCBI

[67] Google Scholar

[ref19] 19. Fan WX, Chen XF, Cheng FY, Cheng YB, Xu T, Zhu WB, et al. Retrospective analysis of the utility of multiparametric MRI for differentiating between benign and malignant breast lesions in women in China. Medicine (Baltimore). 2018;97(4):e9666. pmid:29369183
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref20] 20. An YY, Kim SH, Kang BJ. Differentiation of malignant and benign breast lesions: Added value of the qualitative analysis of breast lesions on diffusion-weighted imaging (DWI) using readout-segmented echo-planar imaging at 3.0 T. PLoS One. 2017;12(3):e0174681. pmid:28358833
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref21] 21. Liu H-L, Zong M, Wei H, Lou J-J, Wang S-Q, Zou Q-G, et al. Preoperative predicting malignancy in breast mass-like lesions: Value of adding histogram analysis of apparent diffusion coefficient maps to dynamic contrast-enhanced magnetic resonance imaging for improving confidence level. Br J Radiol. 2017;90(1079):20170394. pmid:28876982
View Article
PubMed/NCBI
Google Scholar

[77] View Article

[78] PubMed/NCBI

[79] Google Scholar

[ref22] 22. Yamaguchi K, Nakazono T, Egashira R, Komori Y, Nakamura J, Noguchi T, et al. Diagnostic performance of diffusion tensor imaging with readout-segmented echo-planar imaging for invasive breast cancer: correlation of ADC and FA with pathological prognostic markers. Magn Reson Med Sci. 2017;16(3):245–52. pmid:27853053
View Article
PubMed/NCBI
Google Scholar

[81] View Article

[82] PubMed/NCBI

[83] Google Scholar

[ref23] 23. Teruel JR, Goa PE, Sjøbakk TE, Østlie A, Fjøsne HE, Bathen TF. A simplified approach to measure the effect of the microvasculature in diffusion-weighted mr imaging applied to breast tumors: Preliminary results. Radiology. 2016;281(2):373–81. pmid:27128662
View Article
PubMed/NCBI
Google Scholar

[85] View Article

[86] PubMed/NCBI

[87] Google Scholar

[ref24] 24. Jiang R, Zeng X, Sun S, Ma Z, Wang X. Assessing detection, discrimination, and risk of breast cancer according to anisotropy parameters of diffusion tensor imaging. Med Sci Monit. 2016;22:1318–28. pmid:27094307
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref25] 25. Onaygil C, Kaya H, Ugurlu MU, Aribal E. Diagnostic performance of diffusion tensor imaging parameters in breast cancer and correlation with the prognostic factors. J Magn Reson Imaging. 2017;45(3):660–72. pmid:27661775
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref26] 26. Akın Y, Uğurlu MÜ, Kaya H, Arıbal E. Diagnostic value of diffusion-weighted imaging and apparent diffusion coefficient values in the differentiation of breast lesions, histpathologic subgroups and correlatıon with prognostıc factors using 3.0 Tesla MR. J Breast Health. 2016;12(3):123–32. pmid:28331748
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref27] 27. Spick C, Pinker-Domenig K, Rudas M, Helbich TH, Baltzer PA. MRI-only lesions: Application of diffusion-weighted imaging obviates unnecessary MR-guided breast biopsies. Eur Radiol. 2014;24(6):1204–10. pmid:24706105
View Article
PubMed/NCBI
Google Scholar

[101] View Article

[102] PubMed/NCBI

[103] Google Scholar

[ref28] 28. Sharma U, Sah RG, Agarwal K, Parshad R, Seenu V, Mathur SR, et al. Potential of Diffusion-Weighted Imaging in the Characterization of Malignant, Benign, and Healthy Breast Tissues and Molecular Subtypes of Breast Cancer. Front Oncol. 2016;6:126. pmid:27242965
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref29] 29. Ertas G, Onaygil C, Akin Y, Kaya H, Aribal E. Quantitative differentiation of breast lesions at 3T diffusion-weighted imaging (DWI) using the ratio of distributed diffusion coefficient (DDC). J Magn Reson Imaging. 2016;44(6):1633–41. pmid:27284961
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref30] 30. Sun K, Chen X, Chai W, Fei X, Fu C, Yan X, et al. Breast Cancer: Diffusion Kurtosis MR Imaging-Diagnostic Accuracy and Correlation with Clinical-Pathologic Factors. Radiology. 2015;277(1):46–55. pmid:25938679
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

[ref31] 31. Teruel JR, Goa PE, Sjøbakk TE, Østlie A, Fjøsne HE, Bathen TF. Diffusion weighted imaging for the differentiation of breast tumors: From apparent diffusion coefficient to high order diffusion tensor imaging. J Magn Reson Imaging. 2016;43(5):1111–21. pmid:26494124
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref32] 32. Yoo H, Shin HJ, Baek S, Cha JH, Kim H, Chae EY, et al. Diagnostic performance of apparent diffusion coefficient and quantitative kinetic parameters for predicting additional malignancy in patients with newly diagnosed breast cancer. Magn Reson Imaging. 2014;32(7):867–74. pmid:24907855
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref33] 33. Satake H, Nishio A, Ikeda M, Ishigaki S, Shimamoto K, Hirano M, et al. Predictive value for malignancy of suspicious breast masses of BI-RADS categories 4 and 5 using ultrasound elastography and MR diffusion-weighted imaging. AJR Am J Roentgenol. 2011;196(1):202–9. pmid:21178068
View Article
PubMed/NCBI
Google Scholar

[125] View Article

[126] PubMed/NCBI

[127] Google Scholar

[ref34] 34. Inoue K, Kozawa E, Mizukoshi W, Tanaka J, Saeki T, Sakurai T, et al. Usefulness of diffusion-weighted imaging of breast tumors: quantitative and visual assessment. Jpn J Radiol. 2011;29(6):429–36. pmid:21786099
View Article
PubMed/NCBI
Google Scholar

[129] View Article

[130] PubMed/NCBI

[131] Google Scholar

[ref35] 35. Bogner W, Gruber S, Pinker K, Grabner G, Stadlbauer A, Weber M, et al. Diffusion-weighted MR for differentiation of breast lesions at 3.0 T: How does selection of diffusion protocols affect diagnosis?. Radiology. 2009;253(2):341–51. pmid:19703869
View Article
PubMed/NCBI
Google Scholar

[133] View Article

[134] PubMed/NCBI

[135] Google Scholar

[ref36] 36. Tozaki M, Fukuma E. 1H MR spectroscopy and diffusion-weighted imaging of the breast: Are they useful tools for characterizing breast lesions before biopsy?. AJR Am J Roentgenol. 2009;193(3):840–9. pmid:19696300
View Article
PubMed/NCBI
Google Scholar

[137] View Article

[138] PubMed/NCBI

[139] Google Scholar

[ref37] 37. Jerome NP, Vidić I, Egnell L, Sjøbakk TE, Østlie A, Fjøsne HE, et al. Understanding diffusion-weighted MRI analysis: Repeatability and performance of diffusion models in a benign breast lesion cohort. NMR Biomed. 2021;34(7):e4508. pmid:33738878
View Article
PubMed/NCBI
Google Scholar

[141] View Article

[142] PubMed/NCBI

[143] Google Scholar

[ref38] 38. Newitt DC, Amouzandeh G, Partridge SC, Marques HS, Herman BA, Ross BD, et al. Repeatability and Reproducibility of ADC Histogram Metrics from the ACRIN 6698 Breast Cancer Therapy Response Trial. Tomography. 2020;6(2):177–85. pmid:32548294
View Article
PubMed/NCBI
Google Scholar

[145] View Article

[146] PubMed/NCBI

[147] Google Scholar

[ref39] 39. Newitt DC, Zhang Z, Gibbs JE, Partridge SC, Chenevert TL, Rosen MA, et al. Test-retest repeatability and reproducibility of ADC measures by breast DWI: Results from the ACRIN 6698 trial. J Magn Reson Imaging. 2019;49(6):1617–28. pmid:30350329
View Article
PubMed/NCBI
Google Scholar

[149] View Article

[150] PubMed/NCBI

[151] Google Scholar

[ref40] 40. de Almeida JRM, Gomes AB, Barros TP, Fahel PE, Rocha M de S. Diffusion-weighted imaging of suspicious (BI-RADS 4) breast lesions: Stratification based on histopathology. Radiol Bras. 2017;50(3):154–61. pmid:28670026
View Article
PubMed/NCBI
Google Scholar

[153] View Article

[154] PubMed/NCBI

[155] Google Scholar

[ref41] 41. Spick C, Bickel H, Pinker K, Bernathova M, Kapetas P, Woitek R, et al. Diffusion-weighted MRI of breast lesions: A prospective clinical investigation of the quantitative imaging biomarker characteristics of reproducibility, repeatability, and diagnostic accuracy. NMR Biomed. 2016;29(10):1445–53. pmid:27553252
View Article
PubMed/NCBI
Google Scholar

[157] View Article

[158] PubMed/NCBI

[159] Google Scholar

[ref42] 42. Aliu SO, Jones EF, Azziz A, Kornak J, Wilmes LJ, Newitt DC, et al. Repeatability of quantitative MRI measurements in normal breast tissue. Transl Oncol. 2014;7(1):130–7. pmid:24772216
View Article
PubMed/NCBI
Google Scholar

[161] View Article

[162] PubMed/NCBI

[163] Google Scholar

[ref43] 43. Mürtz P, Tsesarskiy M, Kowal A, Träber F, Gieseke J, Willinek WA, et al. Diffusion-weighted magnetic resonance imaging of breast lesions: The influence of different fat-suppression techniques on quantitative measurements and their reproducibility. Eur Radiol. 2014;24(10):2540–51. pmid:24898097
View Article
PubMed/NCBI
Google Scholar

[165] View Article

[166] PubMed/NCBI

[167] Google Scholar

[ref44] 44. Tagliafico A, Rescinito G, Monetti F, Villa A, Chiesa F, Fisci E, et al. Diffusion tensor magnetic resonance imaging of the normal breast: Reproducibility of DTI-derived fractional anisotropy and apparent diffusion coefficient at 3.0 T. Radiol Med. 2012;117(6):992–1003. pmid:22580812
View Article
PubMed/NCBI
Google Scholar

[169] View Article

[170] PubMed/NCBI

[171] Google Scholar

[ref45] 45. Partridge SC, Murthy RS, Ziadloo A, White SW, Allison KH, Lehman CD. Diffusion tensor magnetic resonance imaging of the normal breast. Magn Reson Imaging. 2010;28(3):320–8. pmid:20061111
View Article
PubMed/NCBI
Google Scholar

[173] View Article

[174] PubMed/NCBI

[175] Google Scholar

[ref46] 46. Partridge SC, McKinnon GC, Henry RG, Hylton NM. Menstrual cycle variation of apparent diffusion coefficients measured in the normal breast using MRI. J Magn Reson Imaging. 2001;14(4):433–8. pmid:11599068
View Article
PubMed/NCBI
Google Scholar

[177] View Article

[178] PubMed/NCBI

[179] Google Scholar

[ref47] 47. Korteweg MA, Veldhuis WB, Visser F, Luijten PR, Mali WPTM, van Diest PJ, et al. Feasibility of 7 Tesla breast magnetic resonance imaging determination of intrinsic sensitivity and high-resolution magnetic resonance imaging, diffusion-weighted imaging, and (1)H-magnetic resonance spectroscopy of breast cancer patients receiving neoadjuvant therapy. Invest Radiol. 2011;46(6):370–6. pmid:21317792
View Article
PubMed/NCBI
Google Scholar

[181] View Article

[182] PubMed/NCBI

[183] Google Scholar

[ref48] 48. Del Bosque R, Cui J, Ogier S, Cheshkov S, Dimitrov IE, Malloy C, et al. A 32-channel receive array coil for bilateral breast imaging and spectroscopy at 7T. Magn Reson Med. 2021;85(1):551–9. pmid:32820540
View Article
PubMed/NCBI
Google Scholar

[185] View Article

[186] PubMed/NCBI

[187] Google Scholar

[ref49] 49. Wilm BJ, Nagy Z, Barmet C, Vannesjo SJ, Kasper L, Haeberlin M, et al. Diffusion MRI with concurrent magnetic field monitoring. Magn Reson Med. 2015;74(4):925–33. pmid:26183218
View Article
PubMed/NCBI
Google Scholar

[189] View Article

[190] PubMed/NCBI

[191] Google Scholar

[ref50] 50. Zhao Y, Yi Z, Xiao L, Lau V, Liu Y, Zhang Z, et al. Joint denoising of diffusion-weighted images via structured low-rank patch matrix approximation. Magn Reson Med. 2022;88(6):2461–74. pmid:36178232
View Article
PubMed/NCBI
Google Scholar

[193] View Article

[194] PubMed/NCBI

[195] Google Scholar

[ref51] 51. Pan Z, Ma X, Dai E, Auerbach EJ, Guo H, Uğurbil K, et al. Reconstruction for 7T high-resolution whole-brain diffusion MRI using two-stage N/2 ghost correction and L1-SPIRiT without single-band reference. Magn Reson Med. 2023;89(5):1915–30. pmid:36594439
View Article
PubMed/NCBI
Google Scholar

[197] View Article

[198] PubMed/NCBI

[199] Google Scholar

[ref52] 52. Ge X, Quirk JD, Engelbach JA, Bretthorst GL, Li S, Shoghi KI, et al. Test-retest performance of a 1-hour multiparametric mr image acquisition pipeline with orthotopic triple-negative breast cancer patient-derived tumor xenografts. Tomography. 2019;5(3):320–31. pmid:31572793
View Article
PubMed/NCBI
Google Scholar

[201] View Article

[202] PubMed/NCBI

[203] Google Scholar

[ref53] 53. Jerome NP, Miyazaki K, Collins DJ, Orton MR, d’Arcy JA, Wallace T, et al. Repeatability of derived parameters from histograms following non-Gaussian diffusion modelling of diffusion-weighted imaging in a paediatric oncological cohort. Eur Radiol. 2017;27(1):345–53. pmid:27003140
View Article
PubMed/NCBI
Google Scholar

[205] View Article

[206] PubMed/NCBI

[207] Google Scholar

[ref54] 54. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143(1):29–36. pmid:7063747
View Article
PubMed/NCBI
Google Scholar

[209] View Article

[210] PubMed/NCBI

[211] Google Scholar

[ref55] 55. Eng J. Sample size estimation: How many individuals should be studied?. Radiology. 2003;227(2):309–13. pmid:12732691
View Article
PubMed/NCBI
Google Scholar

[213] View Article

[214] PubMed/NCBI

[215] Google Scholar

[ref56] 56. Kang H. Sample size determination and power analysis using the G*Power software. J Educ Eval Health Prof. 2021;18:17. pmid:34325496
View Article
PubMed/NCBI
Google Scholar

[217] View Article

[218] PubMed/NCBI

[219] Google Scholar

[ref57] 57. Keller A, Conradi J, Weber C, Failing K, Wergin M. Efficacy of Nx4 to Reduce Plasma Cortisol and Gastrin Levels in Norwegian Sled Dogs During an Exercise Induced Stress Response: A Prospective, Randomized, Double Blinded, Placebo-Controlled Cohort Study. Front Vet Sci. 2021;8:741459. pmid:34765666
View Article
PubMed/NCBI
Google Scholar

[221] View Article

[222] PubMed/NCBI

[223] Google Scholar

[ref58] 58. Loi E, Zavattari C, Tommasi A, Moi L, Canale M, Po A, et al. HOXD8 hypermethylation as a fully sensitive and specific biomarker for biliary tract cancer detectable in tissue and bile samples. Br J Cancer. 2022;126(12):1783–94. pmid:35177798
View Article
PubMed/NCBI
Google Scholar

[225] View Article

[226] PubMed/NCBI

[227] Google Scholar

[ref59] 59. Kazi SA, Siddiqui M, Majid S. Stroke outcome prediction using admission nihss in anterior and posterior circulation stroke. J Ayub Med Coll Abbottabad. 2021;33(2):274–8. pmid:34137544
View Article
PubMed/NCBI
Google Scholar

[229] View Article

[230] PubMed/NCBI

[231] Google Scholar

[ref60] 60. Cohn ER, Qian T, Murphy SA. Sample size considerations for micro-randomized trials with binary proximal outcomes. Stat Med. 2023;42(16):2777–96. pmid:37094566
View Article
PubMed/NCBI
Google Scholar

[233] View Article

[234] PubMed/NCBI

[235] Google Scholar

[ref61] 61. Ay H, Arsava EM, Vangel M, Oner B, Zhu M, Wu O. Interexaminer difference in infarct volume measurements on MRI. Stroke. 2008;39(4):1171–6.
View Article
Google Scholar

[237] View Article

[238] Google Scholar

[ref62] 62. Kim WH, Adluru N, Chung MK, Charchut S, GadElkarim JJ, Altshuler L. Multi-resolutional brain network filtering and analysis via wavelets on non-Euclidean space. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2013, 2013.
View Article
Google Scholar

[240] View Article

[241] Google Scholar

Figures

Abstract

Background

Methods

Results

Discussion

Introduction

Methods

Data acquisition for the in-house study

Literature search for breast diffusion-weighted imaging studies

Literature search for reported coefficients of variation

Finite N error: Assessment with Monte-Carlo simulations and kernel density estimations

Precision error: Analytical assessment using normal probability density functions

Additional precision error for the in-house study: Numerical assessment using probability density functions

Sample size needed for equivalence of errors

Dependence of the sample size needed for equivalence of errors on the coefficient of variation.

Availability of code

Results

Apparent diffusion coefficient values in benign and malignant breast lesions

In-house-study.

Literature search.

Literature search: Reported coefficients of variation

Finite N error: Results from Monte-Carlo- and kernel-size-simulations and PDF analysis

Precision error: Results of the probability density function analysis

Comparison of precision and finite N errors

Dependence of the sample sized needed for equivalence of errors on the coefficient of variation

Discussion

Supporting information

S1 File. Additional information.

S2 File. Matlab code used to generate Figs 4–6, and supporting figures.

S3 File. Matlab code to perform your own analysis.

S4 File. Function for ROC analysis.

S5 File. ADC values of in-house study.

S6 File. Function AUC from PDFs.

Acknowledgments

References