Figures
Abstract
Background
Spatial heterogeneity within bone marrow significantly affects diagnostic accuracy in human and veterinary medicine, where single-site sampling may fail to detect focal disease processes. However, optimal bone marrow sampling strategies for canine hematologic disease diagnosis remain unclear, representing a critical knowledge gap in veterinary diagnostic pathology. We hypothesized that multi-site bone marrow core sampling would provide superior diagnostic accuracy compared to current single-site sampling standards in canine patients. The primary aim was to evaluate diagnostic capture probability of sampling one to four bone marrow sites in dogs with suspected hematologic disease.
Methods and findings
Sixteen dogs with suspected hematologic disease underwent bone marrow trephine biopsies from four anatomical locations (bilateral proximal humerus and iliac crest) using the ARROW OnControl Powered Driver system. Two board-certified clinical pathologists independently evaluated 64 masked bone marrow samples. Diagnostic accuracy was assessed using truth set methodology, with statistical analysis including mixed-effects logistic regression and bootstrap confidence intervals. Multi-site core sampling significantly improved diagnostic capture probability. Moving from one to two sites increased diagnostic accuracy from 76.6% to 94.8% under the permissive rule (18.2% improvement, P < 0.0001), and from 28.1% to 47.9% when both pathologists agreed (19.8% improvement). Significant site-specific differences were observed in myeloid-to-erythroid ratios and megakaryocyte counts. Overall, 13.3% of samples were nondiagnostic, with modest inter-pathologist agreement (κ = 0.30, 42% agreement).
Citation: Smiley KM, Connolly SL, Raskin R, Rosser MF, Schnelle AN, Lopez-Villalobos N, et al. (2026) Multi-site bone marrow core biopsy improves diagnostic accuracy in dogs with hematologic disease. PLoS One 21(2): e0343251. https://doi.org/10.1371/journal.pone.0343251
Editor: Zivanai Cuthbert Chapanduka, Stellenbosch University Faculty of Medicine and Health Sciences, SOUTH AFRICA
Received: September 17, 2025; Accepted: February 3, 2026; Published: February 25, 2026
Copyright: © 2026 Smiley et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data used for analysis are available as a supplementary file (SUPP TABLE 1).
Funding: The authors received funding for this work from the University of Illinois Urbana-Champaign Companion Animal Research Grant, Wayne and Josephine Spangler Fund. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors declare that no competing interests exist.
Abbreviations: BM, Bone marrow; Bx Site, Biopsy site; CBC, Complete blood count; CI, Confidence interval; GLIMMIX, Generalized linear mixed model; ICC, Intraclass correlation coefficient; LH, Left humerus; LI, Left iliac crest; MER, Myeloid-to-erythroid ratio; OR, Odds ratio; RI, Right iliac crest; RH, Right humerus; SE, Standard error.
Introduction
Bone marrow evaluation represents a cornerstone diagnostic procedure in veterinary hematology, providing essential insights into hematopoietic disorders that cannot be adequately assessed through peripheral blood analysis alone. [1–3] In dogs, bone marrow examination is primarily indicated for investigating persistent cytopenias, abnormal cell morphologies, suspected neoplastic processes, and unexplained hematologic abnormalities.[1,4–11] The procedure involves both aspiration and core biopsy collection, with samples typically obtained from anatomically accessible sites, including the proximal humerus, iliac crest, proximal femur, and sternum. [12–14] Despite its clinical importance, current veterinary practice predominantly relies on single-site sampling protocols, which may inadequately capture the full diagnostic potential of bone marrow evaluation due to inherent spatial heterogeneity within hematopoietic tissues. [3,6,7,10,15]
Recent advances in human medicine have increasingly recognized the phenomenon of spatial heterogeneity within bone marrow, where cellular composition, disease processes, and pathological features can vary significantly across different anatomical locations within the same patient. [16–18] This heterogeneity has profound implications for diagnostic accuracy, as single-site sampling may fail to detect focal disease processes or provide representative samples of the overall marrow status. [19] In multiple myeloma, studies have demonstrated that malignant plasma cells exhibit spatially restricted distributions, with significant genomic and phenotypic differences observed between bone marrow sites and focal lesions. [20,21] Similarly, investigations using spatial transcriptomics have revealed complex microenvironmental gradients within bone marrow, with distinct cellular niches and signaling domains that support different hematopoietic populations. [22]
The limitations of single-site sampling have been highlighted in human hematologic oncology, where a limited number of studies demonstrated that multi-region sampling can improve the detection of focal or spatially heterogeneous disease compared to conventional single-site methods. [23] These findings support the broader understanding that hematopoietic disorders may exhibit regionally distributed lesions rather than uniform involvement, suggesting that multi-site sampling could enhance diagnostic accuracy in selected human conditions. [24,25] In veterinary hematology, however, single-site sampling protocols remain standard practice, [10] and the potential diagnostic benefit of multi-site approaches for canine bone marrow evaluation has not been systematically investigated. [12,13,26]
The diagnostic accuracy of bone marrow evaluation is further complicated by technical factors including sample quality, cellular composition variability, and inter-observer interpretation differences. [1,12,13,27] Studies in both human and veterinary medicine have reported significant variability in sample adequacy, with non-diagnostic rates ranging from 8–15% depending on sampling technique and anatomical location. [12,13] Additionally, site-specific differences in cellular composition, including variations in myeloid-to-erythroid ratios, megakaryocyte numbers, and iron content, have been documented, suggesting that anatomical location significantly influences both sample characteristics and diagnostic interpretability. [13,28–30] Gal et al. demonstrated that the site of bone marrow acquisition significantly affects the myeloid-to-erythroid ratio in apparently healthy dogs, with consistent differences observed between anatomical locations. [28] Similarly, variations in bone marrow iron stores have been documented across sampling sites in dogs, with implications for the interpretation of iron-restricted erythropoiesis. [13,29]
Previous veterinary studies have provided limited evidence supporting multi-site sampling approaches. Abrams-Ogg et al. compared canine core bone marrow biopsies from multiple sites using different techniques and needles, demonstrating variability in sample quality and cellular composition between anatomical locations. [12] Defarges et al. compared sternal, iliac, and humeral bone marrow aspiration in Beagle dogs, revealing site-dependent differences in sample characteristics. [13] Furthermore, Aubry et al. evaluated bone marrow aspirates from multiple sites for staging of canine lymphoma and mast cell tumors, suggesting that multi-site sampling may improve diagnostic accuracy in neoplastic conditions. [26] However, these studies have been limited by small sample sizes and focus on specific disease conditions rather than a comprehensive evaluation of diagnostic accuracy across the spectrum of hematologic diseases encountered in clinical practice.
Given the critical role of bone marrow evaluation in canine hematologic disease diagnosis and the emerging evidence for spatial heterogeneity in hematopoietic tissues, there exists a significant knowledge gap regarding the optimal sampling strategy for veterinary bone marrow examination. The potential for improved diagnostic accuracy through multi-site sampling approaches, as demonstrated in selected conditions in human medicine, warrants systematic investigation in veterinary patients to establish evidence-based protocols that maximize diagnostic yield while maintaining procedural safety and feasibility. [12–14,26]
We hypothesized that multi-site bone marrow core sampling would provide superior diagnostic accuracy compared to current single-site sampling standards in canine hematology, and that anatomical location would significantly influence sample quality and diagnostic interpretability. The primary aim of this study was to evaluate the diagnostic capture probability of sampling one to four bone marrow core biopsy sites in dogs with suspected hematologic disease. Secondary objectives included assessing site-specific differences in sample quality, cellular composition, and inter-pathologist agreement, while determining the optimal number and combination of sampling sites to maximize diagnostic accuracy in clinical veterinary practice.
Methods
This randomized, blinded clinical trial, carried out at the University of Illinois Veterinary Teaching Hospital, prospectively enrolled client-owned dogs needing bone marrow (BM) evaluation as part of their diagnostic assessment for hematologic illness between July 2020 and April 2023. The research received approval from the University of Illinois Institutional Animal Care and Use Committee (IACUC #19119), and the owners granted informed written consent at the point of study enrollment. Criteria for inclusion were a body weight of more than 7 kg and clinical indications for BM examination, such as unexplained cytopenias or suspected marrow-based infectious, neoplastic, or immune-mediated disorders. All owners gave written informed consent prior to participation, and all procedures were performed by a board-certified veterinary internist (AG).
The ARROW OnControl Powered Driver system (Teleflex, Morrisville, NC, USA) was used to streamline the BM collection and reduce the duration of the procedure. Each dog underwent BM evaluation from four anatomical locations, the right and left proximal sections of the humerus and the iliac crest on both sides, according to a randomized block schedule (see Table 1). The rationale for randomization was to eliminate potential sequence effects, such as changes in operator technique, fatigue, or minor procedural variability that could systematically bias sample quality or diagnostic yield at specific sites. At the time of marrow sampling, a complete blood count (CBC) including a differential was obtained to enhance histopathological interpretation.
Sampling of the iliac crest was performed at the widest cranial-dorsal section of the wing, with the needle inserted parallel to its long axis. For the humeral sites, the needle was placed perpendicular to the craniolateral bone surface, lateral and distal to the greater tubercle. Core bone biopsy samples were collected using an 11-gauge BM trephine biopsy needle (OnControl system). After sample collection, all skin incisions were sealed with tissue adhesive. The BM trephine biopsies were fixed in 10% buffered formalin for 24h, followed by 24h of decalcification in 5% aqueous solution of EDTA disodium salt dihydrate. The UIUC Veterinary Histology Laboratory was contracted to process the formalin-fixed decalcified BM trephine biopsies, embed them in paraffin wax, section 3-µm thick histological sections, and stain samples from each site with hematoxylin and eosin, periodic acid–Schiff and Giemsa (utilizing both stains allowed for better differentiation between erythroid, myeloid and lymphoid cells), impregnated silver stain for reticulin, and Prussian blue stain for iron. Immunohistochemical stains were not performed.
Each specimen received a randomized accession identifier to guarantee that dog identity and sampling site were concealed from the two experienced veterinary clinical pathologists raters. To maintain an unbiased assessment, clinical pathologists remained blinded to all samples throughout slide preparation and analysis.
Primary Outcome: site-specific clinical diagnoses were established based on BM histopathology in the context of CBC results. For data analysis, every site within a given dog was assigned a numeric code as per a preset scheme, and up to three diagnostic codes could be attributed to each site by the reviewing clinical pathologists (see Table 2, S1 Table).
Secondary Outcomes: secondary measures included percent BM cellularity, myeloid-to-erythroid ratio (MER; based on a 300-cell differential), blast cell percentage (out of 300 cells), average megakaryocyte count (based on 10 random 40 × microscopic fields), ordinal assessment of overall sample quality (scored as non-diagnostic [=1], poor [=2], adequate [=3], or excellent [=4]), 1–4 ordinal iron stores score (1 = absent; 2 = minimal [rare, small foci of faintly visible granules]; 3 = moderate [clearly visible multifocal deposits in macrophages or along trabeculae]; 4 = abundant [dense, coalescing granules widely distributed throughout the marrow], based on 10 random 40 × microscopic fields), and whether lymphocytosis and plasmacytosis exceeded 5% and 2%, respectively.
Power sample size estimation
To ensure the study had adequate statistical power, two investigators (AG and NLV) conducted a power analysis using bootstrap simulations in R software (version 3.5.1). This analysis involved simulating 10 replicate experiments, with each experiment consisting of 1,000 resamples. In each simulation, a cohort of 15 dogs was created, and discordant BM results were generated using a Bernoulli process, assuming a 5% discordance rate based on previous findings in healthy dogs. For each simulated dog, sampling from four anatomical sites was modeled with two independent ratings per site, resulting in eight data points per animal. The simulated discordance rate from each resample was then tested against a clinically expected discordance rate of 10% using a chi-squared test with a significance level of α = 0.05. The results of this simulation indicated that a sample size of 15 dogs would provide approximately 91% power (95% CI: 90.2%–91.4%) to detect a significant difference between the observed and expected rates of discordance. Anticipating a non-diagnostic rate of around 33% based on prior work, a target enrollment of 20 dogs was initially set to account for potential sample attrition. The final study ultimately included 16 dogs that met all inclusion criteria.
Statistical methods
Statistical analyses were conducted by two investigators (AG and NLV) using SAS version 9.4 (SAS Institute Inc., Cary, NC). Two board-certified clinical pathologists (RR and SC) independently rated masked bone marrow biopsy cores. Nondiagnostic reads (histopathology code = 9) were summarized and then excluded from primary analyses. To judge accuracy without an external gold standard, a dog-specific “truth set” was created using the histopathology code(s) most frequently observed across that dog’s sites and both pathologists (δ = 0; ties retained) (S2 Table). For each dog–site pair, we evaluated three truth-set consistency rules: ANY (at that site, at least one pathologist’s interpretation is in the truth set), BOTH_any (at that site, both pathologists’ interpretations are in the truth set; if the truth set contains ties, the two interpretations may differ), and BOTH_same (at that site, both interpretations are in the truth set and both pathologists selected the same truth-set code; this agreement requirement applies even when the truth set contains ties). A read was deemed correct when its code belonged to the dog’s truth set. To determine how many of the 4 sites were correct, the probability of capturing ≥1 correct site was computed when sampling k = 1–4 sites (,
). These probabilities were averaged across dogs, and 95% CIs were obtained using 2,000 resamples under bootstrapping sampling (stratified by rule and k); paired differences between successive k values were bootstrapped similarly, and medians were tested against 0 with nonparametric location tests. To evaluate site effects on correctness, a mixed-effects logistic regression was fitted with biopsy site and pathologist as fixed effects and a random intercept for dog; Tukey adjustment was used for pairwise site comparisons, and model-based probabilities with 95% CIs are reported. Inter-pathologist agreement was assessed overall and by site using Cohen’s kappa and percent agreement with exact (Clopper–Pearson) 95% CIs; category levels were restricted to those observed by both readers to ensure valid tables. Intra-observer agreement (within pathologist, across site pairs within dog) used pairwise κ and percent agreement with exact 95% CIs; Light’s κ (mean of pairwise κ) summarized within-reader agreement.
Results
Among 128 possible bone-marrow interpretations (16 dogs × 4 sites × 2 clinical pathologists), 18 (13.3%) were nondiagnostic and occurred in 7 dogs; most nondiagnostic reads were from the right ilium and right humerus, and pathologist 1 contributed two-thirds of these calls (Table 3).
Two core biopsies (RH and LI from two different dogs) could not be obtained (missing data). Overall code frequencies are summarized in Table 3. Code 3 (leukemia/round-cell neoplasia) was assigned in 16 of the BM interpretations in 6/16 dogs (37.5%). Among these, 3 dogs had multi-site involvement (2 dogs with concordant Code-3 calls by both pathologists across ≥2 sites; 1 dog with multi-site Code-3 calls from one pathologist only). The remaining 3 dogs had single-site Code-3 assignments (1 dog concordant across both pathologists; 2 dogs called by one pathologist only). The per-dog site count for Code 3 ranged from 1 to 3 of 4; no dog was Code 3 at all four sites. The chance of obtaining at least one truth-consistent site increased with the number of sites sampled (Table 4, Fig 1).
Lines show means across dogs; bands show 95% bootstrap CIs. m_any = ≥1 pathologist’s read in the truth set; m_both_any = both in the truth set (not necessarily equal); m_both_same = both agree and the agreed code is in the truth set.
Moving from one to two sites produced the largest gain across all rules, with smaller increments thereafter, particularly under the permissive ANY rule (Table 5).
In a mixed-effects logistic model, neither site nor pathologist influenced per-read correctness; model-based site probabilities had wide, overlapping CIs, and all pairwise comparisons were non-significant (Table 6, Fig 2).
GLIMMIX least-squares means with 95% CIs for LH, LI, RH, RI on the probability scale. No site effect (p = 0.599) and no pathologist effect (p = 0.781). Point: LS-mean predicted probability; Horizontal line: 95% CI.
Inter-pathologist agreement was modest overall (κ = 0.30, 95% CI 0.11–0.49; 42% agreement, 95% CI 28.2–56.8%) and varied by site, being highest at RH and lowest at LH (Table 7).
Intra-observer agreement ranged from poor to substantial, depending on the site pair, with Pathologist 1 strongest at LH–RH and Pathologist 2 showing fair-to-moderate agreement across pairs; Light’s κ indicated overall fair repeatability for both (Table 8).
When pooled across pathologists, the highest agreement remained for LH–RH and LH–RI, with the lowest for LI–RH and RH–RI (Table 9, Fig 3).
Simple κ with 95% CIs for each site pair pooled across pathologists, with N overlaid. Highlights that LI–RH shows the highest agreement. Open circle: κ estimate; Horizontal line: 95% CI.
Secondary variables were compared across the four anatomical sampling sites (Table 10).
The MER, assessed by a single pathologist, showed significant site-dependent differences, with the highest ratios consistently observed at the right ilium. Least-squares mean logMER values were lower at the left humerus (LH) and right humerus (RH) compared to the right ilium (RI). Pairwise site comparisons confirmed that logMER was significantly reduced at LH (p = 0.0309) and RH (p = 0.0092) when contrasted with RI, while differences among LH, LI, and RH were not significant. There was no lateral (left–right) asymmetry between the LI–RI (p = 0.1375) or LH–RH (p = 0.9135). Sampling site significantly affected the mean megakaryocyte numbers (p = 0.0026), with LI samples containing significantly fewer megakaryocytes than LH (p = 0.0084) and RH (p = 0.0034); RI did not differ significantly from the other sites. Blast percentage, but not percentage cellularity, differed significantly between sites, with humeri generally having a higher blast percentage (LH–RI, p = 0.0354; LH-LI, p = 0.0628; RH–RI, p = 0.0877; RH–LI, p = 0.1331). Notably, substantial inter-dog variability was observed for all marrow features. For MER, the calculated intraclass correlation coefficient (ICC) was 0.54, indicating that over half of the total variance in logMER was attributable to differences between dogs; the inter-dog coefficient of variation was 46%, reflecting considerable biological diversity among individuals.
The likelihood of lymphocytosis and plasmacytosis was evaluated across bone marrow sampling sites in dogs (Table 11).
For both outcomes, there were no significant site differences, but the probability for lymphocytosis differed significantly between the raters (p < 0.0001). Individual dog differences accounted for a substantial portion of variation for lymphocytosis (ICC = 0.58), indicating that more than half of the variability in lymphocytosis could be attributed to differences between dogs. For plasmacytosis, dog-level differences were less pronounced (ICC = 0.20), reflecting lower inter-dog variability for this parameter.
Lastly, the sample quality and iron stores ordinal scores were evaluated across the bone marrow sites (Table 12).
There were significant differences among sites for both outcomes, with sites having significant effects on sample quality (p = 0.002) and iron store scoring (p = 0.004). Pairwise contrasts from mixed-effects ordinal models showed that ilial samples (left and right ilium) were consistently of higher quality and had greater iron stores compared to humeral samples (left and right humerus). There was considerable variability between dogs for both sample quality and iron stores. Dog-level clustering was modest for sample quality (ICC = 0.02; dog-level variance = 0.06), but substantial for iron stores (ICC = 0.70; dog-level variance = 7.54), indicating strong between-dog differences in iron store scores and reflecting important inter-individual differences in marrow composition across the dogs, and highlighting that anatomic location and individual factors jointly shape the quality and histologic interpretation of bone marrow core biopsies in clinical practice.
Discussion
This randomized clinical trial provides compelling evidence that multi-site bone marrow core sampling significantly improves diagnostic accuracy compared to the current veterinary standard of single-site biopsy. [6,7,10] The study demonstrates clear benefits to sampling multiple anatomical sites, with diagnostic capture probability increasing substantially from one to two sites and continuing to improve with additional sites. These findings fundamentally challenge established veterinary practice and align with emerging recognition of spatial heterogeneity in hematopoietic diseases. [16–18,22,25]
The most striking finding was the significant improvement in diagnostic capture when moving from one to two sampling sites. Under the ANY rule (one pathologist correct), the probability increased from 76.6% to 94.8%, representing an 18.2% improvement that was highly significant (P < 0.0001). Even under the more stringent BOTH_same rule (both pathologists agree on a correct diagnosis), the probability increased from 28.1% to 47.9%, a 19.8% improvement. These substantial gains with the addition of just one additional site underscore the significant risk of diagnostic error when relying on single-site sampling, consistent with recent evidence highlighting sampling limitations in selected human bone marrow pathologies. [23,31–33]
The observed benefit aligns with findings from human oncology, where multi-site tumor sampling has been shown to outperform routine single-site approaches in detecting intratumoral heterogeneity and high-grade disease components. [20,21,23] When heterogeneity is regionally rather than randomly distributed, as is characteristic of hematopoietic diseases, sampling multiple sites provides substantially greater diagnostic information than collecting larger volumes from a single location. [17,20,22,24]
Analysis of secondary variables revealed significant site-dependent differences that further strengthen the case for multi-site evaluation. Core biopsies from the iliac crest consistently yielded samples of higher quality and contained greater iron stores compared to those from the humerus (p = 0.002 for sample quality; p = 0.004 for iron stores). This finding may have critical clinical implications, as suboptimal sample quality can render specimens non-diagnostic, while accurate iron assessment is essential for evaluating iron deficiency anemia, chronic inflammatory conditions, and other systemic disorders. Site differences between the iliac crest and humerus may reflect anatomical and physiological differences between flat and long bones. The iliac crest contains predominantly cancellous bone with extensive hematopoietic tissue, while the humerus may require greater penetration to reach the medullary cavity. This aligns with findings in human medicine, where site-specific differences in cellular composition are also well-documented. However, the superior site may vary, as seen in humans, where humeral sites sometimes yield higher mesenchymal stem cell concentrations than the iliac crest. [13,28,29,34–37]
Substantial inter-dog variability in iron stores (ICC = 0.70) was observed, reflecting biological diversity and differences related to disease status among dogs. These findings indicate that, beyond anatomical site effects, individual patient factors also influence marrow composition. Nonetheless, the significant site-dependent differences identified in our study demonstrate that sampling location itself contributes independently to diagnostic variability, supporting the value of multi-site evaluation. [29,38,39]
The modest overall inter-pathologist agreement (κ = 0.30, 42% agreement) observed in our study is consistent with reported variability in human bone marrow pathology, where inter-observer agreement ranges from poor to moderate depending on the specific diagnostic criteria and disease entity. [40–42] Site-specific variation in agreement (highest at right humerus, lowest at left humerus) suggests that anatomical location may influence diagnostic confidence, adding another dimension to the argument for multi-site sampling. These findings mirror challenges documented in human hematopathology, where substantial inter-observer variability has been reported for morphologic assessments of dysplasia, blast counts, and specific disease classifications. [31,33] If certain anatomical sites are inherently more difficult to interpret accurately, whether due to technical factors during sampling, [38,39] processing artifacts, or inherent tissue characteristics, multi-site sampling provides opportunities to capture more diagnostically reliable material. [10,19,34,38,39,43–46]
The concept of spatial heterogeneity in bone marrow is increasingly recognized in both human and veterinary medicine. [16–18,22,25,28,47] Recent advances in spatial transcriptomics have revealed complex microenvironmental gradients within bone marrow, with distinct cellular niches and signaling domains. Hematopoietic stem cells show preferential localization to specific anatomical regions, and malignant cells can exhibit spatially restricted distributions. [16,17,22,48] For example, in multiple myeloma, studies have demonstrated that plasma cells are not homogeneously distributed throughout the bone marrow, with osteolytic lesions representing areas of concentrated infiltration that may contain biologically distinct cellular populations. [20,21,49,50] Similarly, clonal hematopoiesis studies have identified intra-patient spatial heterogeneity, with mutant clones detected at one anatomical location but not at contralateral sites. [24,25,51] This spatial organization likely reflects the complex architecture of bone marrow microenvironments, where different anatomical sites may support distinct cellular populations and disease processes. The observed differences in sample quality and iron content between sites in our study may represent manifestations of this underlying spatial organization of hematopoietic tissue.
The findings of this study have immediate practical implications for veterinary hematologic diagnosis. The current standard of single-site sampling carries a significant risk of false-negative results, particularly for patchy or regionally distributed diseases. [20,21,23] The 18.2% improvement in diagnostic accuracy achieved by adding a second site represents a clinically meaningful enhancement that could substantially impact patient outcomes. The site-specific differences in sample quality also inform optimal sampling strategies. Based on site-specific differences, the combination of one iliac and one humeral site appears most advantageous. Iliac crest biopsies provided higher sample quality and iron content, whereas humeral sites demonstrated slightly greater diagnostic correctness. Sampling from both a flat (ilium) and a long bone (humerus), therefore, captures complementary marrow characteristics and likely optimizes diagnostic yield when a two-site approach is used. [13,28,35] From a procedural standpoint, multi-site sampling using powered biopsy systems appears feasible and well-tolerated, as evidenced by the successful completion of four-site sampling in all study dogs. [14,43–46] The incremental time and cost associated with additional sites must be weighed against the substantial diagnostic benefits [50,51] demonstrated in this study.
Several limitations should be acknowledged when interpreting these results. First, the relatively small sample size (16 dogs) limits the statistical power for detecting smaller effect sizes, particularly for subgroup analyses of specific disease entities. The study population was also restricted to dogs requiring bone marrow evaluation for clinical indications, which may not represent the full spectrum of hematopoietic diseases encountered in veterinary practice. Second, the “truth set” methodology used to assess diagnostic accuracy, while innovative in the absence of an external gold standard, relies on the assumption that the most frequent diagnosis across sites and pathologists represents the correct diagnosis. This approach may potentially bias results toward more common conditions or may not account for cases where different sites genuinely harbor different pathologic processes. Furthermore, routine bone marrow histopathology evaluated without immunophenotyping or molecular assays has inherent limitations for resolving lineage specificity (myeloid vs lymphoid) and definitively distinguishing acute leukemia/myelodysplastic syndrome from other round-cell neoplasms (e.g., lymphoma, plasma-cell neoplasia). Conversely, applying diagnosis-specific diagnostic weighting or reclassifying cases post-hoc would introduce information bias and circular reasoning, as these corrections would use the same histologic data that required reclassification, creating a methodological loop that cannot be validated in the absence of an external reference standard. Third, the study was conducted at a single institution with a standardized protocol and an experienced operator, which may limit generalizability to other clinical settings with different expertise levels or equipment. The learning curve associated with multi-site sampling techniques and potential variations in sample processing across institutions could influence the reproducibility of these findings. [32,38,39,43–46] Fourth, while this study has proven that multi-site biopsy is better than a single biopsy from a single site, it was not designed to assess whether multi-site biopsy is better than multiple biopsies from the same anatomical site (an important future study). However, our data and biological rationale still support prioritizing two different sites when a second core is taken, because within-dog analyses showed meaningful across-site discordance in correctness, and because sampling two distinct compartments better addresses spatial heterogeneity than repeating the same site. Finally, the clinical outcomes associated with the improved diagnostic accuracy demonstrated in this study were not factored into the study design and were not assessed. While higher diagnostic yield intuitively suggests better patient care, future studies should evaluate whether multi-site sampling translates to improved therapeutic decision-making and patient outcomes.
This study demonstrated that moving from one to two sites materially increases the probability of capturing a correct diagnosis, and supports choosing two distinct anatomic sites to maximize complementary yield. The substantial improvement in diagnostic capture probability, coupled with site-specific differences in sample quality and iron content, demonstrates that anatomical location significantly influences both the adequacy and interpretability of bone marrow specimens. These findings align with emerging understanding of BM spatial heterogeneity and support consideration of multi-site sampling as a best-practice approach to improve diagnostic accuracy in veterinary bone-marrow evaluation. The modest additional procedural complexity appears justified by the significant diagnostic benefits, potentially leading to more accurate diagnoses and improved patient management in canine hematologic diseases.
Acknowledgments
The authors gratefully acknowledge the Small Animal Internal Medicine Technicians at the University of Illinois for helping with the bone marrow procedures
References
- 1. Turinelli V, Gavazza A, Stock G, Fournel-Fleury C. Canine bone marrow cytological examination, classification and reference values: A retrospective study of 295 cases. Res Vet Sci. 2015;103:224–30. pmid:26679822
- 2. Weiss DJ. A retrospective study of the incidence and the classification of bone marrow disorders in the dog at a veterinary teaching hospital (1996-2004). J Vet Intern Med. 2006;20(4):955–61. pmid:16955822
- 3. Raskin RE, Krehbiel JD. Prevalence of leukemic blood and bone marrow in dogs with multicentric lymphoma. J Am Vet Med Assoc. 1989;194(10):1427–9. pmid:2470709
- 4. Brown MR, Rogers KS. Neutropenia in dogs and cats: a retrospective study of 261 cases. J Am Anim Hosp Assoc. 2001;37(2):131–9. pmid:11300519
- 5. Couto CG. Clinicopathologic aspects of acute leukemias in the dog. J Am Vet Med Assoc. 1985;186(7):681–5. pmid:3857221
- 6. Grindem CB. Bone marrow biopsy and evaluation. Vet Clin North Am Small Anim Pract. 1989;19(4):669–96. pmid:2672535
- 7. Harvey JW. Canine bone-marrow - normal hematopoiesis, biopsy techniques, and cell identification and evaluation. Compendium on Continuing Education for the Practicing Veterinarian. 1984;6(10):909–27. pmid:WOS:A1984TN72500004
- 8. Lucidi C de A, de Rezende CLE, Jutkowitz LA, Scott MA. Histologic and cytologic bone marrow findings in dogs with suspected precursor-targeted immune-mediated anemia and associated phagocytosis of erythroid precursors. Vet Clin Pathol. 2017;46(3):401–15. pmid:28582594
- 9. Miller MD, Lunn KF. Diagnostic use of cytologic examination of bone marrow from dogs with thrombocytopenia: 58 cases (1994-2004). J Am Vet Med Assoc. 2007;231(10):1540–4. pmid:18020998
- 10. Raskin RE, Messick JB. Bone marrow cytologic and histologic biopsies: indications, technique, and evaluation. Vet Clin North Am Small Anim Pract. 2012;42(1):23–42. pmid:22285155
- 11. Weiss DJ, Evanson OA, Sykes J. A retrospective study of canine pancytopenia. Vet Clin Pathol. 1999;28(3):83–8. pmid:12075515
- 12. Abrams-Ogg ACG, Defarges A, Foster RA, Bienzle D. Comparison of canine core bone marrow biopsies from multiple sites using different techniques and needles. Vet Clin Pathol. 2012;41(2):235–42. pmid:22486437
- 13. Defarges A, Abrams-Ogg A, Foster RA, Bienzle D. Comparison of sternal, iliac, and humeral bone marrow aspiration in Beagle dogs. Vet Clin Pathol. 2013;42(2):170–6. pmid:23577956
- 14. Woods GA, Simpson M, Boag A, Paris J, Piccinelli C, Breheny C. Complications associated with bone marrow sampling in dogs and cats. J Small Anim Pract. 2021;62(3):209–15. pmid:33274762
- 15. Raskin RE, Krehbiel JD. Histopathology of canine bone marrow in malignant lymphoproliferative disorders. Vet Pathol. 1988;25(1):83–8. pmid:3422768
- 16. Bandyopadhyay S, Duffy MP, Ahn KJ, Sussman JH, Pang M, Smith D, et al. Mapping the cellular biogeography of human bone marrow niches using single-cell transcriptomics and proteomic imaging. Cell. 2024;187(12):3120-3140.e29. pmid:38714197
- 17. Iga T, Kobayashi H, Kusumoto D, Sanosaka T, Fujita N, Tai-Nagara I, et al. Spatial heterogeneity of bone marrow endothelial cells unveils a distinct subtype in the epiphysis. Nat Cell Biol. 2023;25(10):1415–25. pmid:37798545
- 18. Sarachakov A, Varlamova A, Svekolkin V, Polyakova M, Valencia I, Unkenholz C, et al. Spatial mapping of human hematopoiesis at single-cell resolution reveals aging-associated topographic remodeling. Blood. 2023;142(26):2282–95. pmid:37774374
- 19. Pichardo JC, Milner RJ, Bolch WE. MRI measurement of bone marrow cellularity for radiation dosimetry. J Nucl Med. 2011;52(9):1482–9. pmid:21799087
- 20. Merz M, Merz AMA, Wang J, Wei L, Hu Q, Hutson N, et al. Deciphering spatial genomic heterogeneity at a single cell resolution in multiple myeloma. Nat Commun. 2022;13(1):807. pmid:35145077
- 21.
Sudupe L, Muiños-Lopez E, Lopez-Perez AR, Vilas-Zornoza A, Sarvide S, Ripalda-Cemborain P, et al. Bone Marrow Spatial Transcriptomics Reveals a Myeloma Cell Architecture with Dysfunctional T-Cell Distribution, Neutrophil Traps, and Inflammatory Signaling. openRxiv. 2024. https://doi.org/10.1101/2024.07.03.601833
- 22. Xiao X, Juan C, Drennon T, Uytingco CR, Vishlaghi N, Sokolowskei D, et al. Spatial transcriptomic interrogation of the murine bone marrow signaling landscape. Bone Res. 2023;11(1):59. pmid:37926705
- 23. Wang J, Weiss LM, Chang KL, Slovak ML, Gaal K, Forman SJ, et al. Diagnostic utility of bilateral bone marrow examination: significance of morphologic and ancillary technique study in malignancy. Cancer. 2002;94(5):1522–31. pmid:11920510
- 24. Hartmann L, Hecker JS, Rothenberg-Thurley M, Rivière J, Jentzsch M, Ksienzyk B, et al. Compartment-specific mutational landscape of clonal hematopoiesis. Leukemia. 2022;36(11):2647–55. pmid:36131041
- 25. Young AL, Davis HC, Cox MJ, Parsons TM, Burkart SC, Bender DE, et al. Spatial Mapping of Hematopoietic Clones in Human Bone Marrow. Blood Cancer Discov. 2024;5(3):153–63. pmid:38421682
- 26. Aubry OA, Spangler EA, Schleis SE, Smith AN. Evaluation of bone marrow aspirates from multiple sites for staging of canine lymphoma and mast cell tumours. Vet Comp Oncol. 2014;12(1):58–66. pmid:22577893
- 27. Roldán-Carvajal IJ, Alzate-Velásquez D, Muñoz-Duque JD, Mesa-Oquendo AF, Salazar-Flórez JE, Jaramillo-Arbeláez PE. Characterization of bone marrow aspirate reports in dogs and cats: A retrospective study. Rev colomb cienc pecu. 2025;38(2).
- 28. Gal A, Burchell RK, Worth AJ, Lopez-Villallobos N, Marshall JC, MacNeill AL. The Site of Bone Marrow Acquisition Affects the Myeloid to Erythroid Ratio in Apparently Healthy Dogs. Vet Pathol. 2018;55(6):853–60. pmid:29940832
- 29. Pawsat GA, Fry MM, Behling-Kelly E, Olin SJ, Schaefer DMW. Bone marrow iron scoring in healthy and clinically ill dogs with and without evidence of iron-restricted erythropoiesis. Vet Clin Pathol. 2023;52(2):243–51. pmid:37127847
- 30. Stasney J, Higgins GM. A quantitative cytologic study of the bone marrow of the adult dog. The American Journal of the Medical Sciences. 1937;193(4):462–9.
- 31. Bain BJ, Bailey K. Pitfalls in obtaining and interpreting bone marrow aspirates: to err is human. J Clin Pathol. 2011;64(5):373–9. pmid:21296794
- 32. Lee S-H, Erber WN, Porwit A, Tomonaga M, Peterson LC, International Council for Standardization In Hematology. ICSH guidelines for the standardization of bone marrow specimens and reports. Int J Lab Hematol. 2008;30(5):349–64. pmid:18822060
- 33. Wilkins BS. Pitfalls in bone marrow pathology: avoiding errors in bone marrow trephine biopsy diagnosis. J Clin Pathol. 2011;64(5):380–6. pmid:21325142
- 34. Akintoye SO, Lam T, Shi S, Brahim J, Collins MT, Robey PG. Skeletal site-specific characterization of orofacial and iliac crest human bone marrow stromal cells in same individuals. Bone. 2006;38(6):758–68. pmid:16403496
- 35. Otto A, Muench LN, Kia C, Baldino JB, Mehl J, Dyrna F, et al. Proximal Humerus and Ilium Are Reliable Sources of Bone Marrow Aspirates for Biologic Augmentation During Arthroscopic Surgery. Arthroscopy. 2020;36(9):2403–11. pmid:32554079
- 36. Peters AE, Watts AE. Biopsy Needle Advancement during Bone Marrow Aspiration Increases Mesenchymal Stem Cell Concentration. Front Vet Sci. 2016;3:23. pmid:27014705
- 37. Wang X, Zou X, Zhao J, Wu X, E L, Feng L, et al. Site-Specific Characteristics of Bone Marrow Mesenchymal Stromal Cells Modify the Effect of Aging on the Skeleton. Rejuvenation Res. 2016;19(5):351–61. pmid:26650116
- 38. Brestoff JR, Toland A, Afaneh K, Qavi AJ, Press B, Westervelt P, et al. Bone Marrow Biopsy Needle Type Affects Core Biopsy Specimen Length and Quality and Aspirate Hemodilution. Am J Clin Pathol. 2019;151(2):185–93. pmid:30307478
- 39. Marinelli LM, Fang H, Howard MT, Hanson CA, Haack JJ, Eick EA, et al. Bone Marrow Biopsy Operator Experience and Impact on Aspirate, Biopsy, and Ancillary Testing Quality. Mayo Clin Proc Innov Qual Outcomes. 2018;2(3):241–7. pmid:30225457
- 40. Buhr T, Hebeda K, Kaloutsi V, Porwit A, Van der Walt J, Kreipe H. European Bone Marrow Working Group trial on reproducibility of World Health Organization criteria to discriminate essential thrombocythemia from prefibrotic primary myelofibrosis. Haematologica. 2012;97(3):360–5. pmid:22058215
- 41. Font P, Loscertales J, Benavente C, Bermejo A, Callejas M, Garcia-Alonso L, et al. Inter-observer variance with the diagnosis of myelodysplastic syndromes (MDS) following the 2008 WHO classification. Ann Hematol. 2013;92(1):19–24. pmid:22948274
- 42. Wilkins BS, Erber WN, Bareford D, Buck G, Wheatley K, East CL, et al. Bone marrow pathology in essential thrombocythemia: interobserver reliability and utility for identifying disease subtypes. Blood. 2008;111(1):60–70. pmid:17885079
- 43. Berenson JR, Yellin O, Blumenstein B, Bojanower D, Croopnick J, Aboulafia D, et al. Using a powered bone marrow biopsy system results in shorter procedures, causes less residual pain to adult patients, and yields larger specimens. Diagn Pathol. 2011;6:23. pmid:21429220
- 44. Miller LJ, Philbeck TE, Montez DF, Puga TA, Brodie KE, Cohen SC, et al. Powered bone marrow biopsy procedures produce larger core specimens, with less pain, in less time than with standard manual devices. Hematol Rep. 2011;3(1):e8. pmid:22184530
- 45. Reed LJ, Raghupathy R, Strakhan M, Philbeck TE, Kim MY, Battini R, et al. The OnControl bone marrow biopsy technique is superior to the standard manual technique for hematologists-in-training: a prospective, randomized comparison. Hematol Rep. 2011;3(3):e21. pmid:22593813
- 46. Swords RT, Anguita J, Higgins RA, Yunes AC, Naski M, Padmanabhan S, et al. A prospective randomised study of a rotary powered device (OnControl) for bone marrow aspiration and biopsy. J Clin Pathol. 2011;64(9):809–13. pmid:21606230
- 47. Chen J, Hendriks M, Chatzis A, Ramasamy SK, Kusumbe AP. Bone Vasculature and Bone Marrow Vascular Niches in Health and Disease. J Bone Miner Res. 2020;35(11):2103–20. pmid:32845550
- 48. Boueya IL, Sandhow L, Albuquerque JRP, Znaidi R, Passaro D. Endothelial heterogeneity in bone marrow: insights across development, adult life and leukemia. Leukemia. 2025;39(1):8–24. pmid:39528790
- 49. Al-Quran SZ, Yang L, Magill JM, Braylan RC, Douglas-Nikitin VK. Assessment of bone marrow plasma cell infiltrates in multiple myeloma: the added value of CD138 immunohistochemistry. Hum Pathol. 2007;38(12):1779–87. pmid:17714757
- 50. Li X, An C, Zhang W. Is it sufficient to evaluate metastatic bone involvement in breast cancer using SPECT/CT? A new approach of SPECT/CT-guided targeted bone marrow biopsy. BMC Cancer. 2022;22(1):614. pmid:35659208
- 51. Jumniensuk C, Nobori A, Lee T, Senaratne TN, Rao D, Pullarkat S. Concordance of Peripheral Blood and Bone Marrow Next-Generation Sequencing in Hematologic Neoplasms. Adv Hematol. 2022;2022:8091746. pmid:35378848