
FIRM image analysis: A machine learning workflow for quantifying extracellular matrix components from electron microscopy images

  • Nicholas T. Gigliotti,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Materials Science and Engineering, Whiting School of Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America

  • Justin Lee,

    Roles Conceptualization, Methodology, Software, Writing – review & editing

    Affiliation Department of Mechanical Engineering, Whiting School of Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America

  • Emily H. Mang,

    Roles Conceptualization, Data curation, Methodology, Validation, Writing – review & editing

    Affiliation Department of Materials Science and Engineering, Whiting School of Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America

  • Giancarlo R. Zambrano,

    Roles Data curation, Validation, Writing – review & editing

    Affiliation Department of Chemical and Biomolecular Engineering, Whiting School of Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America

  • Mitra L. Taheri

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Visualization, Writing – review & editing

    mtaheri4@jhu.edu

    Affiliation Department of Materials Science and Engineering, Whiting School of Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America

Abstract

The extracellular matrix (ECM) is a complex network of biomolecules that plays an integral role in the structure, processes, and signaling mechanisms of cells and tissues. Identifying and quantifying changes in these matrix components provides insight into the mechanisms behind specific tissue remodeling processes; however, quantifying these changes is challenging due to difficult imaging conditions, complexity of the ECM, and the subtlety of these changes. Current imaging techniques allow us to visualize these critical remodeling events and developments in image analysis have employed a combination of analysis software and machine learning techniques to improve the efficiency and accuracy with which features are measured. Although image analysis has seen much improvement in recent years, there has been no technique developed to address ambiguity in feature edges in electron microscopy images. Presented here is a new machine learning-based workflow for the analysis of microscopy images named FIRM (Feature Identification from Raw Microscopy) that uses a random forest classifier to identify ECM features of interest and generate binary segmentation masks for quantification with ImageJ-FIJI. FIRM performed with an F1 score of 0.794 and greater than 80% accuracy for number and size of features detected. FIRM had similar deviation from the ground truth in the number of identified fibrils, fibril size, and size distributions when compared to human analyses. The results suggest that FIRM performs as well as manual analysis and requires a fraction of the time. This analysis technique is more efficient, eliminates user bias, and can be easily optimized to identify a variety of features, making it useful for any discipline requiring image analysis.

Introduction

The extracellular matrix (ECM), a matrix rich in proteins and other biological molecules that provides structure and support to cells and tissues, has long been studied to understand the pathways and mechanisms by which cells and tissues signal each other, remodel, and respond to changes in their environment [1, 2]. Advancements in biological microscopy have made it possible to image the ECM at various length scales, providing information about its structure and composition and how they change over time [3]. Accurately identifying and quantifying features of interest in images of the ECM is difficult due to the complexity of the ECM and often poor imaging conditions. Because of the inherent water content of biological samples, the lack of heavy atoms, and short sample life, significant sample processing is required to image biological tissue in an electron microscope. While fixation techniques and the use of stains have made biological electron microscopy possible, it is difficult to manipulate delicate soft tissue samples without damaging the tissue. Stains, which are used to improve the contrast of features of interest, often leave staining artifacts that affect contrast and the ability to threshold images for analysis. Additional difficulties arise in maintaining sample orientation through sectioning to achieve ideal imaging planes. In many previous studies, researchers used programs like ImageJ-FIJI to aid in image analysis [4–10]. In some studies, analysis software was used as a measurement tool for quantifying feature geometries [7–10]. Others used available software packages to segment the image based on a user-defined threshold, separating the images into features and background [11–18].
Although ImageJ-FIJI greatly reduces time spent on manual analysis and offers a variety of tools for different analysis methods, sources of error remain due to inconsistent image processing and the need for human input to determine appropriate feature boundaries. Especially in cases of poor imaging conditions or sample preparation, high levels of noise can result in ambiguous feature edges, leaving their interpretation to the subjective judgment of the researcher. This introduces bias and makes tracking feature evolution across time points unreliable.

An apt example of difficult imaging conditions impacting ECM studies can be found in TEM micrographs of mouse cervix sections. The cervix (Fig 1), which undergoes significant remodeling during pregnancy, has been identified as a critical component that contributes to preterm birth [19, 20]. Since the primary structural component of the cervix is collagen, tracking collagen fibril evolution through pregnancy is necessary for understanding remodeling mechanisms. Due to limited samples and difficult imaging conditions, the fibril cross-sections shown in Fig 2 appear to have two boundaries, neither of which is well defined and both of which are highly dependent on image contrast, making it nearly impossible to reliably determine the degree of swelling or shrinking from one image to another. To define features accurately and consistently in the ECM and reliably study their evolution in EM images, a new method for analysis is required. Using a machine learning model to identify the fibrils without bias or human error allows for a more reliable view of ECM changes in the cervix.

Fig 1. Schematic of the cervix ECM.

The cervix (A) undergoes changes in the ECM (B) that contribute to its changing mechanical properties through pregnancy. Collagen is the main structural component of the ECM and is a primary feature of interest in understanding the remodeling process. (C) shows the cross-section of a collagen fiber, which is comprised of bundles of fibrils that undergo changes in size and density throughout pregnancy. Created with Biorender.com.

https://doi.org/10.1371/journal.pone.0312196.g001

Fig 2. TEM of collagen in the murine cervix.

(A) TEM micrograph of pregnant mouse cervix tissue showing collagen fibril cross-sections. (B) The magnified section demonstrates 2 ways to define ambiguous fibril edges based on either the bright centers or darker edges. (C) An idealized schematic of the magnified inset showing collagen fibril cross sections. Figure created with Biorender.com.

https://doi.org/10.1371/journal.pone.0312196.g002

Machine learning (ML) has become a topic of targeted development in recent years to aid in image analysis, making research more efficient and accurate. Generalized toolkits that utilize ML provide solutions for a variety of applications and can be trained in real time as images are collected on a microscope [21–24]. Although such programs are powerful and pervasive, they are not specifically optimized for niche applications like fibril detection in TEM images and often require training input from the individual during imaging, which still introduces user bias in how features are defined.

More specific tools have also been developed for analyzing biological images acquired through a variety of imaging modes, including but not limited to histological staining [25], electron microscopy (EM) [26–28], MRI [29], and polarized light microscopy [30, 31]. The two most commonly used machine learning algorithms are random forest classifiers (RFs) and convolutional neural networks (CNNs). In many of these examples, algorithms are used to classify a variety of ECM and other biological features based on manually annotated training data sets. Some of these ML tools rely on histological staining to classify collagen and other ECM components, using color values or hue as defining variables in the segmentation process. This aids in defining features but is not applicable to all imaging modes. For example, EM images, which offer nanometer-scale resolution, are grayscale, making them more challenging to classify accurately. Other ML tools have been developed for grayscale images, making them applicable to electron microscopy; however, these models target specific applications such as classification [25–31], tracking objects through 3D image stacks [26–28], or identifying density [25, 30, 31], and do not emphasize distinct edge definition for quantitative size measurements.

To the best of our knowledge, no prior ML-based workflow has been presented that addresses the segmentation and analysis of collagen fibrils in EM images. Here we present FIRM, a machine learning classification approach that uses a random forest classifier to address this problem. RF classifiers are well established in the machine learning community and are commonly used for image classification problems. They also provide a distinct advantage over CNNs because they require only minimal training data to accurately classify features. For applications that require tissue samples from animal trials, it is difficult to gather large training data sets due to limited sample collection.

FIRM utilizes manually annotated images as a training set for the RF classifier. Features are extracted from the training data set with a variety of image filters, and the classification model is fit to the training set. Once the model is trained, new images can be analyzed to identify features of interest. Output masks are generated that segment the features of interest, which can be quantitatively analyzed through ImageJ-FIJI. For collagen fibril identification and the resulting measurements, the model shows sufficient agreement with the established ground truth, performing on par with individual human reviewers. Size and number determination by FIRM consistently presented with an error of less than 20%. Precision and recall were also determined, resulting in an F1 score of 0.794 across all test images. This method eliminates human subjectivity in defining feature edges and allows for rapid iteration through large data sets with limited training images, reducing total image analysis time. FIRM can be tailored to identify a variety of features with different image filters, making it a powerful imaging tool not only for ECM quantification, but for any application requiring contrast-dependent object detection in grayscale electron microscopy images.

Methods

Mouse cervix image generation

Pregnant and non-pregnant CD-1 mice were acquired from Charles River Laboratories (Wilmington, Massachusetts, United States) and sacrificed via CO2 inhalation. Cervix tissue samples were isolated from the mice via dissection and were fixed in 4% formaldehyde – 1% glutaraldehyde fixative. Following a wash with S-Collidine buffer, the samples were post-fixed with 1% osmium tetroxide for 1 hour, dehydrated in graded concentrations of acetone, and embedded in epoxy resin. Thick sections were cut at 2 microns using a Leica 6 ultramicrotome (Wetzlar, Germany) and stained with 0.1% toluidine blue. Ultrathin 90 nm sections were placed on copper grids, stained with 3% uranyl acetate and 0.4% lead citrate, and examined with a JEOL JEM-1230 transmission electron microscope (Akishima, Tokyo, Japan) operated at 80 kV. Digital images were acquired using an Advanced Microscopy Techniques camera. Experiments were performed with approval from the University of Pennsylvania’s Institutional Animal Care and Use Committee (IACUC #805513) and adhered to the National Institutes of Health guidelines on laboratory animals.

Random forest detection model

Image segmentation for FIRM is performed using a random forest classifier in Python and was built with dependencies on the following packages: scikit-learn [32], scikit-image [33], pandas [34], NumPy [35], OpenCV [36], PIL [37], Matplotlib [38], joblib [39], and SciPy [40]. To train the classifier, 5 TEM images of mouse cervix sections were chosen based on image quality and fibril orientation. To create segmentation masks for training, the 5 training images were manually annotated using APEER annotation software [41] by 3 reviewers who were instructed to identify fibril cross-sections and background regions, resulting in 3 separate annotations for each image. One annotated mask was chosen at random for each image to account for variability in how fibril edges were defined between reviewers. Masks were exported to ImageJ-FIJI to define pixel values for each class. Training image pixel values were read using Python, and feature extraction was performed by generating new images with filters optimized for collagen fibril detection based on edge contrast effects. Filters included Canny edge, Roberts edge, Sobel, Scharr, Prewitt, Gaussian, and median filters. The minimum and maximum pixel values used for the Canny filter were 100 and 200, respectively. For the Gaussian filters, sigma values of 1 and 3 were used; for the median filter, a sigma value of 3 was used. The random forest classifier model was imported from the scikit-learn package and fitted to the training image sets and corresponding annotated masks with n = 8 decision trees. The data extracted from the training images was randomly split so that 80% of the dataset was used to train the model while 20% was reserved for testing model accuracy. This trained model was then loaded into a new script to perform feature detection. To test the model’s ability to detect fibrils, 11 new TEM images with a variety of collagen packing scenarios were selected.
The same filters were applied to the test images for feature extraction, and the filtered images were fed into the model for classification. Image post-processing was performed using a Gaussian blur with a kernel size of 7 to remove Gaussian noise, followed by a binary threshold with minimum and maximum pixel values of 100 and 255. Features were labeled and exported as a segmented mask. The FIRM workflow is outlined in Fig 3. All images used are available in the S2 Dataset.
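The training stage described above can be sketched in a few lines of Python. This is a minimal illustration, not the published FIRM code: it uses a reduced filter bank (Sobel, Gaussian, and median filters via SciPy, standing in for the full Canny/Roberts/Sobel/Scharr/Prewitt set) and a small synthetic image in place of annotated TEM data, but it follows the same structure of per-pixel feature extraction, an 80/20 split, and a random forest with 8 decision trees.

```python
import numpy as np
from scipy import ndimage as ndi
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

def extract_features(img):
    """Stack per-pixel filter responses (a reduced stand-in for the
    paper's filter bank: raw intensity, Sobel edges, Gaussian blurs
    with sigma 1 and 3, and a median filter)."""
    feats = [
        img,
        ndi.sobel(img, axis=0),
        ndi.sobel(img, axis=1),
        ndi.gaussian_filter(img, sigma=1),
        ndi.gaussian_filter(img, sigma=3),
        ndi.median_filter(img, size=3),
    ]
    return np.stack([f.ravel() for f in feats], axis=1)

# Synthetic stand-ins for a TEM image and its annotated mask:
# two bright disks ("fibrils") on a noisy background.
rng = np.random.default_rng(0)
img = rng.normal(0.2, 0.05, (64, 64))
yy, xx = np.mgrid[:64, :64]
mask = ((yy - 20) ** 2 + (xx - 20) ** 2 < 64) | ((yy - 45) ** 2 + (xx - 45) ** 2 < 64)
img[mask] += 0.6

X = extract_features(img)           # (pixels, features)
y = mask.ravel().astype(int)        # per-pixel class labels

# 80/20 split and n_estimators=8, matching the FIRM workflow.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
clf = RandomForestClassifier(n_estimators=8, random_state=0)
clf.fit(X_tr, y_tr)
print(round(clf.score(X_te, y_te), 3))  # held-out pixel accuracy
```

A trained classifier of this form can then be applied to the filter-response stack of any new image to produce a per-pixel segmentation mask.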

Fig 3. FIRM workflow.

Flow chart showing the FIRM workflow used to train the random forest classifier and detect features from test images.

https://doi.org/10.1371/journal.pone.0312196.g003

Quantitative measurements of fibril size

ImageJ-FIJI was used to quantify fibril size from both the model-generated segmentation masks and the manually annotated masks. Mean fibril diameter, size distribution, and number of fibrils were determined for both sets of masks using the particle analysis tool. Ellipses were fit to each fibril and the minor axis was taken as the diameter to account for non-normal fibril orientations. Size filtering was applied to eliminate detection of background features: features with areas greater than 5,000 nm² or less than 10 nm² were discarded. Size distributions were determined by counting the number of fibrils in designated bins based on fibril diameter, using 14 bins of 10 nm width spanning 10 nm to 150 nm.
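The particle-analysis step can be approximated in Python as follows. This is a sketch rather than the ImageJ-FIJI procedure itself: the two-disk mask and the 2 nm/pixel scale are hypothetical, and the ellipse fit uses the moment-equivalent ellipse (minor axis = 4 × the square root of the smallest eigenvalue of the pixel-coordinate covariance), which is how ImageJ's ellipse fitting is commonly described.

```python
import numpy as np
from scipy import ndimage as ndi

def fibril_diameters(mask, px_nm=1.0, min_area=10.0, max_area=5000.0):
    """Per-fibril diameters from a binary mask: fit a moment-equivalent
    ellipse to each connected component, take the minor axis as the
    diameter, and discard features outside the 10-5000 nm^2 area window."""
    labels, n = ndi.label(mask)
    diams = []
    for i in range(1, n + 1):
        ys, xs = np.nonzero(labels == i)
        area = ys.size * px_nm ** 2
        if not (min_area <= area <= max_area):
            continue
        # Minor axis = 4 * sqrt(smallest eigenvalue of the coordinate
        # covariance); for a filled disk this recovers the diameter.
        cov = np.cov(np.stack([ys, xs]) * px_nm)
        diams.append(4.0 * np.sqrt(np.linalg.eigvalsh(cov)[0]))
    return np.array(diams)

# Synthetic mask: two disk-shaped "fibrils" of radius 8 px at an
# assumed (hypothetical) scale of 2 nm per pixel, i.e. ~32 nm diameter.
yy, xx = np.mgrid[:64, :64]
mask = ((yy - 16) ** 2 + (xx - 16) ** 2 < 64) | ((yy - 45) ** 2 + (xx - 45) ** 2 < 64)
d = fibril_diameters(mask, px_nm=2.0)
# 14 bins of 10 nm spanning 10-150 nm, as in the size distributions.
counts, edges = np.histogram(d, bins=14, range=(10, 150))
print(len(d), d.round(1))
```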

Evaluation of FIRM identification and segmentation of fibrils

FIRM’s ability to properly identify and segment fibrils was evaluated by comparing model-generated masks to designated “ground truth” masks of the same TEM micrographs. To establish a “ground truth” for a given micrograph, 3 reviewers were asked to annotate the fibrils in the image and generate binary segmentation masks. The three resulting masks were merged and only areas where at least two of the three reviewers indicated a fibril were kept, removing any outlier annotations. Precision and recall metrics were evaluated to determine how accurately the model identifies fibrils in the correct location. To achieve this, model-generated masks were stacked with the corresponding ground truth mask and each fibril was classified as a true positive (TP) or false positive (FP). This determination was made by setting a threshold on the intersection over union (IoU) value, defined as the area of the feature overlap divided by the area of the union of the features, for each fibril. IoU values equal to or exceeding 0.33 were considered true positives. Values below 0.33 were considered false positives, indicating that the model incorrectly identified the location of the fibril. This threshold was chosen instead of the conventional IoU = 0.5 because, in some cases, fibril edge ambiguity may result in differences in area of up to 200% depending on edge definition. Consequently, identified fibrils could be wrongly assigned as false positives due to differences in area even when the model-generated and ground truth fibrils completely overlap. Precision was calculated with Eq 1 and indicates what fraction of the model-identified fibrils were identified correctly.

Precision = TP / (TP + FP) (Eq 1)

Recall indicates how many of the fibrils the model accurately identified compared to the total number in the ground truth (N_GT) and is calculated using Eq 2.

Recall = TP / N_GT (Eq 2)

Using these two metrics, the F1 score for the model was calculated using Eq 3 and provides a single value representing the harmonic mean of the precision and recall.

F1 = 2 × (Precision × Recall) / (Precision + Recall) (Eq 3)
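Eqs 1–3 together with the IoU matching rule can be sketched as follows. The masks here are synthetic stand-ins, and the matching strategy (comparing each predicted fibril against the ground-truth component it overlaps most) is one reasonable reading of the procedure described above, not the authors' exact implementation.

```python
import numpy as np
from scipy import ndimage as ndi

def iou(a, b):
    """Intersection over union of two boolean masks."""
    inter = np.logical_and(a, b).sum()
    union = np.logical_or(a, b).sum()
    return inter / union if union else 0.0

def prf1(pred_mask, gt_mask, thresh=0.33):
    """Precision, recall, and F1 per Eqs 1-3: a predicted fibril is a
    true positive when its IoU with the ground-truth fibril it overlaps
    most is >= 0.33; otherwise it is a false positive."""
    pred_lab, n_pred = ndi.label(pred_mask)
    gt_lab, n_gt = ndi.label(gt_mask)
    tp = 0
    for i in range(1, n_pred + 1):
        p = pred_lab == i
        # Count of overlapping pixels per ground-truth component.
        overlaps = np.bincount(gt_lab[p], minlength=n_gt + 1)[1:]
        if overlaps.any() and iou(p, gt_lab == overlaps.argmax() + 1) >= thresh:
            tp += 1
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gt if n_gt else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Synthetic example: the ground truth has two fibrils; the "model"
# finds one of them exactly and hallucinates one elsewhere.
yy, xx = np.mgrid[:64, :64]
disk = lambda cy, cx: (yy - cy) ** 2 + (xx - cx) ** 2 < 64
gt = disk(16, 16) | disk(45, 45)
pred = disk(16, 16) | disk(45, 10)
precision, recall, f1 = prf1(pred, gt)
print(precision, recall, f1)
```

With one true positive out of two predictions and two ground-truth fibrils, this example yields precision = recall = F1 = 0.5.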

Evaluation of FIRM performance vs other methods

The number of fibrils identified, average fibril size, and fibril size distribution were identified as key parameters for evaluating the performance of the model against other methods. Ilastik, an open-source RF-based toolkit for image segmentation and analysis [21], was used to analyze the same dataset so that its performance could be compared against FIRM’s. Human analyses were also compared to FIRM to assess performance relative to traditional manual analysis. The results for each method were plotted in a parity plot against “ground truth” values, and the coefficient of determination, R2, was reported as a measure of agreement between measured data and the ground truth. To account for variability in human performance, 3 individual reviewers were asked to manually annotate the fibrils, generate binary segmentation masks, and analyze the fibril count, size, and size distribution via the particle analysis tool in ImageJ-FIJI. Individual R2 values as well as an average R2 value from all 3 reviewers were reported. Additionally, precision, recall, and F1 scores were determined for Ilastik to further compare the accuracy of the two ML-based methods.
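A minimal sketch of the agreement statistic, assuming R2 is computed against the parity line y = x (consistent with the "R2 = 1 line" in the parity plots corresponding to the ground truth) rather than against a fitted regression line; the per-image fibril counts below are hypothetical.

```python
import numpy as np

def r2_vs_ground_truth(measured, truth):
    """Coefficient of determination measured against the parity line
    y = x: residuals are deviations from the ground-truth values
    themselves, so R^2 = 1 means perfect agreement."""
    measured = np.asarray(measured, dtype=float)
    truth = np.asarray(truth, dtype=float)
    ss_res = np.sum((measured - truth) ** 2)
    ss_tot = np.sum((truth - truth.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

# Hypothetical per-image fibril counts for one method vs the ground truth.
truth = np.array([120, 95, 150, 80])
firm_counts = np.array([112, 101, 143, 86])
r2 = r2_vs_ground_truth(firm_counts, truth)
print(round(r2, 3))
```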

Evaluation of model training on FIRM performance

Further investigation into the impact of training data on segmentation accuracy was also performed. Although FIRM has been designed to utilize only a few training images (n = 5) to limit computational costs and annotation requirements, training sets of 7 and 10 images were also considered. Fibril count, size, and distribution were measured and compared to assess whether the model improved with more training images. Training sets above 10 images were computationally expensive and were not considered. In addition to training set size, the number of annotations included in each training image was examined, since RF classifiers may be adequately trained with sparse annotations. Segmentation masks were generated using FIRM with 25%, 50%, 75%, and 100% fibril annotations in the training set. Segmented masks were overlain with the original TEM images and qualitatively compared.
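The sparse-annotation experiment can be emulated by randomly retaining a fraction of the labeled fibrils in each training mask. The sketch below works under that assumption, with a hypothetical four-fibril mask; as the comment notes, in a real sparse-annotation workflow the dropped fibrils would typically be left unlabeled rather than relabeled as background.

```python
import numpy as np
from scipy import ndimage as ndi

def sparse_annotation(mask, frac, seed=0):
    """Randomly keep only a fraction of the annotated fibrils in a
    training mask, emulating the 25/50/75/100% annotation test.
    Note: here the dropped fibrils revert to background pixels; in
    practice they would simply be excluded from training."""
    labels, n = ndi.label(mask)
    rng = np.random.default_rng(seed)
    keep = rng.choice(np.arange(1, n + 1), size=max(1, round(frac * n)), replace=False)
    return np.isin(labels, keep)

# Synthetic training mask with four disjoint disk "fibrils".
yy, xx = np.mgrid[:50, :50]
disk = lambda cy, cx: (yy - cy) ** 2 + (xx - cx) ** 2 < 25
mask = disk(10, 10) | disk(10, 40) | disk(40, 10) | disk(40, 40)
half = sparse_annotation(mask, 0.5)  # retains 2 of the 4 fibrils
print(ndi.label(half)[1])
```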

Results and discussion

Fig 4 shows a subset of the original TEM micrographs with corresponding model-generated masks and an overlay showing a visual representation of the agreement between the model and original images. The overlay demonstrates that a majority of the identified fibrils appear in the same locations as the fibrils from the original image, qualitatively showing FIRM’s ability to identify features of interest.

Fig 4. TEM micrographs with FIRM segmentation.

Column A) Original TEM images of mouse cervix tissue. Column B) FIRM-generated segmentation masks of identified collagen fibrils. Column C) FIRM-detected fibrils (red) overlain with the corresponding TEM micrographs to demonstrate accuracy of feature identification.

https://doi.org/10.1371/journal.pone.0312196.g004

Precision and recall evaluations are shown in Table 1. Precision scores varied between 0.739 and 0.986 and recall scores varied between 0.578 and 0.939 depending on the image selected. Across all images, the model has a precision of 0.821 and a recall of 0.769, resulting in an F1 score of 0.794, which demonstrates similar performance to other machine learning classification models developed for biological imaging applications. Chen et al. presented four models for segmentation-based feature extraction for 3D cryo-electron imaging of proteins. Precision ranged from 0.61 to 0.79, recall ranged from 0.59 to 0.81, and F1 scores ranged from 0.66 to 0.80 [26]. In another example, Touma et al. utilized Google AutoML to classify cataract phases from surgical videos. They reported an F1 score of 0.79 with a precision of 0.81 and a recall of 0.77 [42]. Sheneman et al. explored the use of machine learning methods to segment lipid droplets imaged via quantitative phase imaging. The random forest classifier they employed performed with an F1 score of 0.86, a precision of 0.89, and a recall of 0.82 [43]. These comparisons show that FIRM performs comparably with other machine learning segmentation-based tools. Table 2 contains precision and recall evaluations for Ilastik, another RF classifier tool available for segmentation and analysis [21]. When run on a subset of the same data, FIRM consistently outperformed Ilastik, which achieved F1 scores ranging from 0.384 to 0.78 with an overall F1 score of 0.547. Although both Ilastik and FIRM rely on RF classification, FIRM uses a greater variety of edge detection filters, whereas Ilastik only incorporates Gaussian-based edge filters. Fig 5 shows a visual comparison of FIRM and Ilastik segmentation. Although FIRM’s performance metrics achieve reasonable success compared to both manual and other ML-based methods, they are inconsistent from image to image. Comparing the values for precision and recall with a visual assessment of image quality and collagen packing density suggests that the accuracy of the model depends on both the quality of the test image and how densely packed the collagen fibrils are. Despite this limitation, average performance metrics all fall within the ranges reported in the literature.

Fig 5. Visual comparison of FIRM and Ilastik segmentation of collagen fibrils in TEM images.

The left column contains original TEM images, the middle column contains overlays of the FIRM segmentation masks, and the right column contains overlays of the Ilastik segmentation masks. ML-segmented collagen fibrils appear in red.

https://doi.org/10.1371/journal.pone.0312196.g005

Table 1. Summary of FIRM performance: Precision, recall, and F1 score.

https://doi.org/10.1371/journal.pone.0312196.t001

Table 2. Summary of Ilastik performance: Precision, recall, and F1 score.

https://doi.org/10.1371/journal.pone.0312196.t002

The number of identified fibrils and the average size of the fibrils are shown in Figs 6 and 7. Percent error calculations for number of detected fibrils and average fibril diameter (Table 3) show that the model consistently reports with less than 20% error compared to the ground truth, except for one outlier, Image 9.

Fig 6. Number of collagen fibrils detected by FIRM compared to the ground truth for each test image.

https://doi.org/10.1371/journal.pone.0312196.g006

Fig 7. Average fibril diameter measured by FIRM compared to the ground truth for each image.

Error bars represent standard deviation.

https://doi.org/10.1371/journal.pone.0312196.g007

Table 3. Percent error calculations for FIRM compared to the ground truth.

https://doi.org/10.1371/journal.pone.0312196.t003

Parity plots comparing FIRM to manual feature identification are shown in Fig 8, and plots comparing FIRM to Ilastik are shown in Fig 9. R2 values indicate agreement with the ground truth. For average fibril size across all 11 test images, R2 = 0.846 for FIRM, while the average for human reviewers via manual methods was 0.863. For the number of fibrils identified, R2 = 0.950 for FIRM and 0.973 for manual methods. To compare size distributions, each data point on the parity plot represents the percentage of fibrils that falls into a specified bin for a given image; all images were compiled into a single parity plot, giving R2 = 0.804 for FIRM and 0.814 for manual methods. For all three metrics, FIRM performs similarly to manual methods. When comparing FIRM to Ilastik across 8 different images, FIRM achieved R2 values of 0.846, 0.950, and 0.796 for size, count, and distribution, respectively, whereas Ilastik achieved 0.787, 0.871, and 0.417, demonstrating that FIRM outperforms Ilastik in each performance metric.

Fig 8. FIRM vs human analysis parity plots.

Plots show ground truth and measured values of (A) average fibril size, (B) number of fibrils, and (C) fibril size distribution for FIRM and human analyses. Coefficient of determination (R2) is displayed for each dataset. The R2 = 1 line, which corresponds to the ground truth, is displayed for reference.

https://doi.org/10.1371/journal.pone.0312196.g008

Fig 9. FIRM vs Ilastik parity plots.

Plots show ground truth and measured values of (A) average fibril size, (B) number of fibrils, and (C) fibril size distribution for FIRM and Ilastik analyses. Coefficient of determination (R2) is displayed for each dataset. The R2 = 1 line, which corresponds to the ground truth, is displayed for reference.

https://doi.org/10.1371/journal.pone.0312196.g009

To further investigate the use of the FIRM workflow for collagen analysis, the size of the training datasets and the number of annotations supplied were also examined. One of the advantages of the RF classifier employed in FIRM is that segmentation can be achieved with reasonable success using relatively small sets of training data. Fig 10 shows parity plots of fibril size, count, and distribution for FIRM operating with 5, 7, and 10 training images to assess the impact of larger training datasets on performance. The smaller training dataset outperformed the larger training sets in each performance metric, indicating that operating FIRM with 5 training images balances performance and computational cost while avoiding overfitting.

Fig 10. FIRM increased training set parity plots.

Plots show ground truth and measured values for (A) average fibril size, (B) number of fibrils, and (C) fibril size distribution for FIRM with training sets of n = 5, 7, and 10 images. Coefficient of determination (R2) is displayed for each dataset. The R2 = 1 line, which corresponds to the ground truth, is displayed for reference.

https://doi.org/10.1371/journal.pone.0312196.g010

Another advantage of RF classifiers is that in some instances they can be trained on sparse annotations rather than meticulously generated training masks and still achieve successful segmentation. Fig 11 shows a qualitative analysis of varying degrees of annotation in the training dataset. When only 25% of the fibrils are annotated, the RF classifier still achieves moderate success at identifying fibrils in the image with clear, high-contrast, moderately packed collagen fibrils (bottom). However, the sparse annotations lead to difficulties identifying fibrils in the images of densely packed fibrils (middle) or loosely packed collagen with non-fibril features scattered throughout (top). As the number of annotations increases, fibril detection improves continuously in all 3 images. While sparse training annotations can be sufficient and time-efficient for optimal images, in most cases it is recommended to fully annotate the training images for best results.

Fig 11. TEM micrographs with FIRM RF classifier overlays.

From left to right, images show FIRM fibril detection using 25%, 50%, 75%, and 100% annotated images for model training.

https://doi.org/10.1371/journal.pone.0312196.g011

With a limited training data set of 5 images, FIRM was able to identify collagen fibril cross-sections in TEM images. The RF classifier identified the number and average size of the fibrils within a reasonable margin of error across all test images and had an F1 score that outperformed similar RF classifier-based tools on the same dataset and fell within the range for acceptable ML-based image segmentation presented in literature. Although the performance of FIRM is dependent on image quality and collagen packing, this workflow demonstrates that classification of ECM features in electron microscopy images can be achieved with accuracy and improved efficiency compared to manual methods despite a limited training data set. Furthermore, removing subjectivity from a reviewer’s interpretation of feature boundaries results in more consistent analyses from image to image, eliminating bias in studies where image analysis may not be completed consistently between two images.

Depending on the goals for a specific study, prioritization of specific performance metrics may be necessary. For tracking ECM component evolution through a remodeling process, a higher emphasis is placed on the size distribution, average fibril diameter, and F1 score. The number of fibrils detected is not as important as minimizing false positives. False positives can skew size related data, making it difficult to accurately compare fibril sizes throughout the remodeling process. False negatives are less detrimental to size measurements since they only serve to reduce sample size instead of adding incorrect data points. For the application presented in this work, it was concluded that FIRM performs sufficiently to replace manual analysis of EM images with the added benefits of reduced variation due to subjectivity in fibril edge definition and rapid iteration through large datasets. For other applications, it may be necessary to optimize the model for specific evaluation metrics so that the resulting classification is more relevant.

Conclusion

The machine learning approach proposed in this study is a promising method for image quantification that can be used to identify and extract a variety of features across many applications. FIRM-generated segmentation masks showed agreement in collagen fibril number, size, and distribution with the ground truth. Further analysis of the precision, recall, and F1 score showed that the model’s performance is dependent on image quality but is nonetheless comparable with similar attempts at ML-based segmentation reported in the literature. With high quality images, the model achieved an F1 score as high as 0.928, whereas a lower quality image lowered the F1 score to 0.689. For the application explored in this study, the model performs well enough to analyze fibril cross-sections with agreement with the ground truth comparable to that of individual human reviewers, with the added benefit of removing user bias from the analysis process. Rapid iteration through image sets with large numbers of fibrils made the image analysis faster and more efficient than manual methods, even when the latter used software aids like ImageJ-FIJI.

This workflow allows quick, accurate segmentation of features from grayscale images, facilitating relevant analyses of the ECM or other components in any biological tissue system. Furthermore, the model's applicability extends beyond biological imaging: any discipline requiring feature detection and consistent edge definition, from any imaging mode, can benefit from FIRM, improving the reliability and consistency of feature quantification and the speed at which these measurements are acquired. Many other tools have been developed to aid feature quantification but fail to address the challenges of electron microscopy analysis, including user bias, the lack of color staining for feature identification, ambiguous feature edges from poor imaging conditions, and contrast variability between images that makes batch thresholding inconsistent. FIRM addresses these concerns and can accurately identify and segment fibrils without human bias while requiring only a small training set of annotated grayscale images.

Future studies will aim to quantify image quality using the signal-to-noise ratio and to determine a threshold for the minimum image quality required for reliable classification. Further analysis with larger training data sets (>10 images) and an increased number of decision trees will be performed to increase accuracy and broaden the applicability of FIRM to scenarios where large training sets are available.
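One common way such an SNR could be estimated is the mean image intensity divided by the standard deviation of a feature-free background region. The helper below, `estimate_snr`, is a hypothetical sketch of that definition (SNR conventions vary and this is not part of FIRM), demonstrated on a synthetic uniform image with known noise.

```python
import numpy as np

def estimate_snr(image, background_mask):
    """Rough SNR estimate: mean intensity divided by the standard deviation
    of a user-selected, feature-free background region."""
    return image.mean() / image[background_mask].std()

# Synthetic check: uniform intensity 100 with Gaussian noise of sigma 5
rng = np.random.default_rng(1)
noisy = 100.0 + rng.normal(0, 5, (32, 32))
bg = np.ones_like(noisy, dtype=bool)  # whole frame treated as background here
snr = estimate_snr(noisy, bg)         # expect roughly 100 / 5 = 20
```

A minimum-quality threshold would then correspond to rejecting or flagging images whose estimated SNR falls below an empirically determined cutoff.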

Acknowledgments

The authors would like to acknowledge James Hart and Sirma Koutzaki for image generation, Vivian Su and David Woo for their assistance with training annotations, and Michal Elovitz and Lou Soslowsky for helpful discussions and collaboration in mouse cervix preparation and testing. Biorender.com was used to generate figures.

References

  1. Yue B. Biology of the extracellular matrix: an overview. J Glaucoma. 2014; 23(8 Suppl 1): S20–S23. pmid:25275899
  2. Hu M, Ling Z, Ren X. Extracellular matrix dynamics: tracking in biological systems and their implications. J Biol Eng. 2022; 16: 13. pmid:35637526
  3. Leonard AK, Loughran EA, Klymenko Y, Liu Y, Kim O, Asem M, et al. Methods for the visualization and analysis of extracellular matrix protein structure and degradation. Methods Cell Biol. 2018; 143: 79–95. pmid:29310793
  4. Wollmann T, Erfle H, Eils R, Rohr K, Gunkel M. Workflows for microscopy image analysis and cellular phenotyping. J Biotechnol. 2017; 261: 70–75. pmid:28757289
  5. Eliceiri KW, Berthold MR, Goldberg IG, Ibáñez L, Manjunath BS, Martone ME, et al. Biological imaging software tools. Nat Methods. 2012; 9(7): 697–710. pmid:22743775
  6. Jung SK. A Review of Image Analysis in Biochemical Engineering. Biotechnol Bioprocess Eng. 2019; 24(1): 65–75.
  7. Raub CB, Suresh V, Krasieva T, Lyubovitsky J, Mih JD, Putnam AJ, et al. Noninvasive assessment of collagen gel microstructure and mechanics using multiphoton microscopy. Biophys J. 2007; 92(6): 2212–2222. pmid:17172303
  8. Jones B, Tonniges JR, Debski A, Albert B, Yeung DA, Gadde N, et al. Collagen fibril abnormalities in human and mice abdominal aortic aneurysm: Abnormal collagen fibrils in human and mice AAA. Acta Biomater. 2020; 110: 129–140. pmid:32339711
  9. Fahlén J, Salmén L. Pore and Matrix Distribution in the Fiber Wall Revealed by Atomic Force Microscopy and Image Analysis. Biomacromolecules. 2005; 6(1): 433–438. pmid:15638549
  10. Kim EH, Lim S, Kim E, Jeon IO, Choi YS. Preparation of in situ Injectable Chitosan/Gelatin Hydrogel Using an Acid-tolerant Tyrosinase. Biotechnol Bioprocess Eng. 2018; 23: 500–506.
  11. Gole L, Yeong J, Lim JCT, Ong KH, Han H, Thike AA, et al. Quantitative stain-free imaging and digital profiling of collagen structure reveal diverse survival of triple negative breast cancer patients. Breast Cancer Res. 2020; 22(1). pmid:32375854
  12. Wülfers EM, Greiner J, Giese M, Madl J, Kroll J, Stiller B, et al. Quantitative collagen assessment in right ventricular myectomies from patients with tetralogy of Fallot. Europace. 2021; 23: i38–i47. pmid:33404047
  13. Taufalele PV, VanderBurgh JA, Muñoz A, Zanotelli MR, Reinhart-King CA. Fiber alignment drives changes in architectural and mechanical features in collagen matrices. PLoS One. 2019; 14(5): e0216537. pmid:31091287
  14. Vasiukov G, Novitskaya T, Senosain MF, Camai A, Menshikh A, Massion P, et al. Integrated Cells and Collagen Fibers Spatial Image Analysis. Front Bioinform. 2021. pmid:35813245
  15. D’Amore A, Stella JA, Wagner WR, Sacks MS. Characterization of the complete fiber network topology of planar fibrous tissues and scaffolds. Biomaterials. 2010; 31(20): 5345–5354. pmid:20398930
  16. Yehliu K, Vander Wal RL, Boehman AL. Development of an HRTEM image analysis method to quantify carbon nanostructure. Combust Flame. 2011; 158(9): 1837–1851.
  17. Jung SK, Aleman-Meza B, Riepe C, Zhong W. QuantWorm: A Comprehensive Software Package for Caenorhabditis elegans Phenotypic Assays. PLoS One. 2014; 9(1): e84830. pmid:24416295
  18. Nikolaisen J, Nilsson LIH, Pettersen IKN, Willems PHGM, Lorens JB, Koopman WJH, et al. Automated Quantification and Integrative Analysis of 2D and 3D Mitochondrial Shape and Network Properties. PLoS One. 2014; 9(7): e101365. pmid:24988307
  19. Mahendroo M. Cervical remodeling in term and preterm birth: insights from an animal model. Reproduction. 2012; 143: 429–438. pmid:22344465
  20. Timmons B, Akins M, Mahendroo M. Cervical remodeling during pregnancy and parturition. Trends Endocrinol Metab. 2010; 21: 353–361. pmid:20172738
  21. Berg S, Kutra D, Kroeger T, Straehle CN, Kausler BX, Haubold C, et al. ilastik: interactive machine learning for (bio)image analysis. Nat Methods. 2019; 16: 1226–1232. pmid:31570887
  22. Uchida S. Image processing and recognition for biological images. Dev Growth Differ. 2013; 55(4): 523–549. pmid:23560739
  23. Schorb M, Haberbosch I, Hagen WJH, Schwab Y, Mastronarde DN. Software tools for automated transmission electron microscopy. Nat Methods. 2019; 16(6): 471–477. pmid:31086343
  24. Kistenev YV, Vrazhnov DA, Nikolaev VV, Sandykova EA, Krivova NA. Analysis of Collagen Spatial Structure Using Multiphoton Microscopy and Machine Learning Methods. Biochemistry (Mosc). 2018; 94(1): S108–S123. pmid:31213198
  25. Pham TTA, Kim H, Lee Y, Kang HW, Park S. Deep Learning for Analysis of Collagen Fiber Organization in Scar Tissue. IEEE Access. 2021; 9: 101755–101764.
  26. Chen L, Jebril R, Al Nasr K. Segmentation-based Feature Extraction for Cryo-Electron Microscopy at Medium Resolution. In: Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics. 2020; 113: 1–9.
  27. Almutairi Y, Cootes T, Kadler K. Analysing the Structure of Collagen Fibres in SBFSEM Images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops; 2016.
  28. Almutairi Y. Tracking Collagen Fibres through Image Volumes from SBFSEM. In: Lambrou T, Ye X, editors. MIUA 2015: 19th Annual Conference, BMVA. UK: BMVA Press; 2015. p. 40–45.
  29. Gray KR, Aljabar P, Heckemann RA, Hammers A, Rueckert D. Random forest-based similarity measures for multi-modal classification of Alzheimer’s disease. NeuroImage. 2013; 65: 167–175. pmid:23041336
  30. Roa C, Du Le VN, Mahendroo M, Saytashev I, Ramella-Roman JC. Auto-detection of cervical collagen and elastin in Mueller matrix polarimetry microscopic images using K-NN and semantic segmentation classification. Biomed Opt Express. 2021; 12(4): 2236–2249. pmid:33996226
  31. Gonzalez M, Roa C, Jimenez A, Gomez-Guevara R, Du Le VN, Novikova T, et al. Machine learning powered Mueller matrix microscope for collagen and elastin visualization in the mouse cervix. In: Proceedings SPIE 11963, Polarized Light and Optical Angular Momentum for Biomedical Diagnostics. 2022.
  32. Buitinck L, Louppe G, Blondel M, Pedregosa F, Mueller A, Grisel O, et al. API design for machine learning software: experiences from the scikit-learn project. In: European Conference on Machine Learning and Principles and Practices of Knowledge Discovery in Databases. 2013; 108–122.
  33. van der Walt S, Schonberger JL, Nunez-Iglesias J, Boulogne F, Warner JD, Yager N, et al. scikit-image: image processing in Python. PeerJ. 2014; 2: e453. pmid:25024921
  34. The pandas development team. pandas-dev/pandas: Pandas (v2.0.3). Zenodo. 2023. https://doi.org/10.5281/zenodo.8092754
  35. Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, et al. Array programming with NumPy. Nature. 2020; 585: 357–362. pmid:32939066
  36. Bradski G. The OpenCV Library. Dr. Dobb’s Journal of Software Tools. 2000.
  37. Murray A, van Kemenade H, wiredfool, Clark JA, Karpinsky A, Baranovič O, et al. python-pillow/Pillow: 10.0.0. Zenodo. 2023. https://doi.org/10.5281/zenodo.8104287
  38. Hunter JD. Matplotlib: A 2D graphics environment. Computing in Science & Engineering. 2007; 9(3): 90–95.
  39. Varoquaux G, Grisel O. Joblib: running Python functions as pipeline jobs. packages.python.org/joblib. 2009.
  40. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat Methods. 2020; 17(3): 261–272. pmid:32015543
  41. Dang D, Le M, Irmer T, Angay O, Fichtl B, Schwarz B. APEER: an interactive cloud platform for microscopists to easily deploy deep learning. Zenodo. 2021.
  42. Touma S, Antaki F, Duval R. Development of a code-free machine learning model for the classification of cataract surgery phases. Sci Rep. 2022; 12: 2398. pmid:35165304
  43. Sheneman L, Stephanopoulos G, Vasdekis AE. Deep learning classification of lipid droplets in quantitative phase images. PLoS One. 2021; 16(4): e0249196. pmid:33819277