
Histopathological domain adaptation with generative adversarial networks: Bridging the domain gap between thyroid cancer histopathology datasets

  • William Dee ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    w.t.dee@qmul.ac.uk (WD); e.marouli@qmul.ac.uk (EM)

    Affiliation Digital Environment Research Institute (DERI), Queen Mary University of London, London, United Kingdom

  • Rana Alaaeldin Ibrahim,

    Roles Data curation, Formal analysis, Investigation, Writing – review & editing

    Affiliations Centre for Oral Immunobiology and Regenerative Medicine, Institute of Dentistry, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom, Department of Oral Pathology, Faculty of Dentistry, Mansoura University, Egypt and Queen Mary University of London, London, United Kingdom

  • Eirini Marouli

    Roles Conceptualization, Supervision, Writing – review & editing

    w.t.dee@qmul.ac.uk (WD); e.marouli@qmul.ac.uk (EM)

    Affiliations Digital Environment Research Institute (DERI), Queen Mary University of London, London, United Kingdom, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom

Abstract

Deep learning techniques are increasingly being used to classify medical imaging data with high accuracy. Despite this, because training data are often limited, these models can lack sufficient generalizability to predict unseen test data, produced in different domains, with comparable performance. This study focuses on thyroid histopathology image classification and investigates whether a Generative Adversarial Network (GAN), trained with just 156 patient samples, can produce high-quality synthetic images to sufficiently augment training data and improve overall model generalizability. Utilizing a StyleGAN2 approach, the generative network produced images with a Fréchet Inception Distance (FID) score of 5.05, matching state-of-the-art GAN results in non-medical domains with comparable dataset sizes. Augmenting the training data with these GAN-generated images increased model generalizability when tested on external data sourced from three separate domains, improving overall precision and AUC by 7.45% and 7.20% respectively compared with a baseline model. Most importantly, this performance improvement was observed on minority class images, tumour subtypes which are known to suffer from high levels of inter-observer variability when classified by trained pathologists.

Introduction

Thyroid cancer incidence has generally been increasing since the 1970s [1]. It is currently the ninth most common cancer worldwide [2], and the 2019 Global Burden of Disease study predicted incidence will increase across all age groups for the next 20 years [3].

Differentiated thyroid cancer (DTC) includes all types of thyroid cancer that originate in the cells which produce and store thyroid hormones, and accounts for approximately 90% of thyroid cancer incidence [3, 4]. DTC generally has a good prognosis compared with undifferentiated thyroid cancers which include anaplastic and medullary thyroid cancer [5]. Within this DTC designation, the most common thyroid gland malignancy is papillary thyroid carcinoma (PTC), constituting 80–90% of diagnosed cases [3, 6]. Notably, several genetic mutations have been implicated in the pathogenesis of PTC, especially BRAF V600E [7]. The ALK gene mutation has been observed in various thyroid cancers, including PTC, follicular thyroid carcinoma (FTC), and undifferentiated anaplastic thyroid cancer [8]. Additionally, the C228T promoter mutation in the TERT gene is associated with the BRAF V600E mutation in PTC [9, 10].

Other genetic mutations reported in various thyroid cancers include RAS [11, 12], PIK3CA, AKT1, PTEN [13], mTOR [14], and chromosomal rearrangements involving the RET/PTC and PAX8-PPARɣ genes [12]. These genetic mutations can deregulate the mitogen-activated protein kinase (MAPK) and phosphatidylinositol-3 kinase (PI3K)/AKT signalling pathways, which are crucial to the pathogenesis of thyroid cancer [11]. Notably, MAPK activation is crucial for the initiation of PTC [13]. Other implicated signalling pathways include p53 and Wnt/β-catenin [13].

Distinctive nuclear features aid in the clinical classification of PTC and its variants, designated as PTC-like [15–20]. These include changes to nuclear size and shape, primarily elongation, enlargement and overlapping nuclei, as well as chromatin alterations such as “clearing, margination and glassy nuclei” [21, 22].

Whilst histopathology assessments remain the gold standard in tumour diagnosis [15, 23], there still exists significant inter-observer variability between diagnoses [24, 25]. Additionally, diagnostic accuracy is dependent on the experience of the pathologist [26], and greater patient imaging throughput is placing increasing demands on the time of these highly qualified professionals [27].

In the past two decades, the evolution of whole-slide image technology has facilitated the digital storage of high-resolution histopathological images. The ability to share high quality sample data globally has enabled the development of various machine learning approaches aimed at automating histopathological image classification. Computer-based methods have the potential to improve diagnosis speed and accuracy, as they require less overall training time than a human and have been shown to outperform experienced clinicians in various image classification tasks [28, 29]. Furthermore, an interpretable machine-learning system can be used to aid pathologist training, as well as enabling quality assurance and the assessment of both inter- and intra-observer variability [23].

Accurate diagnosis is particularly important within thyroid cancer as overdiagnosis is a known and growing issue, estimated to account for 60–90% of newly diagnosed cases [30, 31]. This overdiagnosis places needless psychological burden on patients and can lead to overtreatment, i.e., unnecessary thyroidectomy surgery [32].

Due to PTC’s prevalence and relatively clear features, combined with long-term survival rates of more than 90% [33], identifying patient samples with PTC-like nuclei is the first step in a pathologist’s diagnostic approach [34]. Machine learning approaches have therefore often focused on automating the bulk of the diagnosis burden, classifying histopathological images as PTC-like or not [18, 27, 34–36]. These methods have utilised a variety of architectures, ranging from Random Forests and Support Vector Machines (SVMs) [37–39] to deep learning Convolutional Neural Network (CNN) models [34, 40, 41].

Böhland et al. [34] directly compared seven different machine learning-based approaches for predicting the presence of PTC-like nuclei in whole-slide image patches. While the best performing method achieved 89.7% accuracy when tested on set-aside data from the same domain as the training data, it classified minority class non-PTC-like samples, sourced from a separate domain, with 46.7% accuracy.

The authors suggested this failure to generalize was due to a lack of diverse training data, a common problem when applying machine learning methods to (often small) medical imaging datasets [28]. This issue can be exacerbated when using deep learning, which typically requires large datasets to produce highly generalizable models [42, 43]. A lack of training data diversity can be especially problematic when there is a large domain gap present between datasets, i.e., differences in data distributions and/or feature representations caused by the underlying processes behind gathering and processing the data. These domain differences can obscure the underlying biological information, making it difficult to train robust models which can adapt well to the new data.

A Uniform Manifold Approximation and Projection (UMAP) [44] plot can be used to visualize the domain gap between datasets. Images from the Tharun and Thompson (T&T) [34] and Nikiforov [45] datasets used in Böhland et al.’s work were passed through a pretrained ResNet50 [46] model to obtain embeddings, before being represented in a two-dimensional latent space using UMAP (Fig 1). The separation between the two sets of embeddings demonstrates that the data distributions are clearly distinct: the differences caused by domain of origin are stronger than the similarities between samples sharing the same histopathological classification.

Fig 1. UMAP visualization of the domain gap between the T&T [34] and the Nikiforov datasets [45].

Images from both datasets were passed through a pretrained ResNet50 [46] model to obtain embedding representations. The figure shows a clear separation between the two datasets along the first UMAP dimension. This gap can impair the ability of a model trained on one dataset to generalize to the other.

https://doi.org/10.1371/journal.pone.0310417.g001
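For illustration, the following minimal sketch shows how such embeddings might be produced, assuming the umap-learn package and Torchvision’s pretrained ResNet50 weights; image loading and normalization are omitted, and the `images` tensor is a placeholder:

    import torch
    import umap  # umap-learn package
    from torchvision import models

    # Minimal sketch: `images` is assumed to be a normalized (N, 3, H, W)
    # tensor of patches drawn from both datasets.
    backbone = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
    backbone.fc = torch.nn.Identity()  # expose the 2048-d penultimate features
    backbone.eval()

    with torch.no_grad():
        embeddings = backbone(images).cpu().numpy()

    # Project the embeddings into two dimensions for plotting.
    coords = umap.UMAP(n_components=2).fit_transform(embeddings)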


One potential solution to bridge this domain gap is to apply generative adversarial networks (GANs). GANs were introduced by Goodfellow et al. [47] as a method of producing high quality synthetic (fake) images which approximate an underlying real data distribution. Adversarial training methods have previously been applied successfully to generate artificial data in MRI reconstruction and tumour segmentation [48], X-ray organ segmentation [49, 50], and virtual slide staining [51–53].

GAN-generated synthetic images have proven to be convincing representations when presented to trained medical professionals. Synthetic lung nodule samples were assessed as real 67% of the time by a radiologist with 13 years’ experience, whilst 100% were considered real by a radiologist with four years’ experience [54]. Both board-certified and trainee pathologists showed an inability to distinguish between real and synthetic ovarian carcinoma samples, choosing correctly only 54% of the time [55]. Lastly, Xue et al. [52] found that three out of four pathologists could not differentiate over half of the synthetic histopathology images produced by their GAN. Furthermore, augmenting training data with GAN-generated images has been shown to increase the performance of machine learning models when generalizing to unseen test data [56, 57].

In this paper we investigated whether a GAN could be successfully trained on a limited dataset of 156 patient histopathology slides to produce realistic synthetic images. We evaluated the impact of using these GAN-generated images to augment the original training data and measured the improvement when classifying whether held-out test samples from the same domain had PTC-like nuclei present.

In addition, we curated a new dataset, combining publicly available histopathology data from three separate domains (see Methods: Data acquisition and processing for more detail). This dataset contains histological subtypes which are frequently misclassified by trained histopathologists due to their relative scarcity or the complexity of diagnosis criteria. We assessed whether our generated synthetic images helped bridge the domain gap present between the training and test data. We aim to show that improved classification performance can be gained for these difficult minority class subtypes, with generative data augmentation allowing a deep learning model to partially overcome domain differences and focus on the biological signal present in histopathological images.

An overview of our approach is included in Fig 2.

Fig 2. An overview schematic of the process flow followed by the methods of this paper.

(A) An overview of how either a binary-class or multi-class GAN is trained to produce synthetic samples for a given label using the Tharun Thompson data. (B) How the original training data are combined with the synthetic data to produce a deep learning model which can generalize to unseen test data. Note, synthetic images are only added to the training data when GAN augmentation is being utilized; during baseline model training no synthetic samples are added.

https://doi.org/10.1371/journal.pone.0310417.g002

Results

GAN training

Table 1 shows the results of the StyleGAN2 training. The FID score improved significantly, to 5.05, when the original 1,916 x 1,053 px images were split into minimally overlapping 512 x 512 px crops to increase the amount of available training data. This score is similar to the 4.67 average score that Karras et al. (2020) [58] obtained for three 5,000-image “Animal Faces (AFHQ)” datasets. It is also notably lower than the FID score of 15.71 Karras et al. (2020) [58] achieved on the BreCaHAD dataset, which consisted of 162 breast cancer histopathology images.

Fig 3 depicts synthetic images generated by GANs trained on binary labels. It compares images produced using two different training sets: one consisting of the 1,496 centrally cropped T&T images, producing an FID score of 18.38, and another consisting of 12,038 overlapping crops from the same T&T dataset, resulting in an FID score of 5.10.

Fig 3.

Examples of PTC-like and non-PTC-like images produced after StyleGAN2 was trained conditionally using: (A) the original 1,496-image T&T dataset with 512 x 512 px crops extracted from the centre of each image, achieving an FID score of 18.38; (B) the expanded T&T dataset of 12,038 overlapping 512 x 512 px crops extracted from the original images, resulting in an FID score of 5.10.

https://doi.org/10.1371/journal.pone.0310417.g003

Deep learning classification

T&T dataset.

Table 2 displays the five-fold cross-validation performance of the model trained to classify images from the T&T dataset. A ResNet101 architecture [59] with a three-layer multi-layer perceptron was used as the classifier (see Methods: Deep learning classifier (DLC) for more detail).

Table 2. Five-fold cross-validated T&T results.

https://doi.org/10.1371/journal.pone.0310417.t002

Our baseline model achieved similar overall accuracy to both approaches in Böhland et al.’s work, classifying datapoints with 1% lower overall accuracy. Notably, the worst predicted class remains NIFTP, one of the minority classes in the data and a subtype considered difficult for trained pathologists to identify with high accuracy.

Increasing the number of samples of each binary class (PTC-like or not) using the binary GAN resulted in an increase in accuracy of 10%, with improvements across all classes. Using a multi-class GAN to specifically augment each class, equalizing the number of training examples across classes (see Methods: Synthetic data augmentation for more detail), resulted in the best performance with an accuracy of 99.38%.

NTE dataset.

As with the five-fold cross-validation results seen with the T&T data, augmenting the training data with GAN-generated samples had a positive impact on performance when tested on the external NTE data (Table 3). The ResNet101 baseline model was adept at classifying negative (non-PTC-like) images but classified the two different subtypes with PTC-like nuclei, FVPTC and NIFTP, with only 33.33% and 16% accuracy respectively. This approach therefore had a low recall score of 25.45%, illustrating that the model had not adapted well to the external domain samples.

Using a SwinV2 [60, 61] model in place of ResNet101 resulted in improved recall, to the detriment of precision and AUC, as benign samples were classified with 9.10% accuracy and overall AUC dropped to 64.31%.

Using the binary and multi-class GAN for data augmentation resulted in more balanced accuracy scores across subtypes, reducing the range of classification performance. This resulted in higher precision scores of 85.00% and 84.21% compared with the SwinV2 baseline of 77.55%. Additionally, both GAN augmentation methods improved the model’s AUC over the baseline models, showing a stronger ability for those models to discriminate between PTC-like and non-PTC-like images overall.

Discussion

In this work we successfully trained a GAN which could produce high quality synthetic thyroid histopathology images using a limited dataset of images from 156 patients. The GAN FID scores were comparable to prior state-of-the-art results for similar-sized datasets from different domains [58], showing the applicability of the StyleGAN2 approach to the medical imaging domain. We demonstrated that including these synthetic samples in the training data for a deep learning model resulted in a more robust model which could generalize more effectively to unseen external data.

The NTE dataset mirrored the key challenges present when considering deploying in-silico models in practice, focusing on the real-world issues of both data scarcity and heterogeneity. Our results support the use of GANs as a method for data augmentation in this field, offering evidence that generative models can learn some of the key characteristics of histopathological image data by approximating the underlying population distribution, improving classification performance in the absence of more real training images.

Two GAN-augmentation strategies were trialled on two held-out test sets—T&T (via five-fold cross-validation) and the NTE data, which was sourced from multiple domains. In both instances, augmenting the training data with GAN-generated synthetic samples proved to be beneficial.

In the case of the T&T data, classification accuracy increased by 10% with the binary GAN and 11% with the multi-class GAN compared with the baseline created by this paper and the original models from Böhland et al.’s research [34]. Even with only 156 different patient samples, the baseline classification accuracy of 88.13% was high, so the additional diversity and volume of images added through GAN augmentation enabled the classifier to achieve high accuracy across all classes.

Both GAN strategies notably increased the ability of the model to classify the minority PTC-like samples, FVPTC and NIFTP, which were relatively poorly classified by the baseline. These subtypes are much more difficult to identify because the PTC-like features can be found within encapsulated follicular lesions, are less numerous and distinct, and are often surrounded by benign-appearing cells [20, 62–66]. Increasing the frequency of training data via GAN data augmentation appears to enable the model to pick up on these more subtle aspects in the underlying data, resulting in more robust models across all classes.

Despite the high accuracy of the models tested on the T&T data, when the same ResNet101-based model (see Baseline: ResNet101 in Table 3) was applied to the NTE dataset it performed poorly at predicting which samples had PTC-like nuclei present. This baseline model classified FVPTC and NIFTP samples with 33.33% and 16% accuracy respectively. Whilst it predicted the non-PTC-like samples with high accuracy, this reflects the model’s inability to distinguish between classes: it simply classified most datapoints as non-PTC-like, resulting in a low recall score of 25.45%.

To improve classification performance, the ResNet architecture was switched for a SwinV2 model architecture, which has proven effective within the medical domain at capturing phenotypic differences in microscopy images [67, 68]. The baseline SwinV2 model achieved a higher recall score compared with the ResNet approach; however, it was unable to differentiate between the NIFTP and benign (B) samples from the Nikiforov data, classifying all but three samples as PTC-like. The model was therefore still unable to bridge the domain gap observed in Fig 1 between the T&T training data and the Nikiforov external domain data included within the NTE test dataset.

Augmenting the training data using either the GAN trained to produce binary class samples or multi-class samples improved the stability of classification accuracy across subtypes. The binary GAN showed the greatest ability to differentiate between the PTC-like (NIFTP) and non-PTC-like (B) samples from the Nikiforov data, classifying them with 76% and 63.64% accuracy respectively.

Both models trained with GAN-augmented data achieved higher precision scores compared with the SwinV2 baseline. Precision is particularly important within thyroid diagnosis given the issues in the field with overdiagnosis and the patient and treatment burden associated with false positive results [30–32]. Additionally, the AUC scores for both GAN-augmented models improved by 7.20% (binary) and 5.92% (multi-class) over the SwinV2 baseline, demonstrating their increased classification robustness across subtypes within the binary class designations.

The NTE dataset was created as a difficult generalization test for a model, given the range of demographics, batch effects and other domain differences across its samples. Interestingly, the FA class is well predicted across all models; however, Fig 4 (see Methods: NTE dataset) shows the Eftimie data (FA samples) is more similar to the T&T training data than the Nikiforov (NIFTP, B) or TCGA (FVPTC) data, so the models are better able to generalize to these samples.

Fig 4. Images from the T&T and NTE datasets were passed through a pre-trained ResNet101 classifier to produce embeddings.

These embeddings were then visualized using Uniform Manifold Approximation and Projection (UMAP) and coloured according to: (A) dataset domain, referring to the paper which produced and shared the original images; (B) classification as having PTC-like or non-PTC-like nuclei present; (C) subtype diagnosis. As can be seen in (A), the Eftimie FA samples are more similar to the T&T data, as represented by their closeness in two-dimensional space, than the Nikiforov or TCGA samples are; models will therefore likely generalize better from the T&T data to the Eftimie data than they will to the Nikiforov or TCGA data.

https://doi.org/10.1371/journal.pone.0310417.g004

We have shown in our work that without generative data augmentation our models fail to generalize well to external data where there are significant domain differences, i.e., the Nikiforov and TCGA samples within the NTE dataset. Whilst classification accuracy is not high across all subtypes, these are difficult classifications for even a trained pathologist to make, so any increase in a model’s precision or AUC represents an important contribution to the field.

For future work, an alternative solution to bridging the domain gap between datasets could be to use a style-transfer GAN. This approach aims to separate images into “content” and “style” spaces [69]. In the context of this report, content would represent the various structural features of the histopathology images, whilst the domain differences, such as the stain colour or resolution, would be considered the style. Additionally, creating class-specific connected generative models could increase accuracy across the minority subtypes.

Finally, one criticism of deep learning architectures is that they function like “black box” models. Explainable AI is particularly important in healthcare as accountability, trust and understanding bias are integral components to the functioning of the system as a whole [59, 70]. Using a self-attention GAN (SAGAN) [71] could partially alleviate these issues. The attention mechanism [72] can be visualized to provide feedback regarding which parts of the image were most important for the model’s classification decision. This could form part of an important feedback loop for a pathologist to understand and interpret the model output, increasing trust in its ability to accurately classify pathological or other medical images.

Materials and methods

Data acquisition and processing

Tharun and Thompson (“T&T”) dataset.

The dataset comprises whole-slide images (WSIs) of thyroid gland tumours from 156 patients, 138 sourced from the University Clinic Schleswig-Holstein, Campus Luebeck, and 18 from the Woodland Hills Medical Centre, California [34]. Two pathologists agreed on the classification of each tumour, before 1,916 x 1,053 px crops were extracted from the identified neoplastic regions of interest. Each image has an objective magnification factor of 40x and a resolution of 0.23 μm/px. The dataset was requested by emailing sekretariat.patho@uksh.de. See Table 4 for additional information.

PTC samples constitute the majority of the positive “PTC-like” class, whilst the FVPTC and NIFTP subtypes, which are considered much more difficult to diagnose due to their less distinctive nuclear features [34], are minority samples within the data—mimicking their real-life comparative scarcity. The “Non-PTC-like” class consists of the two most common diagnoses which lack PTC-like nuclei—FA and FTC. Additional detail about these classifications can be found in the S1 File.

NTE dataset.

The NTE dataset (Table 5) comprises WSIs from three separate domains. Firstly, 36 samples (25 non-invasive follicular thyroid neoplasm with papillary-like nuclear features, 11 benign) were obtained from “Box A” of the Nikiforov online repository, relating to research performed by Nikiforov et al. [45]. The study accepted WSI contributions from 13 institutions across six different countries, before a panel of 24 expert thyroid pathologists determined each slide’s classification. The research sought to establish consensus diagnostic criteria for classifying NIFTP as a separate subcategory and therefore accepted many borderline cases which were considered difficult to diagnose even by expert pathologists [34, 45]. The images were processed at a resolution of 0.49 μm/px and a magnification of 40x.

Thirty-one follicular variant of papillary thyroid carcinoma (FVPTC) samples were sourced from The Cancer Genome Atlas (TCGA) Thyroid Carcinoma study. This study consists of 507 different patient samples, collected from 20 separate tissue source sites. Each patient was originally diagnosed with PTC, before a board-certified pathologist assigned fine-grained subtyping to each sample [73]. Within the TCGA data there is a range of magnifications and resolutions, according to the equipment used at each source site. Finally, 12 follicular thyroid adenoma (FA) samples were obtained by emailing the corresponding author of Eftimie et al. [15]. These were originally imaged at 20x magnification (see S2 File for further information regarding sample selection).

The NTE dataset was formed specifically to provide a robust test of a model’s generalizability. The FVPTC and NIFTP subtypes are commonly misdiagnosed by expert clinicians, and therefore are most important to be able to predict. These two subtypes account for 9 samples each within the T&T training dataset (Table 4), mimicking the severe lack of training data which is often prohibitive to highly generalizable models in practice. Lastly, the NTE dataset has high heterogeneity in terms of staining, resolution, magnification, and image quality, due to the combination of WSI patches from multiple different domains each with differing instruments and collection procedures. Across the data sources there are also a range of patients from diverse hospitals with varied demographics.

Table 5. Sample subtypes present in the NTE dataset, comprised of images sourced from the research by Nikiforov et al. [45], Eftimie et al. [15] and the TCGA Thyroid Carcinoma study [73]. The table includes each sample’s binary classification as having PTC-like nuclei or not, as well as the number of patient samples for that designation.

Fig 5 shows PTC-like and non-PTC-like examples from both the T&T and NTE datasets, displaying the range of staining and resolution present.

Fig 5. Example 512 x 512 px image crops taken from the four different sources which comprise the T&T and the NTE datasets.

The classification and dataset identifier of each sample is as follows: (A) T&T—Papillary thyroid carcinoma (Dataset ID: 47h_5), (B) Nikiforov—Noninvasive follicular thyroid neoplasm with papillary-like nuclear features (Dataset ID: NIK-A079_0), (C) TCGA—Papillary carcinoma, follicular variant (Dataset ID: TCGA-EM-A4FH_17), (D) Eftimie—Follicular thyroid adenoma (Dataset ID: EFT-552053_6).

https://doi.org/10.1371/journal.pone.0310417.g005

NTE data pre-processing.

WSIs for each sample in the dataset were downloaded, before a trained pathologist identified the neoplastic regions of interest within each slide which were indicative of the underlying classification (S3 File). From these regions, the OpenSlide ‘DeepZoomGenerator’ class was used to extract non-overlapping 512 x 512 px crops, before 20 from each sample were selected at random for inclusion within the dataset.
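A sketch of this extraction step is shown below, assuming a local WSI file path; region-of-interest filtering is omitted, and in practice tiles would be drawn only from the pathologist-annotated regions:

    import random
    import openslide
    from openslide.deepzoom import DeepZoomGenerator

    slide = openslide.OpenSlide("sample.svs")  # hypothetical WSI path
    tiles = DeepZoomGenerator(slide, tile_size=512, overlap=0, limit_bounds=True)

    level = tiles.level_count - 1              # highest-resolution Deep Zoom level
    cols, rows = tiles.level_tiles[level]
    addresses = [(c, r) for c in range(cols) for r in range(rows)]

    # Select 20 random non-overlapping crops per sample, as described above.
    for col, row in random.sample(addresses, 20):
        tile = tiles.get_tile(level, (col, row))   # PIL.Image, 512 x 512 px
        tile.save(f"crop_{col}_{row}.png")         # edge tiles may be smaller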

Generative Adversarial Network (GAN)

GANs are comprised of two neural network models, referred to as the generator and the discriminator. The generator aims to learn to approximate the underlying distribution of the training data to generate high-quality synthetic images. The discriminator conversely learns to discern the difference between these GAN-generated fakes and the real images [47].

The two networks compete during training to improve at their respective roles, producing realistic synthetic examples and detecting fake images. An equilibrium is reached when the generator is producing images which are indistinguishable from the underlying real data, and thus the discriminator can make predictions about whether an image is real or fake with only 50% accuracy.
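To make this adversarial objective concrete, the sketch below shows a minimal non-saturating GAN update step in PyTorch. It is illustrative only: StyleGAN2 layers many refinements (a style-based generator, R1 and path-length regularization) on top of this basic scheme.

    import torch
    import torch.nn as nn

    bce = nn.BCEWithLogitsLoss()

    def gan_step(G, D, real, opt_g, opt_d, z_dim=512):
        """One adversarial update: D learns real vs fake, G learns to fool D."""
        z = torch.randn(real.size(0), z_dim)
        fake = G(z)
        ones = torch.ones(real.size(0), 1)
        zeros = torch.zeros(real.size(0), 1)

        # Discriminator step: push D(real) towards 1 and D(fake) towards 0.
        opt_d.zero_grad()
        d_loss = bce(D(real), ones) + bce(D(fake.detach()), zeros)
        d_loss.backward()
        opt_d.step()

        # Generator step: push D(fake) towards 1 (fool the discriminator).
        opt_g.zero_grad()
        g_loss = bce(D(fake), ones)
        g_loss.backward()
        opt_g.step()
        return d_loss.item(), g_loss.item()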

The StyleGAN2 framework [74] with adaptive discriminator augmentation (StyleGAN2-ADA) [58] was selected because it was specifically designed to produce high-quality synthetic images from limited training data. It utilizes an extensive array of 18 transformations to augment the discriminator network’s training data, adapting the level of augmentation based on feedback from an overfitting heuristic during model training.

The PyTorch implementation of StyleGAN2 was adapted from the official StyleGAN2-ADA GitHub repository and trained with the following parameters:

  • “cfg = paper512” to mirror the parameter settings used by Karras et al. (2020) [58] for the BreCaHAD dataset, a small dataset containing 162 breast cancer histopathology images [75].
  • “cond = 1” ensures the GAN is trained conditionally using the labels provided, and so is subsequently able to produce images for a given class.
  • “mirror = 1” includes x-flips of each image in the dataset, effectively doubling the training images.
  • “kimg = 25000” trains the GAN until up to 25,000 kimg (25 million real images) have been shown to the discriminator. All GANs tested in the StyleGAN2-ADA paper [58] were shown to produce their highest quality images before this point in training.

The GAN was trained with two different conditional labelling approaches to assess the quality and diversity of the synthetic images produced:

  • Binary: training samples were labelled to be either PTC-like (1.0 label) or non-PTC-like (0.0).
  • Multi-class: labelled according to their individual subtypes, being PTC (0.0 label), NIFTP (1.0), FVPTC (2.0), FA (3.0) and FTC (4.0). This enables the trained GAN to produce synthetic images of any given subtype.
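As an illustration of how these settings might be combined in practice, the sketch below follows the stylegan2-ada-pytorch convention of supplying conditional labels through a dataset.json file placed alongside the images; the file names and paths are hypothetical:

    import json

    # Hypothetical label file for conditional training; the multi-class
    # scheme maps PTC = 0, NIFTP = 1, FVPTC = 2, FA = 3, FTC = 4.
    labels = {"labels": [
        ["ptc_patch_0001.png", 0],
        ["niftp_patch_0001.png", 1],
    ]}
    with open("tt_crops/dataset.json", "w") as f:
        json.dump(labels, f)

    # Training is then launched with the parameters listed above, e.g.:
    #   python train.py --outdir=runs --data=tt_crops.zip --gpus=1 \
    #       --cfg=paper512 --cond=1 --mirror=1 --kimg=25000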

GAN pre-processing and evaluation.

The original images in the T&T dataset are 1,916 x 1,053 px. The GAN was trained to produce 512 x 512 px image patches, which are subsequently used to augment training data for a deep learning classifier (DLC) operating on images of the same size. To ensure that the maximum amount of data was made available to train the GAN, the T&T images were split into equal 512 x 512 px patches with minimal overlap. This increased the number of training images from 1,496 to 12,038.
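The paper does not specify the exact tiling scheme, but one way to compute such a minimally overlapping grid is sketched below:

    import math

    def crop_origins(size: int, patch: int = 512) -> list[int]:
        """Evenly spaced origins covering `size` pixels with minimal overlap."""
        n = math.ceil(size / patch)         # fewest patches that cover the axis
        if n == 1:
            return [0]
        step = (size - patch) / (n - 1)     # shrink the stride so patches fit exactly
        return [round(i * step) for i in range(n)]

    # Under this scheme a 1,916 x 1,053 px image yields a 4 x 3 grid of
    # 512 px crops; the paper's exact tiling may differ.
    xs, ys = crop_origins(1916), crop_origins(1053)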

GAN training can be notoriously unstable, as the models are prone to overfitting and mode collapse [55]. To assess progression, batches of synthetic images were manually assessed at fixed intervals and compared to real images. Additionally, the Fréchet Inception Distance (FID) [76] was used as the programmatic evaluation criterion for the generator model. FID computes the Fréchet distance [77] between multivariate Gaussian approximations of both the real and generated image distributions.

A lower FID score implies a greater alignment between the synthetic and real images and has been correlated with human judgement regarding image quality [78]. Despite this, the metric is not considered perfect, and several papers have warned against over-reliance on FID to assess GAN improvement [58, 78].
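Concretely, given the means μ_r, μ_g and covariances Σ_r, Σ_g of the Inception features of the real and generated images, FID = ||μ_r − μ_g||² + Tr(Σ_r + Σ_g − 2(Σ_r Σ_g)^1/2). A minimal NumPy/SciPy evaluation of this formula (feature extraction omitted) is:

    import numpy as np
    from scipy import linalg

    def fid(mu_r, sigma_r, mu_g, sigma_g):
        """Fréchet distance between Gaussians fitted to Inception features."""
        diff = mu_r - mu_g
        covmean = linalg.sqrtm(sigma_r @ sigma_g)   # matrix square root
        if np.iscomplexobj(covmean):                # discard numerical imaginary parts
            covmean = covmean.real
        return float(diff @ diff + np.trace(sigma_r + sigma_g - 2.0 * covmean))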

In this paper FID was only used as the target metric for GAN training. The true assessment of the quality of the generated synthetic images will be whether their addition to the real images during training improves the classification performance of a deep learning model when tested on unseen data. Our research therefore assesses the impact of GAN samples in an applied scenario, rather than purely evaluating the generative technique in isolation.

Deep Learning Classification (DLC) model

The DLC model is based on the research performed by Böhland et al. (2021) [34] using the T&T dataset. A non-pre-trained version of ResNet101 [59] or SwinV2 [61] was loaded using the Torchvision module, and the final output layer was replaced by a three-layer multi-layer perceptron (MLP) which outputs a binary prediction: whether the image is PTC-like or not. The model was then trained using the histopathology images in the T&T training set. The Albumentations package was used to apply random cropping, flipping, adaptive histogram equalization, blurring, Gaussian noise, and Fourier Domain Adaptation [80] transformations.
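A minimal sketch of this architecture swap using the Torchvision constructors is shown below; the hidden-layer widths of the MLP head are illustrative assumptions, as the paper does not report them:

    import torch.nn as nn
    from torchvision import models

    def build_classifier(arch: str = "resnet101", hidden: int = 512) -> nn.Module:
        if arch == "resnet101":
            model = models.resnet101(weights=None)   # non-pre-trained backbone
            in_features, head_attr = model.fc.in_features, "fc"
        else:
            model = models.swin_v2_b(weights=None)
            in_features, head_attr = model.head.in_features, "head"
        mlp = nn.Sequential(                         # three-layer MLP head
            nn.Linear(in_features, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden // 2), nn.ReLU(),
            nn.Linear(hidden // 2, 2),               # binary: PTC-like or not
        )
        setattr(model, head_attr, mlp)
        return model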

A grid search was used to find optimal parameters for the model. These included an initial learning rate of 1e-3, with a decay of 5e-1 if the validation loss did not decrease for 10 epochs. Early stopping patience was set at 50 epochs, and the model trained for a maximum of 100 epochs. The Adam optimizer [81] and cross-entropy loss were used. This setup utilized one GPU and a batch size of 32 to obtain the most stable training results.
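These settings map onto standard PyTorch components roughly as follows (a sketch; build_classifier is the hypothetical helper from the previous snippet):

    import torch

    model = build_classifier("resnet101")
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    # Decay the learning rate by 5e-1 when validation loss stalls for 10 epochs.
    scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
        optimizer, mode="min", factor=0.5, patience=10)
    criterion = torch.nn.CrossEntropyLoss()
    # scheduler.step(val_loss) is called once per epoch in the training loop.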

Five-fold cross-validation was used to evaluate the model’s performance on the T&T dataset. Each data split contained 60% for training, 20% for validation and 20% for testing. The splits were shuffled and stratified according to classification subtype, ensuring that slide patches from the same patient were retained within the same split to avoid data leakage. There are 156 patients in the T&T data, but each patient has multiple image slides (with the same diagnosis) associated with them. The accuracy metrics are therefore calculated at a patient level, rather than at a slide level. The final classification is determined by majority voting; in the case of a 50:50 split decision between a patient’s slides, the wrong class is assigned to the patient.
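Scikit-learn’s StratifiedGroupKFold offers one way to build such patient-grouped, subtype-stratified splits; in the sketch below, `labels` and `patient_ids` are assumed per-slide arrays:

    import numpy as np
    from collections import Counter
    from sklearn.model_selection import StratifiedGroupKFold

    sgkf = StratifiedGroupKFold(n_splits=5, shuffle=True, random_state=0)
    for train_idx, test_idx in sgkf.split(np.zeros(len(labels)), labels,
                                          groups=patient_ids):
        # All slides from a given patient fall on one side of the split,
        # preventing leakage between training and test folds.
        ...

    def patient_vote(slide_predictions):
        """Majority vote over a patient's slide-level predictions."""
        top_two = Counter(slide_predictions).most_common(2)
        if len(top_two) == 2 and top_two[0][1] == top_two[1][1]:
            return "tie"  # a 50:50 split is scored as a misclassification
        return top_two[0][0]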

To evaluate the model’s generalizability to the external NTE test dataset, the model was trained using 80% of the original T&T data, with the remaining 20% used as the validation set. Precision, recall and area under the curve (AUC) are used as additional metrics due to the class imbalance between PTC-like and non-PTC-like samples and between the subtype designations within the NTE dataset. In both instances, if GAN-generated samples are used, they are included within the training data only.

Synthetic data augmentation

The binary GAN approach uses a GAN trained with binary classification labels (PTC-like or non-PTC-like) and increases the frequency of each of these binary class images by 100% in the training data, thus retaining the original subtype class imbalance.

The multi-class GAN was trained to produce images of all five subtypes in the T&T training data. This method augments the subtype with the most real images (FA) by 100%, and then equalizes all other subtypes to that level, resulting in all classification subtypes having an equal number of training images.
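As a worked example of the multi-class strategy, with hypothetical per-class counts rather than the paper’s exact figures:

    # Hypothetical real training counts per subtype (illustrative only).
    real_counts = {"PTC": 600, "NIFTP": 90, "FVPTC": 90, "FA": 700, "FTC": 300}

    # Multi-class strategy: double the largest class, then raise every
    # subtype to that same level with GAN-generated samples.
    target = 2 * max(real_counts.values())
    synthetic_needed = {cls: target - n for cls, n in real_counts.items()}
    # e.g. FA needs 700 synthetic images, NIFTP needs 1,310, and so on.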

Supporting information

S1 File. Additional information regarding subtype classifications.

Detailed information pertaining to the thyroid tumour classification subtypes which are relevant to this report, including WHO 2022 designations.

https://doi.org/10.1371/journal.pone.0310417.s001

(DOCX)

S2 File. NTE dataset selection.

Additional details about the selection of the samples included within the NTE dataset.

https://doi.org/10.1371/journal.pone.0310417.s002

(DOCX)

S3 File. Neoplastic region identification.

An overview of the selection of neoplastic regions from the downloaded WSIs.

https://doi.org/10.1371/journal.pone.0310417.s003

(DOCX)

S4 File. GAN augmentations.

Tables displaying the number of GAN-generated synthetic samples included during model training for each augmentation strategy in the report.

https://doi.org/10.1371/journal.pone.0310417.s004

(DOCX)

Acknowledgments

The authors would like to thank Ryan Reavette for his guidance and feedback during the project, as well as the initial inspiration for the topic on GANs. We would also like to thank Jamie Holdstock for his contribution towards developing ‘ripsvs’, a tool to download image patches from online-hosted Aperio ImageScope WSIs.

References

  1. La Vecchia C, Malvezzi M, Bosetti C, Garavello W, Bertuccio P, Levi F, et al. Thyroid cancer mortality and incidence: A global overview. International Journal of Cancer. 2015;136(9):2187–95. pmid:25284703
  2. Zhai M, Zhang D, Long J, Gong Y, Ye F, Liu S, et al. The global burden of thyroid cancer and its attributable risk factor in 195 countries and territories: A systematic analysis for the Global Burden of Disease Study. Cancer Med. 2021 May 18;10(13):4542–54. pmid:34002931
  3. Cheng F, Xiao J, Shao C, Huang F, Wang L, Ju Y, et al. Burden of Thyroid Cancer From 1990 to 2019 and Projections of Incidence and Mortality Until 2039 in China: Findings From Global Burden of Disease Study. Frontiers in Endocrinology [Internet]. 2021 [cited 2022 Jul 18];12. Available from: https://www.frontiersin.org/articles/10.3389/fendo.2021.738213
  4. Han L, Li W, Li Y, Wen W, Yao Y, Wang Y. Total thyroidectomy is superior for initial treatment of thyroid cancer. Asia Pac J Clin Oncol. 2021 Oct;17(5):e170–5. pmid:32757466
  5. Paschke R, Lincke T, Müller SP, Kreissl MC, Dralle H, Fassnacht M. The Treatment of Well-Differentiated Thyroid Carcinoma. Dtsch Arztebl Int. 2015 Jun;112(26):452–8. pmid:26205749
  6. Rossi ED, Pantanowitz L, Hornick JL. A worldwide journey of thyroid cancer incidence centred on tumour histology. Lancet Diabetes Endocrinol. 2021 Apr;9(4):193–4. pmid:33662332
  7. Murugan AK, Qasem E, Al-Hindi H, Shi Y, Alzahrani AS. Classical V600E and other non-hotspot BRAF mutations in adult differentiated thyroid cancer. J Transl Med. 2016 Jul 7;14(1):204. pmid:27387551
  8. Murugan AK, Xing M. Anaplastic Thyroid Cancers Harbor Novel Oncogenic Mutations of the ALK Gene. Cancer Res. 2011 Jul 1;71(13):4403–11. pmid:21596819
  9. Xing M, Alzahrani AS, Carson KA, Shong YK, Kim TY, Viola D, et al. Association between BRAF V600E mutation and recurrence of papillary thyroid cancer. J Clin Oncol. 2015 Jan 1;33(1):42–50. pmid:25332244
  10. Liu R, Zhang T, Zhu G, Xing M. Regulation of mutant TERT by BRAF V600E/MAP kinase pathway through FOS/GABP in human cancer. Nat Commun. 2018 Feb 8;9(1):579. pmid:29422527
  11. Howell GM, Hodak SP, Yip L. RAS Mutations in Thyroid Cancer. Oncologist. 2013 Aug;18(8):926–32. pmid:23873720
  12. Nikiforov YE. Thyroid carcinoma: molecular pathways and therapeutic targets. Mod Pathol. 2008 May;21 Suppl 2(Suppl 2):S37–43. pmid:18437172
  13. Prete A, Borges de Souza P, Censi S, Muzza M, Nucci N, Sponziello M. Update on Fundamental Mechanisms of Thyroid Cancer. Front Endocrinol [Internet]. 2020 Mar 13 [cited 2024 Aug 8];11. Available from: https://www.frontiersin.org/journals/endocrinology/articles/10.3389/fendo.2020.00102/full pmid:32231639
  14. Murugan AK, Humudh EA, Qasem E, Al-Hindi H, Almohanna M, Hassan ZK, et al. Absence of somatic mutations of the mTOR gene in differentiated thyroid cancer. Meta Gene. 2015 Dec;6:69–71. pmid:26504747
  15. Eftimie LG, Glogojeanu RR, Tejaswee A, Gheorghita P, Stanciu SG, Chirila A, et al. Differential diagnosis of thyroid nodule capsules using random forest guided selection of image features. Sci Rep. 2022 Dec 14;12(1):21636. pmid:36517531
  16. McHenry CR, Phitayakorn R. Follicular Adenoma and Carcinoma of the Thyroid Gland. Oncologist. 2011 May;16(5):585–93. pmid:21482585
  17. Baloch ZW, LiVolsi VA. Follicular-Patterned Lesions of the Thyroid: The Bane of the Pathologist. American Journal of Clinical Pathology. 2002 Jan 1;117(1):143–50. pmid:11789719
  18. LiVolsi VA. Papillary thyroid carcinoma: an update. Modern Pathology. 2011 Jan 1;24:S1–9. pmid:21455196
  19. Baloch ZW, Asa SL, Barletta JA, Ghossein RA, Juhlin CC, Jung CK, et al. Overview of the 2022 WHO Classification of Thyroid Neoplasms. Endocr Pathol. 2022 Mar 1;33(1):27–63. pmid:35288841
  20. Pillai S, Gopalan V, Smith RA, Lam AKY. Diffuse sclerosing variant of papillary thyroid carcinoma—an update of its clinicopathological features and molecular biology. Critical Reviews in Oncology/Hematology. 2015 Apr 1;94(1):64–73. pmid:25577570
  21. Singh K, Pujani M, Chauhan V, Agarwal C, Dhingra S. Encapsulated Follicular Variant of Papillary Thyroid Carcinoma Arising in a Follicular Adenoma: a Diagnostic Dilemma. Indian J Surg Oncol. 2018 Sep;9(3):414–7. pmid:30288010
  22. Romei C, Elisei R. RET/PTC Translocations and Clinico-Pathological Features in Human Papillary Thyroid Carcinoma. Frontiers in Endocrinology [Internet]. 2012 [cited 2023 May 11];3. Available from: https://www.frontiersin.org/articles/10.3389/fendo.2012.00054 pmid:22654872
  23. Jose L, Liu S, Russo C, Nadort A, Di Ieva A. Generative Adversarial Networks in Digital Pathology and Histopathological Image Processing: A Review. Journal of Pathology Informatics. 2021 Jan 1;12(1):43. pmid:34881098
  24. Paech DC, Weston AR, Pavlakis N, Gill A, Rajan N, Barraclough H, et al. A Systematic Review of the Interobserver Variability for Histology in the Differentiation between Squamous and Nonsquamous Non-small Cell Lung Cancer. Journal of Thoracic Oncology. 2011 Jan 1;6(1):55–63. pmid:21107286
  25. van den Bent MJ. Interobserver variation of the histopathological diagnosis in clinical trials on glioma: a clinician’s perspective. Acta Neuropathol. 2010 Sep 1;120(3):297–304. pmid:20644945
  26. Trueblood JS, Holmes WR, Seegmiller AC, Douds J, Compton M, Szentirmai E, et al. The impact of speed and bias on the cognitive processes of experts and novices in medical image decision-making. Cogn Res Princ Implic. 2018 Jul 4;3:28.
  27. El-Hossiny AS, Al-Atabany W, Hassan O, Soliman AM, Sami SA. Classification of Thyroid Carcinoma in Whole Slide Images Using Cascaded CNN. IEEE Access. 2021;9:88429–38.
  28. Giger ML. Machine Learning in Medical Imaging. Journal of the American College of Radiology. 2018 Mar 1;15(3, Part B):512–20. pmid:29398494
  29. Erickson BJ, Korfiatis P, Akkus Z, Kline TL. Machine Learning for Medical Imaging. Radiographics. 2017 Mar;37(2):505–15. pmid:28212054
  30. Li M, Zheng R, Maso LD, Zhang S, Wei W, Vaccarella S. Mapping overdiagnosis of thyroid cancer in China. The Lancet Diabetes & Endocrinology. 2021 Jun 1;9(6):330–2. pmid:33891886
  31. Li M, Dal Maso L, Vaccarella S. Global trends in thyroid cancer incidence and the impact of overdiagnosis. Lancet Diabetes Endocrinol. 2020 Jun;8(6):468–70. pmid:32445733
  32. Jegerlehner S, Bulliard JL, Aujesky D, Rodondi N, Germann S, Konzelmann I, et al. Overdiagnosis and overtreatment of thyroid cancer: A population-based temporal trend study. PLoS One. 2017 Jun 14;12(6):e0179387. pmid:28614405
  33. Mazzaferri EL. An overview of the management of papillary and follicular thyroid carcinoma. Thyroid. 1999;9(5):421–7 [cited 2022 Apr 25]. https://www.liebertpub.com/doi/epdf/10.1089/thy.1999.9.421
  34. Böhland M, Tharun L, Scherr T, Mikut R, Hagenmeyer V, Thompson LDR, et al. Machine learning methods for automated classification of tumors with papillary thyroid carcinoma-like nuclei: A quantitative analysis. PLoS One. 2021 Sep 22;16(9):e0257635. pmid:34550999
  35. Thompson LD. Ninety-four cases of encapsulated follicular variant of papillary thyroid carcinoma: A name change to Noninvasive Follicular Thyroid Neoplasm with Papillary-like Nuclear Features would help prevent overtreatment. Mod Pathol. 2016 Jul;29(7):698–707. pmid:27102347
  36. Angel Arul Jothi J, Mary Anita Rajam V. Effective segmentation and classification of thyroid histopathology images. Applied Soft Computing. 2016 Sep 1;46:652–64.
  37. Chang CY, Chen SJ, Tsai MF. Application of support-vector-machine-based method for feature selection and classification of thyroid nodules in ultrasound images. Pattern Recognition. 2010 Oct 1;43(10):3494–506.
  38. Poudel P, Illanes A, Ataide EJG, Esmaeili N, Balakrishnan S, Friebe M. Thyroid Ultrasound Texture Classification Using Autoregressive Features in Conjunction With Machine Learning Approaches. IEEE Access. 2019;7:79354–65.
  39. Chen D, Hu J, Zhu M, Tang N, Yang Y, Feng Y. Diagnosis of thyroid nodules for ultrasonographic characteristics indicative of malignancy using random forest. BioData Mining. 2020 Sep 3;13(1):14. pmid:32905307
  40. Buddhavarapu VG, AAJ J. An experimental study on classification of thyroid histopathology images using transfer learning. Pattern Recognition Letters. 2020 Dec 1;140:1–9.
  41. Wang Y, Guan Q, Lao I, Wang L, Wu Y, Li D, et al. Using deep convolutional neural networks for multi-classification of thyroid tumor by histopathology: a large-scale pilot study. Ann Transl Med. 2019 Sep;7(18):468. pmid:31700904
  42. Najafabadi MM, Villanustre F, Khoshgoftaar TM, Seliya N, Wald R, Muharemagic E. Deep learning applications and challenges in big data analytics. Journal of Big Data. 2015 Feb 24;2(1):1.
  43. Liu Z, Mao H, Wu CY, Feichtenhofer C, Darrell T, Xie S. A ConvNet for the 2020s [Internet]. arXiv; 2022 [cited 2023 Apr 5]. http://arxiv.org/abs/2201.03545
  44. McInnes L, Healy J, Melville J. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv:180203426 [cs, stat] [Internet]. 2020 Sep 17 [cited 2022 Apr 25]; http://arxiv.org/abs/1802.03426
  45. Nikiforov YE, Seethala RR, Tallini G, Baloch ZW, Basolo F, Thompson LDR, et al. Nomenclature Revision for Encapsulated Follicular Variant of Papillary Thyroid Carcinoma: A Paradigm Shift to Reduce Overtreatment of Indolent Tumors. JAMA Oncology. 2016 Aug 1;2(8):1023–9. pmid:27078145
  46. He K, Zhang X, Ren S, Sun J. Deep Residual Learning for Image Recognition [Internet]. arXiv; 2015 [cited 2022 Aug 2]. http://arxiv.org/abs/1512.03385
  47. Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, et al. Generative Adversarial Networks. arXiv:14062661 [cs, stat] [Internet]. 2014 Jun 10 [cited 2022 Apr 13]; http://arxiv.org/abs/1406.2661
  48. Ali H, Biswas MdR, Mohsen F, Shah U, Alamgir A, Mousa O, et al. The role of generative adversarial networks in brain MRI: a scoping review. Insights into Imaging. 2022 Jun 4;13(1):98. pmid:35662369
  49. Dai W, Doyle J, Liang X, Zhang H, Dong N, Li Y, et al. SCAN: Structure Correcting Adversarial Network for Organ Segmentation in Chest X-rays [Internet]. arXiv; 2017 [cited 2022 Aug 2]. http://arxiv.org/abs/1703.08770
  50. Ciano G, Andreini P, Mazzierli T, Bianchini M, Scarselli F. A multi-stage GAN for multi-organ chest X-ray image generation and segmentation. Mathematics. 2021 Nov 14;9(22):2896.
  51. Bayramoglu N, Kaakinen M, Eklund L, Heikkila J. Towards Virtual H&E Staining of Hyperspectral Lung Histology Images Using Conditional Generative Adversarial Networks. In: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW); 2017 [cited 2022 Aug 2]. p. 64–71. https://openaccess.thecvf.com/content_ICCV_2017_workshops/w1/html/Bayramoglu_Towards_Virtual_HE_ICCV_2017_paper.html
  52. Xue Y, Zhou Q, Ye J, Long LR, Antani S, Cornwell C, et al. Synthetic Augmentation and Feature-Based Filtering for Improved Cervical Histopathology Image Classification. In: Shen D, Liu T, Peters TM, Staib LH, Essert C, Zhou S, et al., editors. Medical Image Computing and Computer Assisted Intervention—MICCAI 2019. Cham: Springer International Publishing; 2019. p. 387–96. (Lecture Notes in Computer Science).
  53. Boyd J, Villa I, Mathieu MC, Deutsch E, Paragios N, Vakalopoulou M, et al. Region-guided CycleGANs for Stain Transfer in Whole Slide Images [Internet]. arXiv; 2022 [cited 2023 Apr 5]. http://arxiv.org/abs/2208.12847
  54. Chuquicusma MJM, Hussein S, Burt J, Bagci U. How to fool radiologists with generative adversarial networks? A visual turing test for lung cancer diagnosis. In: 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018). 2018. p. 240–4.
  55. Levine AB, Peng J, Farnell D, Nursey M, Wang Y, Naso JR, et al. Synthesis of diagnostic quality cancer pathology images by generative adversarial networks. The Journal of Pathology. 2020;252(2):178–88. pmid:32686118
  56. Liu S, Shah Z, Sav A, Russo C, Berkovsky S, Qian Y, et al. Isocitrate dehydrogenase (IDH) status prediction in histopathology images of gliomas using deep learning. Sci Rep. 2020 May 7;10(1):7733. pmid:32382048
  57. Guan S, Loew M. Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks. J Med Imaging (Bellingham). 2019 Jul;6(3):031411. pmid:30915386
  58. Karras T, Aittala M, Hellsten J, Laine S, Lehtinen J, Aila T. Training Generative Adversarial Networks with Limited Data [Internet]. arXiv; 2020 [cited 2022 Jul 22]. http://arxiv.org/abs/2006.06676
  59. He K, Zhang X, Ren S, Sun J. Deep Residual Learning for Image Recognition [Internet]. arXiv; 2015 [cited 2024 Aug 8]. http://arxiv.org/abs/1512.03385
  60. Liu Z, Lin Y, Cao Y, Hu H, Wei Y, Zhang Z, et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [Internet]. arXiv; 2021 [cited 2023 May 10]. http://arxiv.org/abs/2103.14030
  61. Liu Z, Hu H, Lin Y, Yao Z, Xie Z, Wei Y, et al. Swin Transformer V2: Scaling Up Capacity and Resolution [Internet]. arXiv; 2022 [cited 2024 Aug 8]. http://arxiv.org/abs/2111.09883
  62. Asa SL. My approach to oncocytic tumours of the thyroid. Journal of Clinical Pathology. 2004 Mar 1;57(3):225–32. pmid:14990587
  63. Rosario PW, Mourão GF. Noninvasive follicular thyroid neoplasm with papillary-like nuclear features (NIFTP): a review for clinicians. Endocrine-Related Cancer. 2019 May 1;26(5):R259–66. pmid:30913533
  64. D’Avanzo A, Treseler P, Ituarte PHG, Wong M, Streja L, Greenspan FS, et al. Follicular thyroid carcinoma: Histology and prognosis. Cancer. 2004;100(6):1123–9. pmid:15022277
  65. Cipriani NA, Nagar S, Kaplan SP, White MG, Antic T, Sadow PM, et al. Follicular Thyroid Carcinoma: How Have Histologic Diagnoses Changed in the Last Half-Century and What Are the Prognostic Implications? Thyroid. 2015 Nov;25(11):1209–16. pmid:26440366
  66. Basolo F, Macerola E, Poma AM, Torregrossa L. The 5th edition of WHO classification of tumors of endocrine organs: changes in the diagnosis of follicular-derived thyroid carcinoma. Endocrine. 2023 Mar 25; pmid:36964880
  67. Filiot A, Ghermi R, Olivier A, Jacob P, Fidon L, Mac Kain A, et al. Scaling Self-Supervised Learning for Histopathology with Masked Image Modeling [Internet]. 2023 [cited 2024 Aug 8]. http://medrxiv.org/lookup/doi/10.1101/2023.07.21.23292757
  68. Dee W, Sequeira I, Lobley A, Slabaugh G. Cell-vision fusion: A Swin transformer-based approach for predicting kinase inhibitor mechanism of action from Cell Painting data. iScience. 2024 Aug 16;27(8):110511. pmid:39175778
  69. Huang X, Belongie S. Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization. In: 2017 IEEE International Conference on Computer Vision (ICCV) [Internet]. Venice: IEEE; 2017 [cited 2022 Aug 16]. p. 1510–9. http://ieeexplore.ieee.org/document/8237429/
  70. Durán JM, Jongsma KR. Who is afraid of black box algorithms? On the epistemological and ethical basis of trust in medical AI. Journal of Medical Ethics. 2021 May 1;47(5):329–35. pmid:33737318
  71. Zhang H, Goodfellow I, Metaxas D, Odena A. Self-Attention Generative Adversarial Networks. In: Proceedings of the 36th International Conference on Machine Learning [Internet]. PMLR; 2019 [cited 2022 Apr 12]. p. 7354–63. https://proceedings.mlr.press/v97/zhang19d.html
  72. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention Is All You Need [Internet]. arXiv; 2017 [cited 2022 Aug 16]. http://arxiv.org/abs/1706.03762
  73. Agrawal N, Akbani R, Aksoy BA, Ally A, Arachchi H, Asa SL, et al. Integrated Genomic Characterization of Papillary Thyroid Carcinoma. Cell. 2014 Oct 23;159(3):676–90. pmid:25417114
  74. Karras T, Laine S, Aila T. A Style-Based Generator Architecture for Generative Adversarial Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021;43(12) [cited 2023 Aug 18]. https://www.computer.org/csdl/journal/tp/2021/12/08977347/1h2AHNHb9bW
  75. Aksac A, Demetrick DJ, Ozyer T, Alhajj R. BreCaHAD: a dataset for breast cancer histopathological annotation and diagnosis. BMC Research Notes. 2019 Feb 12;12(1):82. pmid:30755250
  76. Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. In: Advances in Neural Information Processing Systems [Internet]. Curran Associates, Inc.; 2017 [cited 2022 Aug 19]. https://papers.nips.cc/paper/2017/hash/8a1d694707eb0fefe65871369074926d-Abstract.html
  77. Dowson DC, Landau BV. The Fréchet distance between multivariate normal distributions. Journal of Multivariate Analysis. 1982 Sep 1;12(3):450–5.
  78. Kynkäänniemi T, Karras T, Aittala M, Aila T, Lehtinen J. The Role of ImageNet Classes in Fréchet Inception Distance [Internet]. arXiv; 2022 [cited 2022 Aug 2]. http://arxiv.org/abs/2203.06026
  79. Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L. ImageNet: A large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. 2009. p. 248–55.
  80. Yang Y, Soatto S. FDA: Fourier Domain Adaptation for Semantic Segmentation. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) [Internet]. Seattle, WA, USA: IEEE; 2020 [cited 2022 Aug 2]. p. 4084–94. https://ieeexplore.ieee.org/document/9157228/
  81. Kingma DP, Ba J. Adam: A Method for Stochastic Optimization [Internet]. arXiv; 2017 [cited 2023 May 10]. http://arxiv.org/abs/1412.6980