Semisupervised adaptive learning models for IDH1 mutation status prediction

Fengning Liang; Yaru Cao; Teng Zhao; Qian Xu; Hong Zhu

doi:10.1371/journal.pone.0321404

Abstract

The mutation status of isocitrate dehydrogenase1 (IDH1) in glioma is critical information for the diagnosis, treatment, and prognosis. Accurately determining such information from MRI data has emerged as a significant research challenge in recent years. Existing techniques for this problem often suffer from various limitations, such as the data waste and instability issues. To address such issues, we present a semisupervised adaptive deep learning model based on radiomics and rough sets for predicting the mutation status of IDH1 from MRI data. Firstly, our model uses a rough set algorithm to remove the redundant medical image features extracted by radiomics, while adding pseudo-labels for non-labeled data via statistical. T-tests to mitigate the common issue of insufficient datasets in medical imaging analysis. Then, it applies a Sand Cat Swarm Optimization (SCSO) algorithm to optimize the weight of pseudo-label data. Finally, our model adopts U-Net and CRNN to construct UCNet, a semisupervised classification model for classifying IDH1 mutation status. To validate our models, we use a preoperative MRI dataset with 316 glioma patients to evaluate the performance. Our study suggests that the prediction accuracy of glioma IDH1 mutation status reaches 95.63%. Our experimental results suggest that the study can effectively improve the utilization of glioma imaging data and the accuracy of intelligent diagnosis of glioma IDH1 mutation status.

Citation: Liang F, Cao Y, Zhao T, Xu Q, Zhu H (2025) Semisupervised adaptive learning models for IDH1 mutation status prediction. PLoS One 20(5): e0321404. https://doi.org/10.1371/journal.pone.0321404

Editor: Hilary A. Coller, UC Los Angeles: University of California Los Angeles, UNITED STATES OF AMERICA

Received: June 21, 2024; Accepted: March 5, 2025; Published: May 5, 2025

Copyright: © 2025 Liang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The TCIA data that support the findings of this study are openly available in Figshare at https://doi.org/10.6084/m9.figshare.28366148.v1

Funding: the National Natural Science Foundation of China (grant number 62102345), the Project for Research on Medical in Jiangsu Commission of Health (grant number Z2020032), and the Xuzhou Key Research and Development Program (grant number KC22117)

Competing interests: The authors have declared that no competing interests exist.

I. Introduction

Glioma is the most common primary tumor in the central nervous system, accounting for 26.5% of brain tumors, and its 5-year survival rate is 20% to 30% [1,2]. The World Health Organization redefined the classification of gliomas in the 2021 revision of the CNS tumor classification. The revision integrates molecular subtypes for glioma classification, such as IDH-wild type and IDH-mutant [3]. Due to the heterogeneity of tumors, patients with glioma may differ in terms of clinical and pathological characteristics and treatment methods. The integrated molecular subtypes enable doctors to understand tumor types based on molecular expression, providing an important basis for follow-up clinical treatment. The main type of IDH mutation in glioma is IDH1 mutation. IDH1 mutation changes the activity of the IDH enzyme, thereby affecting the treatment and prognosis of patients. Generally, patients with IDH1 mutations have better overall survival and prognosis [4]. Due to the heterogeneity of tumors, patients with glioma may differ in terms of clinical and pathological characteristics and treatment methods. The integrated molecular subtypes enable doctors to understand tumor types based on molecular expression, providing an important basis for follow-up clinical treatment. The main type of IDH mutation in glioma is IDH1 mutation. IDH1 mutation changes the activity of the IDH enzyme, thereby affecting the treatment and prognosis of patients. Generally, patients with IDH1 mutations have better overall survival and prognosis [5,6]. For the above reasons, a noninvasive and easy-to-use method for the preoperative prediction of IDH1 mutation status is essential to alleviate patient suffering and improve the detection efficiency and success rate.

In recent years, we have witnessed tremendous progress in technology development for medical imaging analysis. Currently, deep learning models have been frequently used in almost every phase of medical image processing. Previous research has shown that features extracted from MRI of gliomas are related to gene expression patterns. This has led to a series of studies on mutation status determination. Zhang et al. [7] used the clinical features of multimodal MRI combined with random forest machine learning algorithms to predict IDH mutation status with 86% accuracy. Chang et al. [8] used a convolutional neural network for brain MRI to predict IDH1 mutation status with an accuracy of 94%. Choi et al. [9] used the classification method of segmentation and then combined radiomics features and a convolutional neural network to predict IDH mutation status, obtaining accuracy rates of 93.8%, 87.9% and 78.8% on three different datasets. However, these methods typically involve manual tumor presegmentation and utilize 2D slice classification methods. Adjacent slices of glioma MRI usually share much information. The 2D slice classification method is unable to capture sequential features, and different slices from the same sequence may be distributed in both the training and test sets, potentially introducing bias in the testing phase. In addition, deep learning often requires a large amount of labeled data. However, the abundance of unlabeled data in glioma imaging datasets leads to a significant scarcity of data. The semisupervised learning method, which uses both labeled and unlabeled data to fit the model, improves the accuracy of the model in some cases. However, this method is unstable and may lead to a worse model. In terms of feature reduction, the conventional LASSO model tends to yield sparse results.

To address the above problems, we propose a semi-supervised adaptive deep learning model based on radiomics and rough sets for predicting IDH1 mutation status in glioma MRI images. The specific contributions of this paper are as follows:

(1) To address the problems of data waste and instability in semi-supervised learning methods, we introduce a pseudo-labeling algorithm that removes a large number of redundant radiomics attributes while keeping the pseudo-labeling classification ability unchanged.
(2) A pseudo-labeled data weight adaptive adjustment algorithm using the Sand Cat Swarm Optimization (SCSO) algorithm is proposed to address the potential adverse effects of for-labeled data on the model. The pseudo-labeled data weight adjustment algorithm based on the Sand Cat Swarm Optimization (SCSO) algorithm can adaptively search and capture local and global information, and continuously adjust the search strategy by analyzing the adaptability of each pseudo-labeled data, and ultimately obtain the optimal weights, which effectively reduces the potential adverse effects of pseudo-labeled data on the model.
(3) For glioma image sequences, this paper proposes an improved U-Net-based feature extractor to extract feature sequences from glioma image sequences, and then use a convolutional recurrent neural network (CRNN)-based classifier to classify the feature sequences to predict the IDH1 mutation status of gliomas. The proposed model is an end-to-end model and does not require manual tumor pre-segmentation. In addition, an image sequence rather than a 2D slice classification method is used. The complete framework of the proposed IDH1 mutation status prediction is shown in Fig 1.

Download:

Fig 1. The proposed framework for IDH1 mutation status classification.

https://doi.org/10.1371/journal.pone.0321404.g001

II. Materials and methods

A. Data collection

The brain MRI data of glioma patients used in this study are from a local affiliated hospital. Informed consent is obtained from all study participants as per their preferences, and the study received approval from the relevant Ethics Committee. Patients came to the hospital for diagnosis from May 30, 2020 to August 31, 2023. Experimental data were acquired on September 1, 2023 and anonymized. All brain MRI scans are preoperative images, including T1C, T2, FLAIR and DWI sequences. Patients who meet the following inclusion criteria are included in this study: (i) histologically confirmed glioma; (ii) pathologic examination of tumor specimens carrie out with proven records of IDH1 mutation status (for patients with known IDH1 mutation status); and (iii) no preoperative treatment. In the end, a total of 316 patients are included in this study, yielding 20,224 images. Among them, 202 patients have a known IDH1 mutation status, with a total of 12,928 images, while 114 patients have unknown IDH1 mutation status, with a total of 7,296 images. Among patients with IDH1 mutation status, 73 are IDH1-mutant and 129 are IDH1-wild type. Among them, level IV account for 55.4%, level III account for 17.8%, level II account for 23.8%, and level I account for 3.0%. Among patients without IDH1 mutation status, 48.9% are grade IV, 23.9% are grade III, 22.7% are grade II, and 4.5% are grade I. To validate the model, we select 80 patients (comprise 45 IDH1-wild type and 35 IDH1-mutant) from the initial 202 patients with known IDH1 mutation status as a validation cohort. In addition, 208 patients with IDH1 mutation status from The Cancer Imaging Archive (TCIA set) are enrolled in accordance with the same criteria. The detail patient characteristics are summarize in Table 1.

Download:

Table 1. Patient demographics and genetic information.

https://doi.org/10.1371/journal.pone.0321404.t001

B. Pseudo-labeled algorithm based on radiomics and rough sets

Radiomics [10] involves the high-throughput extraction of extensive information from various images (e.g., CT, MRI, PET). This process transforms images into high-dimensional features and employs data mining techniques to enhance support for medical decision-making. Radiomics can extract thousands of features, many of which are useless or redundant. Therefore, reducing such massive radiomics features is an important preprocessing step for the study. This paper presents a feature reduction method based on radiomics and the positive regions of rough sets.

The experiment used IBEX [11] to extract radiomics features from different sequences of glioma MRI with IDH1 mutation status that are not select for validation, include the gray-level co-occurrence matrix and gray level run length matrix. The gray-level co-occurrence matrix is a matrix function based on pixel distance and angle. It captures a broad spectrum of information within an image, including direction, interval, range of variation, and speed, by calculating the correlation between the gray levels of points at a specific distance and direction in the image. The gray level run length matrix can reflect the comprehensive information of the image gray level, such as direction, adjacent interval and variation amplitude. The parameters for the gray level cooccurrence matrix are configured as follows: GrayLimits=[0 2100], NumLevels = 100, Direction=[0 45 90 135], and Offset=[1 4 7]. The parameters for the gray level run length matrix are set as: GrayLimits=[0 2100], NumLevels = 100, and Direction=[0 90]. The radiomics features obtained based on the gray level cooccurrence matrix and gray level run length matrix are shown in Table 2.

Download:

Table 2. Image feature information.

https://doi.org/10.1371/journal.pone.0321404.t002

The radiomics features extracted from MRI data of different sequences constitute the original feature set. First, we perform an independent-sample t test on the extracted features to screen out useless features. When P < 0.05, the parameter is statistically significant. Following the independent-sample T-test, it is possible that the obtained statistically significant features still contain a significant number of redundant features. To eliminate these redundant features, we employ a reduction algorithm based on the positive region of the rough set to reduce the feature set.

The rough set is an attribute reduction method proposed by Pwalak [12] that can quantitatively analyze and process inaccurate, inconsistent, and incomplete information and knowledge. It does not need any prior information other than the dataset needed by the problem. This paper adopts a method based on the positive region of rough set to perform feature reduction in radiomics. There is a large number of features obtained through radiomics methods and hence there is a need to reduce the time complexity and remove redundant features. In comparison to other feature dimension reduction methods, the rough set method ensures that classification accuracy remains the same as the original even after removing redundant attributes. Because the positive region of the rough set is monotonic, the experiment adopted the method of deleting irrelevant attributes one by one. The algorithm begins with the complete set of conditional attributes, and in each iteration, it evaluates an attribute. If the positive region of the decision table remains unaltered after removing the attribute from the set of conditional attributes, the attribute can be excluded from the existing attribute set; otherwise, the attribute is retained. The above operations are repeated until a subset of conditional attributes is obtained. If the positive region of the decision table changes when any attribute is removed, the algorithm ends. The attribute subset at this time is the reduction of the decision table. Finally, the experiment extracts the radiomics features of the unlabeled image data based on the reduced feature set. Then, SPSS is used to calculate Youden’s index for these features, and the maximum value of Youden’s index is used as the threshold. We used Youden’s index to determine the threshold for pseudo-labeling, ensuring that only features with high discriminative power were used for labeling. This reduced the likelihood of assigning incorrect labels to unlabeled data. We judge the IDH1 mutation status of the unlabeled image data according to the threshold of these features. The entire pseudo-labeled algorithm based on radiomics and rough sets is shown in Algorithm 1.

Download:

C. SCSO-based adaptive weight adjustment algorithm for pseudo-labeled data

To mitigate the impact of inaccurate pseudo-labeled on the model’s accuracy and stability, this paper introduces an adaptive adjustment algorithm for pseudo-labeled data weights based on the SCSO (Sand Cat Swarm Optimization) method. Specifically, it evaluates the fitness of each pseudo-labeled instance and adaptively modifies its weight during the optimization process. This ensures that pseudo-labels with higher confidence (i.e., those that align well with the labeled data) are assigned greater weight, while less reliable pseudo-labels are down-weighted. As a result, the SCSO algorithm minimizes the influence of noisy or incorrect pseudo-labels, enhancing the model’s stability and accuracy.

The Sand Cat Swarm Optimization algorithm [13] is a heuristic optimization technique inspired by the behavioral traits of sand cats in their natural desert environment. It mimics their strategies for finding food and evading threats. The sand cat’s ears possess the remarkable ability to perceive frequencies below 2 kHz during foraging, couple with an incredible proficiency in excavating prey. Its foraging process comprises two phases: searching and attacking prey. The algorithm effectively obtains the local and global information of the population through fewer parameters and steps, adaptively adjusts the search step length and direction in the iterative process according to the adaptation of each pseudo-label, and finally selects the optimal or better weight scheme as the adjust pseudo-labeled weights. Suppose there are N instances in the glioma image dataset, including Nk labeled instances, Xk={(x1,y1),(x2,y2),…,()}, and (N - Nk) pseudo-labeled instances Xl={,…,}. Then, the loss function of the CRNN-based classifier is expressed as:

(1)

In this experiment, the weight of the labeled data is fixed at 1, and the weight of the pseudo-labeled data is adjusted to change the loss function of the classifier. Since labeled instances and pseudo-labeled instances have different weights, the loss function can be redefined as:

(2)

where α is the weight applied to the pseudo-labeled instance.

To further determine the value of α, the experiment used the SCSO-based pseudo-labeled data adaptive weight adjustment algorithm to find the optimum solution of α. The algorithm initially initializes parameters including the number of sand cat individuals (N), population positions (Posi) and maximum iteration count (T). The primary parameters governing the transition between the exploration and exploitation stages are denoted as R. It is assumed that the sensitivity range (r) of sand cats extends from 0 to 2 kHz. Each individual sand cat updates its position based on the optimal candidate position (α), its current position (Posc),and its sensitivity range (r). Consequently, sand cats are capable of discovering alternative optimal prey positions after the iterative update. The specific algorithm is shown in Algorithm 2.

Download:

D. Feature extraction of glioma image sequences and IDH1 mutation status classification

As shown in Fig 2, we design an IDH1 mutation status classification model UCNet based on an improved U-Net [14] feature extractor and a CRNN [15] classifier using glioma image sequences.

Download:

Fig 2. Glioma IDH1 mutation status classification model UCNet.

https://doi.org/10.1371/journal.pone.0321404.g002

The classification model consists of a feature extractor based on the improved U-Net and a classifier based on a CRNN. The feature extractor based on the improved U-Net is composed of an encoder based on dense blocks and a decoder based on residual blocks, and there is a skip connection between the encoder and the decoder. The use of dense blocks can enhance feature propagation, improve the accuracy and reliability of feature extraction, and reduces the number of parameters. The use of a residual block reduce the weight of some features, improves the effect of MRI spatial sequence reconstruction, and effectively avoids gradient disappearance. Skip connections can effectively merge low-level features and high-level features.

Each dense block is composed of 4 convolutional layers with 64, 64, 128, and 128 convolution kernels. Except for the first and last convolutional layers, the size of the convolution kernel in each convolutional layer is 3*3. The input of each convolutional layer is the sum of the outputs of all previous layers, and each convolutional layer uses LeakyReLU as the activation function. The encoder is composed of two such dense blocks, and the blocks are connected using a 1 * 1 convolutional layer and a 2 * 2 AvgPooling layer to complete downsampling. Each residual block consists of 4 convolutional layers, with 64, 64, 128 and 128 convolution kernels. Except for the first and last convolutional layers, the size of the convolution kernel in each convolutional layer is 3*3. Each convolutional layer uses LeakyReLU as the activation function. The decoder is composed of two such residual blocks and relies on an upsampling layer to increase the size of the feature map; it then stitches them to the corresponding feature map of the encoder.

In this experiment, the glioma image sequence is pass into the encoder to extract the feature sequence, and then it is restore by the decoder. We compare the corresponding pixels of the generated image and the original image. The more similar the final generate image and the original image are, the smaller the loss and the more representative the extract feature sequence. The extract feature sequence is input into the CRNN-based classifier for final classification, and IDH1 mutation status prediction is performe. The CRNN is composed of a convolutional neural network and a recurrent neural network, which is suitable for processing sequence data. In this model, the recurrent neural network part of the CRNN uses long short-term memory. The whole model uses a glioma image sequence, not a 2D slice classification method. Finally, this paper constructs F-UCNet, an IDH1 mutation status classification model based on semisupervised adaptive deep learning. The model uses glioma image sequence data containing pseudo-labeled data, extracts the feature sequence of the image sequence through a feature extractor based on the improved U-Net, and uses a CRNN classifier to predict the IDH1 mutation status. In the classification stage, the model adopts an SCSO-based pseudo-labeled data adaptive weight adjustment algorithm to ensure model stability. The source code is available at the following link: https://github.com/100002006023/Semi-supervised-Adaptive-Prediction-Model.

Results

A. Pseudo-labeled based on radiomics and rough sets

This experiment used IBEX to extract 363 radiomics features for each sequence of each labeled patient, of which 330 are derived from the gray level cooccurrence matrix and 33 are derive from the gray level run length matrix. All 363 radiomics features are highly reproducible (ICC > 0.75). Finally, the radiomics features extract from the four sequences of each patient constitute the 1452-dimensional original feature set.

The experiment used SPSS 22 to perform an independent-sample t test on the original feature set. When P < 0.05, a parameter is statistically significant. After the independent-sample t test, there are 372 statistically significant features that constitute a new feature set (T1C 181, T2 56, FLAIR 18, DWI 117). To further remove redundant features from the new feature set, the experiment used a reduction algorithm based on the positive region of the rough set. The software use for attribute reduction is MATLAB 2016.

The features after reduction are shown in Tables 3–6.

Download:

Table 3. T1C attributes.

https://doi.org/10.1371/journal.pone.0321404.t003

Download:

Table 4. T2 attributes.

https://doi.org/10.1371/journal.pone.0321404.t004

Download:

Table 5. Flair attributes.

https://doi.org/10.1371/journal.pone.0321404.t005

Download:

Table 6. DWI attributes.

https://doi.org/10.1371/journal.pone.0321404.t006

After attribute reduction, only 18 of the original 1452 features remain. This step greatly reduces feature redundancy, prevents redundant features from interfering with pseudolabeling and improves the efficiency of pseudolabeling. The experiment used the feature set after attribute reduction to classify and verify the labeled image data. The final accuracy rate is 82.19%, the sensitivity is 78.26%, the specificity is 84%, and the AUROC is 0.895 (95% CI). The ROC curve is shown in Fig 3. The accuracy of the conventional semisupervised learning method is 69.05%, and the accuracy of feature reduction by LASSO is 58.33%. Compared with the conventional methods, the method we propose achieved great improvement.

Download:

Fig 3. ROC curve of the attribute reduction results.

https://doi.org/10.1371/journal.pone.0321404.g003

Finally, radiomics feature extraction is performed on 114 pieces of unlabeled image data based on the feature set after attribute reduction. Then, the IDH1 mutation status is predicted with the feature threshold, and the unlabeled image data are labeled. Among 114 cases of unlabeled image data, 75 cases are IDH1-wild type and 39 cases are IDH1-mutant.

B. SCSO-based adaptive weight adjustment algorithm for pseudo-labeled data

For the data including the pseudo-labeled data, this experiment adopted the SCSO-based pseudo-labeled data adaptive weight adjustment algorithm. We set Maxgen to 100, and adjusted and output α. The specific optimization process of the classifier of the T2-Flair model is shown in Fig 4. Finally, the optimization ends when the α is 0.2.

Download:

Fig 4. Optimization process of adaptive weight adjustment based on the SCSO.

https://doi.org/10.1371/journal.pone.0321404.g004

C. Feature extraction of glioma image sequences and IDH1 mutation status classification

We conduct our experiments on a machine with a Windows 10 operating system, a 2.10 GHz Intel Xeon (dual core) processor, 64 GB memory, and 3×GeForce RTX 2080Ti graphics card. We used PyCharm as the development environment, Keras as the deep learning framework, and Python as the programming language.

Image preprocessing included registering images of different sequences to identical 1-mm isovoxel spatial coordinates, performing data augmentation (such as rotation, reflection, flipping, and color adjustment) on the images of some IDH1-mutant cases, and subjecting the images of each sequence to signal intensity normalization. Finally, the images were resampled to sizes of 256 × 256 × 16. These steps ensure consistency across imaging data and enhance the model’s ability to handle variations in spatial resolution, intensity, orientation, contrast, and brightness.

In this experiment, 7 models are constructed for different sequences of glioma images, namely, the T1C model, T2 model, FLAIR model, DWI model, T1C-T2 model, T2-FLAIR model and T1C-T2-FLAIR model. The performance comparisons of these seven models are shown in Table 7.

Download:

Table 7. Performance comparisons of the seven models.

https://doi.org/10.1371/journal.pone.0321404.t007

As shown in the above table, when using a single sequence to train the model, the T2 model obtain the highest accuracy rate, follow by the T1C model, and the DWI model had the lowest accuracy rate. Then, two multisequence models, T1C-T2 and T2-Flair, are trained and obtain higher accuracy. When using more sequences, such as in the T1C-T2-Flair model, the accuracy decreases. Therefore, we chose the T2-Flair model with the highest accuracy for subsequent experiments.

First, the feature extractor based on the improved U-Net is used to extract features of the glioma image sequence. In this experiment, the preprocessed glioma image sequence is input into the feature extractor based on the improved U-Net to train the model and extract the required features. Networks are implement use an adaptive moment estimation optimizer and a mean square error loss function. The initial learning rate is set to 10 − 5 with a batch size of 4. Fig 5 shows the process of the T2-Flair model using labeled data to train the feature extraction model and using all data, including pseudo-labeled data, to train the feature extraction model. We stop training the models when the loss curve reaches its lowest point.

Download:

Fig 5. Feature extraction model training process.

https://doi.org/10.1371/journal.pone.0321404.g005

As shown in Fig 5, we train the models for 300 epochs. It can be seen from the figures that the accuracy of the feature extraction model trains use data include pseudo-labeled data is significantly higher than that of the model using only labeled data, and the loss convergence speed is much higher. For further comparison, we perform feature extraction on the labeled data and data including pseudo-labeled data and then input the extract feature sequences into the CRNN classifier for training and testing.

In this experiment, the weight of the labeled data is fixed at 1, and the weight of the pseudo-labeled data is adjusted to 0.2 during the classification training process of the T2-Flair model. The CRNN classifier is implemented uses an adaptive moment estimation optimizer and a cross-entropy loss function. The batch size of the UCNet model is set to 40, and the batch size of the F-UCNet model is set to 30. The initial learning rate is set to 10–4. In addition, to avoid overfitting, this experiment used dropout in the CRNN classifier. The dropout ratio is set to 0.5. Then, the model is validated on an independent validation cohort. Figs 6–8 show the training process of the T2-Flair model in three different situations: using labeled data, using all data including pseudo-labeled data, and adjusting the weight of pseudo-labeled data.

Download:

Fig 6. T2-Flair model training process using labeled data.

https://doi.org/10.1371/journal.pone.0321404.g006

Download:

Fig 7. T2-Flair model training process using all data.

https://doi.org/10.1371/journal.pone.0321404.g007

Download:

Fig 8. T2-Flair model training process using all data.

https://doi.org/10.1371/journal.pone.0321404.g008

As shown in Figs 6-8, the model using the adjusted weights for the pseudo-labeled data converged faster and more consistently. The ablation experiments used data from an independent validation cohort in a local affiliated hospital dataset. Our results are shown in Table 8, where the proposed algorithm achieved 95.63% of the optimal results. As shown in Table 9, we also validated our model using TCIA and compared the results with those published by previous authors. The method in this paper has the highest accuracy of 93.44%.

Download:

Table 8. The results of the ablation experiment.

https://doi.org/10.1371/journal.pone.0321404.t008

Download:

Table 9. Comparison with previously published methods for classifying glioma IDH1 status.

https://doi.org/10.1371/journal.pone.0321404.t009

Discussion

In this study, we propose a semisupervised adaptive deep learning model that uses MRI to predict the IDH1 mutation status of gliomas. Firstly, we propose a pseudo-labeled algorithm based on radiomics and rough sets for a large amount of unlabeled data. The algorithm extracts numerous radiomics features from the labeled data and then employs an independent-sample t test to initially filter out significant features from a vast set of features. To further remove redundant features, the algorithm utilizes an attribute reduction method based on the positive region of the rough set, resulting in a feature set after attribute reduction. The algorithm extracts the radiomics features from the unlabeled data according to the feature set after attribute reduction and labels the unlabeled data according to the threshold of the feature set. Subsequently, to address potential inaccuracies in the pseudo-labeled data, this paper introduces an SCSO-based adaptive weight adjustment algorithm for pseudo-labeled data. Furthermore, we construct an IDH1 mutation status classification model based on glioma image sequences. This model consists of a feature extractor utilizing an improved U-Net and a classifier based on a CRNN. The feature extractor based on the improved U-Net is composed of an encoder based on a dense block and a decoder based on a residual block. The encoder performs feature extraction on the input glioma image sequence, and the decoder restores and outputs the feature sequence. The more similar the generated image is to the original image, the more representative the extracted features become. We input the feature sequence extracted by the feature extractor into the CRNN classifier for IDH1 mutation status classification. In the classification stage, the SCSO algorithm fixes the weight of the labeled data and uses the algorithm to adaptively adjust the weight of the pseudo-labeled data. Then the optimal weight is output. We constructed 7 models for different sequences of glioma imaging, namely, the T1C model, T2 model, FLAIR model, DWI model, T1C-T2 model, T2-FLAIR model and T1C-T2-FLAIR model. The IDH1 mutation status classification models are trained on 122 patients (7,808 images) with IDH1 mutation status and 114 patients (7,296 images) without IDH1 mutation status, and their performance is validated on an independent validation cohort (80 patients).

Prior studies [7–9] utilized 2D slice classification methods based on labeled data, with many of them necessitating tumor presegmentation. However, in reality, there are more data without IDH1 mutation status labels, which causes considerable data waste. Therefore, this paper proposes a pseudo-labeled algorithm based on radiomics and rough sets. Pseudo-labeled and the judicious utilization of unlabeled data can enhance data efficiency and contribute to model accuracy improvement. In addition, the IDH1 mutation status classification model proposed in this paper is end-to-end and does not require tumor presegmentation. The model is divided into two parts: a feature extractor based on the improved U-Net and a classifier based on a CRNN. Incorporating dense blocks and residual blocks in the feature extractor based on the enhanced U-Net effectively mitigates the vanishing gradient issue, enhance the accuracy and dependability of feature extraction. The CRNN-based classifier considers the relationship between the slices and can help the model extract the features between the slices, so it is very suitable for sequence data. Leveraging the SCSO-based adaptive weight adjustment algorithm for pseudo-labeled data significantly expedites model convergence, enhances model accuracy, and ensures model stability. The experimental results show that the performance of the model using all data include pseudo-labeled data is better than that use only labeled data. The performance of the model that adaptively adjusted the weight of the pseudo-labeled data is better than that of the model that did not. In the end, the T2-Flair model obtain the highest accuracy rate for IDH1 mutation status classification, which is higher than that of using only a single sequence. Nevertheless, when additional sequences are included, as in the case of the T1C-T2-Flair model, the accuracy exhibited a decrease.

Our analysis presents an alternative noninvasive method for predicting the IDH1 mutation status of glioma patients. The experimental results demonstrate that the method presented in this paper can enhance the accuracy of intelligent IDH1 mutation status diagnosis and the utilization of glioma imaging data. As a result, this method holds promising applications in alleviating patient suffering, reducing economic burdens, and aid in clinical diagnosis.

Although our algorithm has achieved significant results in predicting IDH1 mutation status, it also exhibits certain limitations. Firstly, the algorithm requires a substantial amount of MRI imaging data for training and validation, which may be constrained by data acquisition and processing in certain scenarios. Secondly, while we conducted model validation in our study, its generalization capability to other datasets or medical centers remains insufficiently validated and is contingent upon the stability of specific equipment and techniques. Additionally, the accuracy of pseudo-labeled data remains a challenge, potentially leading to unstable model training or misleading outcomes. Moving forward, we will further research and refine the algorithm proposed in this paper to enhance its effectiveness and reliability in real-world clinical applications.

Conclusion

In this paper, we propose a semi-supervised adaptive prediction model for IDH1 mutation status in gliomas. For unlabeled image data, we introduce a pseudo-labeling algorithm based on radiomics and rough sets, and an adaptive weight adjustment algorithm for pseudo-labeled data based on SCSO. For glioma image sequences, we introduce a feature extractor based on improved U-Net and a classifier based on CRNN to predict IDH1 mutation status. Experimental results show that the method achieves 95.63% accuracy in IDH1 mutation state classification. This indicates that the proposed method is reliable and promises to aid in clinical diagnosis. In the future, we will conduct a more in-depth study to address the shortcomings of the algorithm, such as insufficient data size and errors in the accuracy of pseudo-labeled data.

Acknowledgments

Not applicable.

References

1. Parent M-E, Turner MC, Lavoué J, Richard H, Figuerola J, Kincl L, et al. Lifetime occupational exposure to metals and welding fumes, and risk of glioma: a 7-country population-based case-control study. Environ Health. 2017;16(1):90.
- View Article
- Google Scholar
2. Zhang M, Huang N, Yang X. A Novel Protein Encoded by the Circular Form of the SHPRH Gene Suppresses Glioma Tumorigenesis. ONCOGENE. 2018;37(13):1805–14. pmid:29343848
3. Louis DN, Perry A, Reifenberger G, von Deimling A, Figarella-Branger D, Cavenee WK, et al. The 2016 World Health Organization Classification of Tumors of the Central Nervous System: A Summary. Acta Neuropathol. 2016;131(6):803–20. pmid:27157931
4. Lu J, Li X, Li H. Perfusion parameters derived from MRI for preoperative prediction of IDH mutation and MGMT promoter methylation status in glioblastomas. Magn Reson Imaging. 2021;83:189–95. pmid:34506909
5. Bangalore YC, Shah BR, Vejdani-Jahromi M, et al. A Novel Fully Automated MRI-based Deep-Learning Method for Classification of IDH Mutation Status in Brain Gliomas. Neuro Oncol. 2020; 22(3):402–11.
- View Article
- Google Scholar
6. Wang S, Shi J, Ye Z. Predicting EGFR mutation status in lung adenocarcinoma on computed tomography image using deep learning. Eur Respir J. 2019;53(3):1800986. pmid:30635290
7. Zhang B, Chang K, Ramkissoon S, Tanguturi S, Bi WL, Reardon DA, et al. Multimodal MRI features predict isocitrate dehydrogenase genotype in high-grade gliomas. Neuro Oncol. 2017;19(1):109–17. pmid:27353503
8. Chang P, Grinband J, Weinberg BD, Bardis M, Khy M, Cadena G, et al. Deep-learning convolutional neural networks accurately classify genetic mutations in gliomas. AJNR Am J Neuroradiol. 2018;39(7):1201–7. pmid:29748206
9. Choi YS, Bae S, Chang JH, Kang S-G, Kim SH, Kim J, et al. Fully automated hybrid approach to predict the IDH mutation status of gliomas via deep learning and radiomics. Neuro-Oncology. 2021;23(2):304–13. pmid:32706862
10. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RGPM, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48(4):441–6. pmid:22257792
11. Zhang L, Fried DV, Fave XJ, Hunter LA, Yang J, Court LE. IBEX: an open infrastructure software platform to facilitate collaborative work in radiomics. Med Phys. 2015;42(3):1341–53.
- View Article
- Google Scholar
12. Pawlak Z. Rough Sets. Int J Comput Inform Sci. 1982;11(5):341–56.
- View Article
- Google Scholar
13. Seyyedabbasi A, Kiani F. Sand Cat swarm optimization: a nature-inspired algorithm to solve global optimization problems. Eng Comput. 2022;39(4):2627–51.
- View Article
- Google Scholar
14. Ronneberger O, Fischer P, Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In: 2015-01-01 2015; Cham: Springer International Publishing; 2015. p. 234–41.
15. Shi B, Bai X, Yao C. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans Pattern Anal Mach Intell. 2017;39(11):2298–304. pmid:28055850
16. Taha B, Li T, Boley D, Chen CC, Sun J. Detection of isocitrate dehydrogenase mutated glioblastomas through anomaly detection analytics. Neurosurgery. 2021;89(2):323–8. pmid:33887763
17. He A, Wang P, Zhu A, Liu Y, Chen J, Liu L. Predicting IDH mutation status in low-grade gliomas based on optimal radiomic features combined with multi-sequence magnetic resonance imaging. Diagnostics (Basel). 2022;12(12):2995. pmid:36553002
18. Choi KS, Choi SH, Jeong B. Prediction of IDH genotype in gliomas with dynamic susceptibility contrast perfusion MR imaging using an explainable recurrent neural network. Neuro Oncol. 2019;21(9):1197–209.
- View Article
- Google Scholar
19. Tripathi PC, Bag S. An attention-guided CNN framework for segmentation and grading of glioma using 3D MRI scans. IEEE/ACM Trans Comput Biol Bioinform. 2022.
- View Article
- Google Scholar

[ref1] 1. Parent M-E, Turner MC, Lavoué J, Richard H, Figuerola J, Kincl L, et al. Lifetime occupational exposure to metals and welding fumes, and risk of glioma: a 7-country population-based case-control study. Environ Health. 2017;16(1):90.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Zhang M, Huang N, Yang X. A Novel Protein Encoded by the Circular Form of the SHPRH Gene Suppresses Glioma Tumorigenesis. ONCOGENE. 2018;37(13):1805–14. pmid:29343848
View Article
PubMed/NCBI
Google Scholar

[5] View Article

[6] PubMed/NCBI

[7] Google Scholar

[ref3] 3. Louis DN, Perry A, Reifenberger G, von Deimling A, Figarella-Branger D, Cavenee WK, et al. The 2016 World Health Organization Classification of Tumors of the Central Nervous System: A Summary. Acta Neuropathol. 2016;131(6):803–20. pmid:27157931
View Article
PubMed/NCBI
Google Scholar

[9] View Article

[10] PubMed/NCBI

[11] Google Scholar

[ref4] 4. Lu J, Li X, Li H. Perfusion parameters derived from MRI for preoperative prediction of IDH mutation and MGMT promoter methylation status in glioblastomas. Magn Reson Imaging. 2021;83:189–95. pmid:34506909
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Bangalore YC, Shah BR, Vejdani-Jahromi M, et al. A Novel Fully Automated MRI-based Deep-Learning Method for Classification of IDH Mutation Status in Brain Gliomas. Neuro Oncol. 2020; 22(3):402–11.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref6] 6. Wang S, Shi J, Ye Z. Predicting EGFR mutation status in lung adenocarcinoma on computed tomography image using deep learning. Eur Respir J. 2019;53(3):1800986. pmid:30635290
View Article
PubMed/NCBI
Google Scholar

[20] View Article

[21] PubMed/NCBI

[22] Google Scholar

[ref7] 7. Zhang B, Chang K, Ramkissoon S, Tanguturi S, Bi WL, Reardon DA, et al. Multimodal MRI features predict isocitrate dehydrogenase genotype in high-grade gliomas. Neuro Oncol. 2017;19(1):109–17. pmid:27353503
View Article
PubMed/NCBI
Google Scholar

[24] View Article

[25] PubMed/NCBI

[26] Google Scholar

[ref8] 8. Chang P, Grinband J, Weinberg BD, Bardis M, Khy M, Cadena G, et al. Deep-learning convolutional neural networks accurately classify genetic mutations in gliomas. AJNR Am J Neuroradiol. 2018;39(7):1201–7. pmid:29748206
View Article
PubMed/NCBI
Google Scholar

[28] View Article

[29] PubMed/NCBI

[30] Google Scholar

[ref9] 9. Choi YS, Bae S, Chang JH, Kang S-G, Kim SH, Kim J, et al. Fully automated hybrid approach to predict the IDH mutation status of gliomas via deep learning and radiomics. Neuro-Oncology. 2021;23(2):304–13. pmid:32706862
View Article
PubMed/NCBI
Google Scholar

[32] View Article

[33] PubMed/NCBI

[34] Google Scholar

[ref10] 10. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RGPM, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48(4):441–6. pmid:22257792
View Article
PubMed/NCBI
Google Scholar

[36] View Article

[37] PubMed/NCBI

[38] Google Scholar

[ref11] 11. Zhang L, Fried DV, Fave XJ, Hunter LA, Yang J, Court LE. IBEX: an open infrastructure software platform to facilitate collaborative work in radiomics. Med Phys. 2015;42(3):1341–53.
View Article
Google Scholar

[40] View Article

[41] Google Scholar

[ref12] 12. Pawlak Z. Rough Sets. Int J Comput Inform Sci. 1982;11(5):341–56.
View Article
Google Scholar

[43] View Article

[44] Google Scholar

[ref13] 13. Seyyedabbasi A, Kiani F. Sand Cat swarm optimization: a nature-inspired algorithm to solve global optimization problems. Eng Comput. 2022;39(4):2627–51.
View Article
Google Scholar

[46] View Article

[47] Google Scholar

[ref14] 14. Ronneberger O, Fischer P, Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In: 2015-01-01 2015; Cham: Springer International Publishing; 2015. p. 234–41.

[ref15] 15. Shi B, Bai X, Yao C. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans Pattern Anal Mach Intell. 2017;39(11):2298–304. pmid:28055850
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref16] 16. Taha B, Li T, Boley D, Chen CC, Sun J. Detection of isocitrate dehydrogenase mutated glioblastomas through anomaly detection analytics. Neurosurgery. 2021;89(2):323–8. pmid:33887763
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref17] 17. He A, Wang P, Zhu A, Liu Y, Chen J, Liu L. Predicting IDH mutation status in low-grade gliomas based on optimal radiomic features combined with multi-sequence magnetic resonance imaging. Diagnostics (Basel). 2022;12(12):2995. pmid:36553002
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref18] 18. Choi KS, Choi SH, Jeong B. Prediction of IDH genotype in gliomas with dynamic susceptibility contrast perfusion MR imaging using an explainable recurrent neural network. Neuro Oncol. 2019;21(9):1197–209.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref19] 19. Tripathi PC, Bag S. An attention-guided CNN framework for segmentation and grading of glioma using 3D MRI scans. IEEE/ACM Trans Comput Biol Bioinform. 2022.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

Figures

Abstract

I. Introduction

II. Materials and methods

A. Data collection

B. Pseudo-labeled algorithm based on radiomics and rough sets

C. SCSO-based adaptive weight adjustment algorithm for pseudo-labeled data

D. Feature extraction of glioma image sequences and IDH1 mutation status classification

Results

A. Pseudo-labeled based on radiomics and rough sets

B. SCSO-based adaptive weight adjustment algorithm for pseudo-labeled data

C. Feature extraction of glioma image sequences and IDH1 mutation status classification

Discussion

Conclusion

Acknowledgments

References