Figures
Abstract
In recent times, chronic kidney cancer has been considered a significant cause of cancer, and Renal Cell Carcinoma (RCC) has become a significant prevalent among various kidney cancer conditions. The analysis of kidney cancer, an important and often time-sensitive medical task, has seen a breakthrough alteration by incorporating deep learning (DL) methods, mainly in analyzing histopathological images (HIs). Given manual analysis’s inherent complexity and time-consuming aspect, automatic systems leveraging DL methods provide a promising solution. Automated techniques powered by DL methods showcase a notable capability to analyze intricate details within HIs. These methods are adept at recognizing complex patterns and anomalies within HIs, accelerating the diagnostic method and increasing accuracy. The combination of advanced computational methods with the assessment of kidney cancer HIs not only overcomes the demanding requirements for timely identification but also paves the way for more effective and reliable diagnostic processes in renal oncology. This study presents the Kidney Cancer Detection and Classification employing a Snake Optimizer with Deep Learning on Pathological Images (KCDC-SODLPI) technique. The main aim of the KCDC-SODLPI method is to analyze the pathological images to determine the presence of kidney cancer. In the multifaceted process, the KCDC-SODLPI technique utilizes a Gaussian filtering (GF)-based image preprocessing approach to eliminate the noise content. Furthermore, the KCDC-SODLPI method employs the SE-DenseNet model for extracting intricate patterns from the input images. Moreover, the SO model is used to tune the hyperparameter of the SE-DenseNet method. Finally, the bidirectional long shortterm memory (BiLSTM) model is implemented to detect and classify kidney cancer. The performance of the KCDC-SODLPI technique is evaluated under the biomedical image dataset. The experimental validation of the KCDC-SODLPI method portrayed a superior accuracy value of 88.90% over existing models.
Citation: Almukadi W, Abdel-Khalek S, Bahaddad AA, Alghamdi AM (2025) Driven early detection of chronic kidney cancer disease based on machine learning technique. PLoS One 20(7): e0326080. https://doi.org/10.1371/journal.pone.0326080
Editor: Xu Yanwu, South China University of Technology, CHINA
Received: December 19, 2024; Accepted: May 25, 2025; Published: July 15, 2025
Copyright: © 2025 Almukadi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data underlying the results presented in the study are available from https://drive.google.com/drive/folders/1BQZGKHKOUji9rDA5IVltlLnXLk7_Elic?usp=share_link.
Funding: This work was funded by the University of Jeddah, Jeddah, Saudi Arabia, under grant No. (UJ-24-DR-2074-1). Therefore, the authors thank the University of Jeddah for its technical and financial support.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Kidney cancer is a general kind of cancer, which is highly responsible for nearly 2.4 percent of cancer. The American Cancer Society calculates that 73,750 novel patients have been identified with this melanoma in the US, and around 14,830 persons will die of this illness [1]. RCC is a significantly prevalent kind and the reason for almost 85 percent of kidney cancer. RCC becomes an assorted type of cancer with dissimilar molecular features, reactions to treatment, histology, and medical results [2]. The most general sub-types of RCC are chromophobe (4–8%), clear cell (70–80%), papillary (14–17%), and clear cell papillary RCC (4%). These RCC sub-types can mainly depend upon the morphological feature experiential on histological training marked with eosin and hematoxylin (H&E strain) [3]. Compared to other sub-types, clear cell RCC and clear cell papillary RCC are the most critical morphologic overlay, particularly the common occurrence of clear cells. The difference between clear cell papillary RCC and clear cell RCC will be vital to define suitable patient administration. Clear cell RCC has a poor prediction owing to the higher danger of spreading with metastasis [4]. The physical ranking of complex histologic patterns of medical images is a tiresome challenge, and it is exposed to flaws and mistakes, so it needs an expert pathologist [5]. A complete automatic and exact model of ranking kidney tumours from HIs is a higher demand for classifying malignant cancers. DL is used to analyze the HIs of the liver, breast, kidney, and different parts, and it contains many tasks like nuclei segmentation, recognition and classification of cancer sub-types, and ranking. DL is the most significant machine learning (ML) technique, which can mechanically learn numerous patterns and features without human involvement [6].
DL enabled the construction of analytical methods for the initial analysis of cancer illness, and experts employed verified pattern analysis models. DL models have beaten classical ML owing to their higher accurate outcomes. Furthermore, it often equals or exceeds the performance of humans. So, they are suggested as the optimal model for imageries [7]. There has been an enormous awareness of image processing, particularly in the medical domain, because radiology was mainly affected by removing beneficial data from images. The convolutional neural network (CNN) was commonly employed to remove image features and identify dissimilar features. It functions as the source of weight sharing. The convolutional is a vital part that clarifies how a single function affects each other [8]. The dimension, amount of imageries, the number of functioning layers, and the method of activation function employed in CNN may differ. CNN variables are nominated experimentally on an experimental and error base. Also, each CNN contains numerous layers; the most significant are the pooling and convolution layers [9]. The complexity of RCC arises from its diverse subtypes, each with distinct molecular characteristics and treatment responses, making early detection and personalized treatment crucial. Accurate and timely detection of RCC is essential for improving patient outcomes, yet current diagnostic methods often rely on histological analysis, which can be subjective and time-consuming. The integration of ML techniques presents a promising solution for improving the precision and efficiency of early detection, enabling better prognosis and treatment planning for patients with kidney cancer [10].
This study presents the Kidney Cancer Detection and Classification employing a Snake Optimizer with Deep Learning on Pathological Images (KCDC-SODLPI) technique. The main aim of the KCDC-SODLPI method is to analyze the pathological images to determine the presence of kidney cancer. In the multifaceted process, the KCDC-SODLPI technique utilizes a Gaussian filtering (GF)-based image preprocessing approach to eliminate the noise content. Furthermore, the KCDC-SODLPI method employs the SE-DenseNet model for extracting intricate patterns from the input images. Moreover, the SO model is used to tune the hyperparameter of the SE-DenseNet method. Finally, the bidirectional long short-term memory (BiLSTM) model is implemented to detect and classify kidney cancer. The performance of the KCDC-SODLPI technique is evaluated under the biomedical image dataset. The major contribution of the KCDC-SODLPI technique is listed below.
- The KCDC-SODLPI model utilizes GF for image preprocessing, effectually mitigating noise and improving image quality. This improves the accuracy of kidney cancer detection in HIs. By ensuring more precise and reliable input, the model supports better analysis and classification of kidney cancer.
- The KCDC-SODLPI technique employs SE-DenseNet for feature extraction, allowing the model to capture intrinsic patterns from input images. This methodology improves the detection of subtle details that are significant for detecting kidney cancer. Utilizing this technique enhances the model’s ability to differentiate between normal and cancerous tissue in HIs.
- The KCDC-SODLPI methodology utilizes the SO method for hyperparameter tuning, improving the performance of the SE-DenseNet model. This optimization enhances the technique’s accuracy and efficiency in extracting relevant features from HIs. By fine-tuning the model, SO ensures more reliable and precise kidney cancer detection.
- The KCDC-SODLPI model utilizes the BiLSTM approach to detect and classify kidney cancer, improving its capability to capture both forward and backward temporal dependencies. This approach allows the model to better comprehend the sequential nature of histopathological patterns. As a result, the system becomes more accurate in detecting and classifying cancerous features in the images.
- The novelty of the KCDC-SODLPI technique is in its integration of advanced DL techniques, such as SE-DenseNet, BiLSTM, and SO. This fusion improves both the accuracy and effectiveness of analyzing HIs. By employing these cutting-edge approaches, the model efficiently addresses the complexity of cancer detection, providing a robust outcome for precise classification.
2. Related works
Mahootiha et al. [11] implemented a DL method comprising two networks such as a survival and classifier network. The 3D CNN method was employed. The technique also deployed the ISUP grading features extraction by the model as the input to the survival network. A DL-based method was also used. A discrete Logistic Hazard-based loss was also used to extract RCC cancer’s complex survival features from CT images. In [12], an innovative multiscale weakly-supervised DL method was employed. The technique was utilized to implement the RGB-histogram requirement stain normalization. Afterwards, the method follows the several-instance learning technique by separating the input data into numerous overlapping models for maintaining the tissue connections. In conclusion, three multiscale CNN methods have been trained, and the decision fusion method was implemented for the last classification. Hao et al. [13] implemented whole-slide images (WSIs) scanned through light microscopy and immune-fluorescence images for categorizing patients with glomerulonephritis. The model mainly comprises techniques like multimodal fusion, self-reliance coefficient extraction, and glomerular segmentation. This architecture primarily recognizes and segments the glomerulus, followed by glomerular methods trained to extract the features of every glomerulus. In [14], the authors developed a wide-ranging DL method. The developed architectures include three methods: survival prediction, clinical variable selection, and 3D image feature extractor. According to the 3D-CNN model, the feature extractor method forecasts the ISUP evaluation of RCC related to death rates in CT images. Clinical variables have been analytically chosen to employ the random forest (RF) and Spearman score significance score as standards. Chanchal et al. [15] examined a strong and computationally effective fully automatic RCC Grading Network (RCCGNet) method. This method comprises a shared channel residual (SCR) block that permits the model to learn feature maps related to various input forms in 2 parallel manners. The SCR block transfers the data between 2 diverse layers and functions the exchanged information individually by offering useful complements to one another.
In [16], a classification technique was developed by combining the features in a DL method with the removed texture features. Five texture features from 3 texture feature types have been implemented to match Alex-Net for the wide-ranging analysis of the combination of the texture and deep features. The texture features could be obtained from (i) statistic feature types like local binary pattern (LBP), grey-level cooccurrence matrix (GLCM), and histogram of gradient (HOG); (ii) transform based texture feature type such as Gabor filters; and (iii) system-based texture feature type: Markov random field (MRF). Mukashyaka et al. [17] projected an unsupervised and rapid SAMPLER technique. Image-level views of SAMPLER have been produced by encoding the collective distribution process of multiscale tile-level features. Besides, non-small cell lung carcinoma (NSCLC), slide-level representations of breast carcinoma (BRCA), and RCC-WSIs of The Cancer Genome Atlas (TCGA) have been implemented. Additionally, BRCA and NSCLC methods could be externally confirmed under frozen WSIs. In [18], a hybrid multi-instance learning system was developed, dependent upon the Transformer and Graph Attention Network (GAN) termed TGMIL. A feature pyramid called MMFP was primarily designed with several lower magnifications of WSI. Secondarily, TGMIL combines the abilities of TGMIL. Employing the GAN method, a simple and effective method utilizing the mean pooling and max pooling, produces the graph adjacency matrix. In conclusion, the outcomes of the two methods of TGMIL could be integrated. Kumar et al. [19] present an advanced approach for kidney disease detection using RF and gradient boosting (GB) approaches.
Uddin et al. [20] propose an ML method utilizing algorithms like support vector machine (SVM), Decision Tree (DT), RF, and k-nearest neighbour (KNN). Kadhim and Mohammed [21] develop ML models, SVM and MLP, for precise kidney cancer classification using CT images. It also utilizes GLCM and Gabor filters for feature extraction and data preprocessing to improve model reliability and prevent overfitting. Praveen et al. [22] propose an ML-based Neuro-Fuzzy model for early CKD prediction, utilizing image processing to detect fibrosis in kidney tissues. Singh and Gulati [23] aim to predict chronic diseases utilizing ML, specifically CNNs for feature extraction and K-nearest neighbour for classification. Data preprocessing, noise removal, and training/testing splits are employed to improve model accuracy. Zheng et al. [24] develop and evaluate four ML approaches, namely logistic regression (LR), RF, neural networks, and XGBoost, for predicting CKD progression, using AUC-ROC to assess model performance. Jayashree and Anitha [25] introduce a novel ML methodology incorporating quantum mutual correlation and radial basis extreme learning for chronic kidney disease (CKD) diagnosis. It extracts key features, utilizes quantum mutual information for selection, and uses radial basis extreme learning for classification, addressing data imbalance and computational challenges. Liu et al. [26] explore strategies for incorporating multi-omics data with ML to improve kidney disease prediction, understand molecular mechanisms, and apply image recognition in renal pathology. Rojas et al. [27] develop an interpretable ML techniques by utilizing XGBoost to predict glomerular filtration rate (GFR), integrating SHapley Additive exPlanation (SHAP) values for model interpretability and validation. Ghosh and Khandoker [28] develop a predictive model for CKD using ML methods, leveraging clinical data. For model interpretability, five ML techniques were utilized, with SHAP and Local Interpretable Model-agnostic Explanations (LIME).
Despite the improvements in ML for kidney disease prediction, various limitations and gaps remain. Many existing methods face difficulty with data imbalance, making it difficult to precisely predict outcomes in underrepresented groups. Furthermore, some models lack interpretability, which is significant for clinical adoption. There is also a requirement for more robust techniques to handle the high dimensionality of multi-omics and imaging data. Many approaches depend on a single input data type, whereas integrating several data sources such as clinical, demographic, and genomic information could improve model performance. Moreover, most studies focus on small datasets, limiting the generalizability and external validation of the models. Finally, there is restricted exploration of real-time prediction and deployment of these models in clinical settings.
3. The proposed method
This article develops the KCDC-SODLPI method, which analyzes pathological images to determine the presence of kidney cancer. The multifaceted process of the KCDC-SODLPI technique contains different processes, such as image preprocessing, SE-DenseNet-based feature extractor, SO-based parameter tuning, and BiLSTM-based classification. Fig 1 represents the workflow of the KCDC-SODLPI model.
3.1. GF-based image preprocessing
Initially, the KCDC-SODLPI technique begins with a GF-based image preprocessing approach to eliminate the noise content [29]. This model is chosen for its capability to effectively mitigate noise while preserving significant image details, which is crucial for accurate kidney cancer detection in HIs. Unlike other methods that blur or distort critical features, GF smooths the image by averaging pixel values in a local neighbourhood, ensuring that the fine details related to cancerous patterns are retained. This preprocessing step improves the quality of input images, resulting in more reliable feature extraction in subsequent stages. Additionally, GF-based filtering is computationally efficient, making it ideal for massive datasets typical in medical imaging. Its simplicity and efficiency in removing high-frequency noise make it a preferred choice for cancer detection. Fig 2 demonstrates the structure of the GF model.
GF is a predominant image preprocessing method that can increase the quality of digital photos by mitigating noise and improving crucial features. This system exploits a GF kernel, a bell-shaped precise function with the input image. The convolution method employs a weighted average to the pixel values, with the weights determined by the Gaussian distribution. Higher-frequency noise will be effectively smoothed out, and the overall image should be more uniform and visually attractive. The primary significant benefit of GF is its capability to preserve critical structural details while lessening adverse artefacts. The smoothing impact accomplished through convolution is mainly valuable in decreasing the effect of random differences in pixel intensities like Gaussian noise or salt-and-pepper noise. This is particularly advantageous in these conditions where the raw image data will be degraded in transmission or acquisition.
3.2. SE-densenet feature extractor
The KCDC-SODLPI technique utilizes the SE-DenseNet model to extract intricate patterns of the input images [30]. This model was chosen for its capability to capture intrinsic patterns in HIs due to its dense connectivity and attention mechanism. DenseNet ensures effective feature propagation by connecting each layer to every other layer, enhancing gradient flow and mitigating the risk of vanishing gradients, which is critical when dealing with deep networks. The addition of the SE block improves this by adaptively recalibrating channel-wise feature responses, allowing the model to concentrate on more informative features. This makes SE-DenseNet appropriate for detecting subtle details in kidney cancer images, giving superior accuracy to conventional CNNs or even simpler DenseNet models. Integrating these advanced features results in enhanced performance in terms of accuracy and efficiency, making it an ideal choice for medical image analysis. Fig 3 illustrates the steps involved in the SE-DenseNet model.
DenseNet introduces the dense connectivity concept that can enhance the performance of a network without computation and increase parameters compared to classical network structures. There may be bottleneck data during information transmission in classical network structure because the neuron can only receive information from the feature map of the previous layer. In DenseNet, all the layers receive a feature map from each prior layer; hence, the data from the historical layer will be used better. Furthermore, dense connectivity can improve network training stability and generalization ability and help prevent gradient vanishing. Classical networks like ResNet use the dotted and solid residual connection to add the convolutional structure’s input and output, which is later fed into the following layer. The mapping equation is expressed as:
The output of layer, as
, is attained by concatenating the output of each prior layer as
and passing through a nonlinear function
. The dense connection enables each layer to be mutually connected and concatenates the output of all prior layers as the input to the following layer. The mapping formula is given by:
In Eq. (2), denotes the output of
layers,
shows the output concatenation of the
layer and the result of each prior layer, and
is a nonlinear conversion function.
The DenseNet has 3 Transition Layers and 4 DenseBlocks and is divided into various versions based on the number of pooling and convolutional layers, like DenseNetl69, DenseNet201, DenseNetl21, and so on. This study applies the small DenseNetl21 with fewer parameters and layers. The Transition Layer has pooling and convolutional layers connecting 2 Dense Blocks. The pooling layer compresses the model, and the convolutional layer decreases the sets of features. The Dense Block has 4 Dense Layers. First, the input is normalized using
and passes over the activation function (ReLU), followed by the l
l convolutional layer to reduce computation cost. Later, it passes over another ReLU and BN and is given as output via
×
convolutional layers.
In the domain of CV, the Attention mechanism must be gaining popularity. The concept behind the attention mechanism is to devise a weight distribution for the original feature mapping and exploit it to the feature mapping. The SENet allocates weight to different image positions from the channel domain perspective to attain higher feature data via the weight matrix.
From Eq. (3), signifies the feature map of
channels in the input feature;
and
describe the width and height correspondingly;
indicates the feature mapping value at the coordinate location. Once the SENet is embedded with the specific layer, the feature map received is subjected first to
function, where
refers to the global average pooling used to reduce the size of the feature map into lxl whilst
remains unchanged. Next,
function is implemented, which has sigmoid functions, activation functions, and FC layers that normalize the specific value of
. Lastly, after the SENet operation, F-scale is used for performing matrix multiplication between the original feature map and
, and the size is unchanged. After the convolution operation of the Dense Layer and the modified Dense Layer, SENet model is embedded into all the layers. After embedding SENet, the network integrates attention features, focuses more on important information during feature extraction, improves the model’s recognition rate, ignores irrelevant data such as background, and removes interference.
3.3. Hyperparameter tuning using SO model
The SO method performs the hyperparameter tuning of the SE-DenseNet model [31]. This model is chosen due to its effectual search mechanism and capability to navigate complex, high-dimensional spaces. SO replicates the movement of a snake, adjusting its position iteratively, which is highly efficacious for fine-tuning parameters in DL methods like SE-DenseNet. Unlike conventional optimization techniques such as grid or random search, SO dynamically adapts to the landscape of the problem, averting local minima and improving convergence toward optimal solutions. The exploration and exploitation balance of the model additionally ensures robust performance, enhancing accuracy and generalization. Additionally, the capability of the SO method to fine-tune hyperparameters with minimal computational overhead makes it an ideal choice, ensuring faster and more accurate results compared to other optimization methods. This results in an efficient and effective model optimization process. Fig 4 illustrates the SO structure.
SO is a recent metaheuristic approach stimulated by the particular mating behaviour of snakes. Temperature and food availability are substantial factors in snake behaviour, affecting mating and survival. Adequate food promotes reproductive health, and when food is abundant, snakes consider mating; however, if food is scarce, they focus on searching for it. Temperature also affects mating, with cooler environments encouraging female snakes to mate. This model has two phases: exploration and exploitation. During exploration, snakes search for food in response to environmental factors such as heat or food scarcity, while exploitation focuses on localized searches for better solutions. Mating occurs when there is sufficient food and cool temperatures, with males competing for females in combat mode, and females selecting the best mates. Successful mating leads to egg-laying and the hatching of new snakes.
The SO model initializes a population randomly within defined boundaries, segmenting it into two groups: males and females. Each individual’s position is updated based on fitness and a random number generator. The algorithm adapts the search strategy depending on temperature and food availability.
The temperature decreases with the iteration count, influencing the exploration (when food is scarce,
) and exploitation (when food is abundant,
) strategies. When food is scarce, the positions of individuals are updated utilizing a randomized exploration equation, where males and females update their positions based on foraging strength.
For males:
In Eq. (4), the location of the male individual is
, the area of randomized male individuals is
, and
is a random number within [0,1]. In Eq. (5),
denotes the foraging strength of male individuals, the fitness of
is
denotes the fitness of
individuals in the population, and a constant
is set as 0.05. When the environment is hot (Temp>0.6), individuals move towards the food, with position updates based on food location.
For females:
In Eq. (6), the location of female is
, the random area of the female is
,
indicates the random integer within [0,1]. In Eq. (7), the foraging power of female individuals is
,
denotes the fitness of
female individuals in the population,
refers to the fitness of
and a constant
is set to 0.05. Snakes are cryogenically reproduced creatures. The temperature ranges between [
]. If
, then the food is sufficient, and the temperature condition is considered.
Position update towards food:
In Eq. (8), the individual position is represented as ,
is the fittest individual location, and a constant
is set as 2. In cooler conditions (Temp<0.6), the algorithm switches to a combat or mating pattern. For combat:
Combat position updates for males:
Combat position updates for females:
In Eq. (9), the location of male is denoted as
,
, indicating the location of the fittest individuals among females, and the fighting strength of males is expressed as
. In Eq. (10), the area of
female is
,
signifies the location of fittest individuals amongst males, and
means the fighting strength of females. FM and FF are calculated using the fitness values of the best individuals in the male and female populations:
FM and FF:
In Eqs. (11–12), the fitness values of individuals in the female and male groups are denoted as and
, and m, and the individual fitness values are indicated as
. Finally, mating behaviour is incorporated, and individuals update positions based on mating capabilities
and
:
Mating position updates:
In Eqs. (13–14), , and
are the positions of
males and females in the population, and
are the mating capabilities of males and females, respectively.
In summary, the SO algorithm optimizes individuals’ positions based on environmental conditions, foraging strength, and fitness, balancing exploration and exploitation to find the optimal solution.
Fitness selection is a significant parameter that manipulates the effectiveness of the SO method. The hyperparameter selection method comprises the encoding system for evaluating the efficiency of the candidate outcomes. During this study, the SO method deliberates accuracy as the critical measure to calculate the fitness function (FF) represented below.
In mathematical expression, FP means the false positive, and TP signifies the true positive values.
3.4. BiLSTM-based classification process
In conclusion, the BiLSTM model detects and classifies kidney cancer [32]. This model is chosen because it can capture both past and future dependencies in sequential data. Unlike conventional LSTM models that only capture data from the past, BiLSTM processes data in both forward and backward directions, improving its understanding of temporal patterns and accuracy. This bidirectional approach benefits complex tasks like kidney cancer detection, where the relationship between diverse image features can be temporal and spatial. Moreover, the capacity of the BiLSTM model to effectively handle long-range dependencies assists in recognizing subtle discrepancies in HIs, resulting in more accurate classification. Compared to simpler models, BiLSTM is better suited for tasks requiring context-aware decision-making, thus giving superior performance in this application. Fig 5 illustrates the substructure of BiLSTM.
The Bi-LSTM block is the second module in the technique, comprising of a BiLSTM network, dense, and dropout layers. LSTM, a type of RNN, addresses the long-term dependency issue found in classical RNNs by using memory cells to store relevant data. Each LSTM unit includes three gates and a memory cell, which assists in controlling data flow within the hidden layer (HL). The gates determine what data should be forgotten or retained, allowing the model to efficiently process time-series data and capture long-term dependencies. Acquiring long‐term dependencies by LSTM and resolving the before-stated problem is highly crucial. The input, output, and forget are three types of gates. Then, their characteristics are briefly defined. The sigmoid function is used for data in the current input and the preceding HL
in the forget gate. This task is signified by
, which yields a value between
and
, specifying the ratio of data that must be retained.
The input gate takes the data in the existing input and preceding HL and then permits them over 2nd sigmoid function by altering this data into a value between zero and one. Also, similar data permits a function of that aids in controlling the system, returning a value between −1 and 1. Next, the sigmoid output
increased by the
output
to choose which data is significant to save. Now, sufficient data is utilized to calculate the cell state. At initial, the preceding cell state
has been multiplied by the output of the forget gate. Then, the input gate output was included, upgrading the cell state with novel values that were measured related by the system. The outcome of these dual processes is the novel cell state
.
Lastly, the output gate selects the value of subsequent HL. At initial, the data from the existing input and preceding HL permits over a 3rd sigmoid function. Next, the novel cell state licenses over a function. Both outputs have been multiplied point-wise. The yield of the output gate
is the novel HL
. So, the new cell state and HL will be taken to the next time step. Officially, the processes that occur in an LSTM are given below:
Here, means the input data in time
;
and
signify a sigmoid and hyperbolic tangent function, respectively;
embodies the HL in time
,
, and
signifies a weight and bias, respectively. After clarifying the performance of an LSTM model, the BiLSTM network functions must be defined. It can generally be the development of LSTM dependent upon bidirectional RNN. A BiLSTM model has a forward and backward LSTM, which uses input values ranging from
to
and
to
, respectively. The backward
and forward
network outputs have been calculated using the device defined beforehand for one LSTM. The BiLSTM layer yields a vector
, which is attained by using the mentioned expression:
Where a sigmoid function unites the single LSTM network output, it is noticeable that BiLSTM uses a dropout method to control overfitting. The rate of dropout has a hyperparameter to study in the description of this method. Lastly, a fully connected (FC) layer that predicts the outputs is loaded.
4. Performance validation
The validation evaluation of the KCDC-SODLPI method is examined by employing a biomedical image dataset [15,33]. The dataset encompasses 3000 instances with five classes, as defined in Table 1. Grade-0 (Cells are well organized and are expected in number), Grade-1 (Nucleoli are not noticeable even 400 × magnification Nucleoli are realized as eosinophilic at 400 × magnification and not more prominent at 100 × magnification), Grade-2 (moderately unequal contour compared to normal nuclei), Grade-3(Clearly visible cancers represents graded as grade-3), Grade-4(Rhabdoid or sarcomatoid difference). Fig 6 illustrates the instances images.
Fig 7 displays the classifier outcomes of the KCDC-SODLPI method at 80%TRAS and 20%TESS. Fig 7a-7b showcases the confusion matrices made by the KCDC-SODLPI method. This figure indicates that the KCDC-SODLPI method is exactly recognized and categorized into five classes. Moreover, Fig 7c showcases the PR result of the KCDC-SODLPI technique. This figure shows that the KCDC-SODLPI technique gains higher PR effectiveness with five classes. In conclusion, Fig 7d exemplifies the ROC result of the KCDC-SODLPI technique. This figure showed that the KCDC-SODLPI technique provides effective outcomes with increased ROC values in 5 classes.
In Table 2, the overall kidney cancer detection results of the KCDC-SODLPI model are under 80%TRAS and 20%TESS. Fig 8 represents the average outcomes of the KCDC-SODLPI technique with 80% TRAS. The figure illustrates that the KCDC-SODLPI technique gains productive performance when recognizing five classes. It is also noticed that the KCDC-SODLPI technique accomplishes an average of 85.57%,
of 65.53%,
of 64.02%,
of 62.51%, and MCC of 55.25%, respectively.
Fig 9 examines the average outcomes of the KCDC-SODLPI method with 20% TESS. The figure demonstrates that the KCDC-SODLPI method performs proficiently in recognizing five classes. The KCDC-SODLPI technique attains an average of 85.40%,
of 63.71%,
of 62.92%,
of 61.76%, and MCC of 53.85%, correspondingly.
The efficiency of the KCDC-SODLPI model at 80%TRAS and 20%TESS is graphically demonstrated in Fig 10 in the form of training accuracy (TRAA) and validation accuracy (VALA) curves. This figure displays a valuable analysis of the behaviour of the KCDC-SODLPI technique over varying epoch counts, representing its learning method and generalization capabilities. The figure assumes a constant improvement in the TRAA and VALA, with progress in epochs. It confirms the adaptive aspect of the KCDC-SODLPI technique in the pattern recognition process under TRA and TES data. The increased trends in VALA outline the capability of the KCDC-SODLPI technique to adapt to the TRA data and provide precise classification of unnoticed data, pointing out robust generalization abilities.
Fig 11 illustrates an extensive view of the training loss (TRLA) and validation loss (VALL) results of the KCDC-SODLPI technique on 80%TRAS and 20%TESS over distinct epochs. The progressive minimization in TRLA highlights the KCDC-SODLPI method, enhancing the weights and diminishing the classification error on the TRA and TES data. The figure specifies a perfect understanding of the KCDC-SODLPI method related to the TRA data, highlighting its proficiency in capturing patterns. The KCDC-SODLPI method significantly improves its parameters in decreasing the differences between the prediction and real TRA classes.
Fig 12 examines the classifier analysis of the KCDC-SODLPI method with 70%TRAS and 30%TESS. Fig 12a-12b represents the confusion matrices offered by the KCDC-SODLPI method. The figure signified that the KCDC-SODLPI method can correctly identify and categorize with five classes. Meanwhile, Fig 12c demonstrates the PR result of the KCDC-SODLPI method. This figure specifies that the KCDC-SODLPI method has gained greater PR effectiveness in five classes. Also, Fig 12d showcases the ROC result of the KCDC-SODLPI method. This figure shows that the KCDC-SODLPI technique offers efficient outcomes with increased ROC values on five classes.
A detailed overall kidney cancer detection outcomes of the KCDC-SODLPI method at 70%TRAS and 30%TESS are described in Table 3. Fig 13 displays the average outcomes of the KCDC-SODLPI method with 70% TRAS. The figure exhibits that the KCDC-SODLPI technique provides effectual performance under the recognition of five classes. This is also remarked that the KCDC-SODLPI technique realizes an average of 88.90%,
of 72.06%,
of 72.12%,
of 72.01%, and MCC of 65.14%, respectively.
Fig 14 shows the average results of the KCDC-SODLPI method with 30% TESS. The figure explains that the KCDC-SODLPI method performs excellently in recognizing five classes. This can also be perceived that the KCDC-SODLPI approach achieves an average of 87.64%,
of 68.93%,
of 69.50%,
of 68.93%, and MCC of 61.45%.
The efficiency of the KCDC-SODLPI method at 70%TRAS and 30%TESS is graphically reported in Fig 15 in the form of TRAA and VALA curves. The figure displays a functional interpretation of the behaviour of the KCDC-SODLPI technique over varying epoch counts, representing its learning process and generalization abilities. Notably, the figure steadily heightens the TRAA and VALA with progress in epochs. It ensures the adaptive aspect of the KCDC-SODLPI method in the pattern recognition process at the TRA and TES data. The higher trends in VALA outline the capability of the KCDC-SODLPI method to adapt to the TRA data and also surpass in offering precise classification on unnoticed data, exhibiting robust generalization abilities.
Fig 16 represents the TRLA and VALL results of the KCDC-SODLPI technique under 70%TRAS and 30%TESS over distinct epochs. The progressive reduction in TRLA highlights the KCDC-SODLPI technique, which improves weights and reduces classification errors with TRA and TES data. The figure specifies a better understanding of the KCDC-SODLPI method relevant to the TRA data, highlighting its proficiency in capturing patterns. The KCDC-SODLPI method continually improves its parameters, lessening the differences between the prediction and real TRA classes.
Table 4 and Fig 17 [15] provide a wide-ranging comparative result of the KCDC-SODLPI method with recent techniques. The results highlighted that the ShuffleNet and LiverNet models have shown the lowest performance over other models. The ResNet50, IncResV2, NASNet, BHCNet, and BreastNet models have shown considerable performance. However, the KCDC-SODLPI technique gains maximum performance over others with increased of 72.06%,
of 72.12%,
of 72.01%, and
of 88.90%.
Therefore, the KCDC-SODLPI technique is used to automate the detection and classification of kidney cancer on medical images.
The performance of several models is evaluated in terms of computational time (CT), measured in seconds in Table 5 and Fig 18, with the results indicating the efficiency of each model. ResNet50, IncResV2, NASNet, and other models, such as BHCNet, BreastNet, and LiverNet, exhibit varying CTs, ranging from 6.93 to 9.52 seconds. Notably, the KCDC-SODLPI model outperforms all the other models with the lowest CT of 5.93 seconds, emphasizing its superior efficiency. This faster CT suggests that the KCDC-SODLPI model is effective in detecting and classifying kidney cancer and optimized for quicker processing, making it appropriate for real-time applications in clinical settings.
5. Conclusion
In this article, the KCDC-SODLPI method is developed. The primary aim of the KCDC-SODLPI technique is to analyze the pathological images to determine the presence of kidney cancer. The KCDC-SODLPI technique employs a GF-based image preprocessing approach to eliminate the noise content in the multifaceted process. Besides, the KCDC-SODLPI method utilizes the SE-DenseNet model to extract intricate patterns from the input images. Moreover, the SO model is used to tune the hyperparameter of the SE-DenseNet method. Finally, the BiLSTM model is used to detect and classify kidney cancer. The performance of the KCDC-SODLPI technique is evaluated under the biomedical image dataset. The experimental validation of the KCDC-SODLPI method portrayed a superior accuracy value of 88.90% over existing models. The limitations of the KCDC-SODLPI method comprise various key challenges that affect its broader application. First, the dataset used for model training may not be diverse enough to represent the full spectrum of CKD patients, limiting the generalizability of the results. Furthermore, despite the high accuracy, the model’s performance may decrease when applied to real-world clinical settings due to variability in data quality and patient demographics. Moreover, the models could face difficulty handling missing or incomplete data, which is common in healthcare datasets. The lack of interpretability in some ML models also hinders clinical adoption. Future work should expand datasets to comprise a broader range of populations, enhance model robustness in real-world conditions, and incorporate explainable AI techniques to improve trust and usability in clinical decision-making. Furthermore, exploring real-time prediction systems and integrating longitudinal data could improve early CKD detection and management.
References
- 1. Lal S, Chanchal AK, Kini J, Upadhyay GK. FPGA implementation of deep learning architecture for kidney cancer detection from histopathological images. Multimedia Tools and Applications. 2024;1–19.
- 2. Bolocan V-O, Secareanu M, Sava E, Medar C, Manolescu LSC, Cătălin Rașcu A-Ștefan, et al. Convolutional Neural Network Model for Segmentation and Classification of Clear Cell Renal Cell Carcinoma Based on Multiphase CT Images. J Imaging. 2023;9(12):280. pmid:38132698
- 3. Deng W, He X, Wang R, Ding B, Dai S, Wei C, et al. Application of new optical imaging equipment based on deep learning in kidney tumor image segmentation and recognition. Optical and Quantum Electronics. 2024;56(4):522.
- 4. Lin Z, Yang W, Zhang W, Jiang C, Chu J, Yang J, et al. Recognizing pathology of renal tumor from macroscopic cross-section image by deep learning. Biomed Eng Online. 2023;22(1):3. pmid:36670469
- 5.
Anari PY, Obiezu F, Lay N, Firouzabadi FD, Chaurasia A, Golagha M, et al. Using YOLO v7 to Detect Kidney in Magnetic Resonance Imaging: A Supervised Contrastive Learning. arXiv preprint. 2024. https://doi.org/arXiv:2402.05817
- 6. Bhattacharjee A, Rabea S, Bhattacharjee A, Elkaeed EB, Murugan R, Selim HMRM, et al. A multi-class deep learning model for early lung cancer and chronic kidney disease detection using computed tomography images. Front Oncol. 2023;13:1193746. pmid:37333825
- 7. Abdeltawab HA, Khalifa FA, Ghazal MA, Cheng L, El-Baz AS, Gondim DD. A deep learning framework for automated classification of histopathological kidney whole-slide images. J Pathol Inform. 2022;13:100093. pmid:36268061
- 8. Yang K, Chang S, Wang Y, Yang J, Liu S, Liu K, et al. Using MsfNet to predict the ISUP grade of renal clear cell carcinoma in digital pathology images. Computers, Materials & Continua. 2024;78(1).
- 9. Mustafa MY, Algamal ZY. Neutrosophic inverse power Lindley distribution: A modeling and application for bladder cancer patients. Full Length Article. 2023;21(2):216–216.
- 10. Javaheri T, Heidari S, Yang X, Yerra S, Seidi K, Gharib MH, et al. XKidneyOnco: An Explainable Framework to Classify Renal Oncocytoma and Chromophobe Renal Cell Carcinoma with a Small Sample Size. bioRxiv. 2024;2024–01.
- 11. Mahootiha M, Qadir HA, Aghayan D, Fretland ÅA, von Gohren Edwin B, Balasingham I. Deep learning-assisted survival prognosis in renal cancer: A CT scan-based personalized approach. Heliyon. 2024;10(2).
- 12. Abu Haeyeh Y, Ghazal M, El-Baz A, Talaat IM. Development and Evaluation of a Novel Deep-Learning-Based Framework for the Classification of Renal Histopathology Images. Bioengineering (Basel). 2022;9(9):423. pmid:36134972
- 13. Hao F, Liu X, Li M, Han W. Accurate kidney pathological image classification method based on deep learning and multimodal fusion method with application to membranous nephropathy. Life. 2023;13(2):399.
- 14. Mahootiha M, Qadir HA, Bergsland J, Balasingham I. Multimodal deep learning for personalized renal cell carcinoma prognosis: Integrating CT imaging and clinical data. Comput Methods Programs Biomed. 2024;244:107978. pmid:38113804
- 15. Chanchal AK, Lal S, Kumar R, Kwak JT, Kini J. A novel dataset and efficient deep learning framework for automated grading of renal cell carcinoma from kidney histopathology images. Sci Rep. 2023;13(1):5728. pmid:37029115
- 16. Cai J, Liu M, Zhang Q, Shao Z, Zhou J, Guo Y, et al. Renal Cancer Detection: Fusing Deep and Texture Features from Histopathology Images. BioMed Research International. 2022;2022.
- 17. Mukashyaka P, Sheridan TB, Foroughi Pour A, Chuang JH. SAMPLER: unsupervised representations for rapid analysis of whole slide tissue images. EBioMedicine. 2024;99:104908. pmid:38101298
- 18. Sun X, Li W, Fu B, Peng Y, He J, Wang L, et al. TGMIL: A hybrid multi-instance learning model based on the Transformer and the Graph Attention Network for whole-slide images classification of renal cell carcinoma. Comput Methods Programs Biomed. 2023;242:107789. pmid:37722310
- 19.
Kumar M, Khare N, Mani S, Bhakta M, Saha G. A Comprehensive Approach for Enhancing Kidney Disease Detection Using Random Forest and Gradient Boosting. Genomics at the Nexus of AI, Computer Vision, and Machine Learning. 2025. 395–416.
- 20. Uddin KMM, Nahid MdNH, Ullah MdMH, Mazumder B, Khan MdSI, Dey SK. Machine Learning-Based Chronic Kidney Cancer Prediction Application: A Predictive Analytics Approach. Biomedical Materials & Devices. 2024;2(2):1028–48.
- 21. Kadhim DA, Mohammed MA. Advanced Machine Learning Models for Accurate Kidney Cancer Classification Using CT Images. Mesopotamian Journal of Big Data. 2025;2025:1–25.
- 22. Praveen SP, Jyothi VE, Anuradha C, Venu Gopal K, Shariff V, Sindhura S. Chronic kidney disease prediction using ML-based neuro-fuzzy model. International Journal of Image and Graphics. 2024;24(06):2340013.
- 23.
Singh Y, Gulati N. Machine Learning Techniques for Accurate Prediction and Detection of Chronic Diseases. Machine Learning in Multimedia. CRC Press; 2025. 1–21.
- 24. Zheng J-X, Li X, Zhu J, Guan S-Y, Zhang S-X, Wang W-M. Interpretable machine learning for predicting chronic kidney disease progression risk. Digit Health. 2024;10:20552076231224225. pmid:38235416
- 25. Jayashree M, Anitha N. Conceptual metaphor quantum correlation and radial basis extreme learning for predicting chronic kidney disease. Computers and Electrical Engineering. 2025;122:109933.
- 26. Liu X, Shi J, Jiao Y, An J, Tian J, Yang Y, et al. Integrated multi-omics with machine learning to uncover the intricacies of kidney disease. Brief Bioinform. 2024;25(5):bbae364. pmid:39082652
- 27. Rojas LH, Pereira-Morales AJ, Amador W, Montenegro A, Buelvas W, de la Espriella V. Development and validation of interpretable machine learning models to predict glomerular filtration rate in chronic kidney disease Colombian patients. Ann Clin Biochem. 2025;62(1):57–66. pmid:39242084
- 28. Ghosh SK, Khandoker AH. Investigation on explainable machine learning models to predict chronic kidney diseases. Sci Rep. 2024;14(1):3687. pmid:38355876
- 29. Suryanarayana G, Chandran K, Khalaf OI, Alotaibi Y, Alsufyani A, Alghamdi SA. Accurate Magnetic Resonance Image Super-Resolution Using Deep Networks and Gaussian Filtering in the Stationary Wavelet Domain. IEEE Access. 2021;9:71406–17.
- 30. Hu B, Tang F, Tan S. Minority Clothing Recognition based on Improved DenseNet. Journal of Computing and Electronic Information Management. 2023;10(2):52–6.
- 31. Zhang SH, Wang JS, Zhang SW, Li YX, Xing YX, Zhang YH. Snake optimizer with oscillating factors to solve edge computing task unloading and scheduling optimization problems. Alexandria Engineering Journal. 2024;91:273–304.
- 32. Méndez M, Merayo MG, Núñez M. Long-term traffic flow forecasting using a hybrid CNN-BiLSTM model. Engineering Applications of Artificial Intelligence. 2023;121:106041.
- 33. https://github.com/shyamfec/RCCGNet