Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Short noncoding RNAs as predictive biomarkers for the development from inflammatory bowel disease unclassified to Crohn’s disease or ulcerative colitis

  • Jaslin P. James ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft

    jaslin.pallikkunnath.james@regionh.dk

    Affiliation Department of Pathology, Herlev University Hospital, Herlev, Denmark

  • Lene Buhl Riis,

    Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing

    Affiliations Department of Pathology, Herlev University Hospital, Herlev, Denmark, Institute of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark

  • Rolf Søkilde,

    Roles Investigation, Methodology, Software, Validation, Writing – review & editing

    Affiliation Bioneer A/S, Hørsholm, Kogle Allé 2, Hørsholm, Denmark

  • Mikkel Malham,

    Roles Conceptualization, Resources, Supervision, Writing – review & editing

    Affiliations The Pediatric Department, Copenhagen University Hospital—Amager and Hvidovre, Hvidovre, Denmark, Copenhagen Center for Inflammatory Bowel Disease in Children, Adolescents and Adults, Hvidovre Hospital, University of Copenhagen, Hvidovre, Denmark

  • Estrid Høgdall,

    Roles Conceptualization, Project administration, Supervision, Writing – review & editing

    Affiliations Department of Pathology, Herlev University Hospital, Herlev, Denmark, Institute of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark

  • Ebbe Langholz,

    Roles Conceptualization, Resources, Supervision, Writing – review & editing

    Affiliations Institute of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark, Gastroenheden, Herlev University Hospital, Herlev, Denmark

  • Boye Schnack Nielsen

    Roles Conceptualization, Methodology, Resources, Supervision, Writing – review & editing

    Affiliation Bioneer A/S, Hørsholm, Kogle Allé 2, Hørsholm, Denmark

Abstract

Numerous pathogenic processes are mediated by short noncoding RNAs (sncRNA). Twenty percent of inflammatory bowel disease (IBD) patients are labelled as IBD unclassified (IBDU) at disease onset. Most IBDU patients are reclassified as Crohn’s disease (CD) or ulcerative colitis (UC) within few years. Since the therapeutic methods for CD and UC differ, biomarkers that can forecast the categorization of IBDU into CD or UC are highly desired. Here, we investigated whether sncRNAs can predict CD or UC among IBDU patients. 35 IBDU patients who were initially diagnosed with IBDU were included in this retrospective investigation; of them, 12, 15, and 8 were reclassified into CD (IBDU-CD), UC (IBDU-UC), or remained as IBDU (IBDU-IBDU), respectively. Eight IBD patients, were included as references. SncRNA profiling on RNA from mucosal biopsies were performed using Affymetrix miRNA 4.0 array. Selected probe sets were validated using RT-qPCR. Among all patients and only adults, 306 and 499 probe sets respectively were differentially expressed between IBDU-CD and IBDU-UC. Six of the probe sets were evaluated by RT-qPCR, of which miR-182-5p, miR-451a and ENSG00000239080 (snoU13) together with age and sex resulted in an AUC of 78.6% (95% CI: 60–97) in discriminating IBDU-CD from IBDU-UC. Based on the three sncRNAs profile it is possible to predict if IBDU patients within 3 years will be reclassified as CD or UC. We showed that the expression profile of IBDU patients differ from that of definite CD or UC, suggesting that a subgroup of IBDU patients may compose a third unique IBD subtype.

Introduction

Inflammatory bowel disease unclassified (IBDU) is a subtype of inflammatory bowel disease (IBD) alongside ulcerative colitis (UC) and Crohn’s disease (CD). IBDU is characterised by clinical and endoscopic symptoms of chronic colitis without clear distinct characteristics of UC or CD, but rather modest features of both [1]. Differential diagnosis is crucial for appropriate clinical management of IBD patients since each subtype of IBD entails unique therapeutic strategies [2,3]. Based on a number of epidemiological studies, Tontini estimated that 10% of IBD patients in Europe and North America are diagnosed with IBDU [4]. Among paediatric IBDU patients, IBDU is twice as prevalent as adult-onset IBDU, with younger ages having the highest prevalence [5]. Up to 80% of paediatric IBDU patients will later be diagnosed with either CD or UC [5,6]. This suggests that some IBDU patients may represent early manifestations of CD or UC [7]. Interestingly, another study has shown up to 69% of paediatric IBDU patients remain diagnosed as IBDU into their adulthood [8]. The dynamics and the eluding character of IBDU suggest that a specific diagnosis at disease onset is a moving target, thus preventing identification of strong molecular profiles for differential diagnosis of IBDU patients.

We and others have previously reported on genetic and epigenetic biomarkers for differentiating CD and UC samples based on mucosal tissue biopsies [4,9,10]. In line with findings regarding gene expression in inflamed mucosa, IBD genetics have emphasized a shared T helper (Th) 17/Interleukin (IL) 23 pathway for CD and UC, and showed relatively few disease-specific mechanisms, such as barrier integrity relating to UC and autophagy relating to CD [1114]. This level of genetic risk sharing suggests that nearly all the susceptibility loci associated with IBD may also have an impact on both IBD subtypes, which limits the use of genetic panels to resolve discrimination of CD and UC. The majority of the single nucleotide polymorphisms (SNPs) associated with IBD are found in noncoding areas of the genome, suggesting an indirect function in the control of gene expression [15]. The early epigenetic studies in IBD identified differential expression of small noncoding RNAs such as microRNAs (miRNAs) in UC colonic mucosa compared to normal mucosa [16]. Combinations of epigenetic alterations may result in activation of genes that promote chronic inflammation or of blocking anti-inflammatory genes [17]. Certain mRNA levels have been demonstrated to be closely controlled by miRNAs resulting in significant changes in protein synthesis [18], for example Let-7e and Let-7f mediated dysregulation of IL-23R signalling linked to an SNP in the IL-23R gene that is associated with IBD susceptibility [19]. E.g. long noncoding RNAs, miRNAs, and circular RNAs have been studied both as biomarkers and as therapeutic targets in IBD [20,21]. MiRNAs and snoRNAs constitute two main classes of short noncoding RNAs (sncRNAs). In contrast to miRNAs that contain 18–22 nucleotides, snoRNAs have 60 to 300 nucleotides. Many miRNAs mediate post-transcriptional gene silencing by controlling translation [22,23], whereas the majority of snoRNAs function as guide RNAs at the post-transcriptional level by modification of nucleotides in various RNAs, including mRNAs [24]. We recently reviewed microRNAs as prognostic, diagnostic, and predictive biomarkers in IBD [9] and found that several studies have explored the potential of blood, tissue or saliva-derived miRNAs as biomarkers for differentiating CD from UC. SnoRNAs have been found dysregulated in inflammation [25,26], and in one study, snoRNAs were found to be involved in the progression of UC [27].

In this study we assembled endoscopy biopsies from adults and paediatric patients diagnosed with IBDU. 36% of the IBDU cases were reclassified as CD (IBDU-CD), 42% of the IBDU cases as UC (IBDU-UC) and 22% remained as IBDU (IBDU-IBDU). We explored the predictive value of sncRNAs, in the preliminary diagnostic workup in IBDU patients, and report a three-gene-target RT-qPCR that can help in the prediction of classification of IBDU patients.

Materials and methods

This study was a retrospective study examining differential expression of sncRNAs in mucosal tissue biopsies collected at time of IBD diagnosis as IBDU.

Ethical considerations

The study has been approved by the National Ethical Committee (j. nr. H-20032221) and the Danish Data Protection Agency (P-2020-737). The study was conducted according to the declaration of Helsinki, in accordance with the ethical standards and according to national and international guidelines.

Samples

Archived formalin-fixed paraffin-embedded (FFPE) mucosal tissue biopsies from 35 IBDU patients and 8 IBD patients were used in the discovery study. Multiple biopsies from the affected parts of the colon or rectum were collected by colonoscopy performed as part of the diagnostic work-up and stored in clinical biobanks at hospitals in the capital region of Denmark. One biopsy per patient was chosen based on the availability in the biobank and uniform location as possible from the affected tissue. Based on the patient charts, the samples from IBDU patients were reclassified into CDs or UCs or remained as IBDU after minimum 3 years of follow up, i.e. 12 IBDU patients were reclassified as CD (IBDU-CD) and 15 IBDU patients as UC (IBDU-UC). Eight IBDU patients remained as IBDU until 2022. All IBDU samples had thorough clinical evaluation including gastroduodenoscopy, colonoscopy, and small bowel MRI at the time of diagnosis in order to significantly reduce the risk of misclassification. Diagnosis of IBDU was based on Porto criteria [28] (children) or Copenhagen criteria (adults [29,30]). The FFPE blocks used in this study were collected from 1998 to 2018, and the study was conducted in 2022 to 2023. Only LBR, MM and EL had access to the patient files during the study. All patients included in the study were followed at Department of Gastroenterology, Copenhagen University Hospital—Herlev and Gentofte, Denmark or the Department of Paediatric and Adolescence Medicine, Copenhagen University Hospital—Hvidovre, Denmark. According to the amount of total RNA remaining after the discovery study, 32 IBDU samples (11 IBDU-CDs, 14 IBDU-UCs and 7 IBDU-IBDUs) were available for RT-qPCR validation.

RNA extraction

Total RNA was extracted from two 10μm thick sections of FFPE tissue samples using MagMAX™ FFPE DNA/RNA Ultra Kit from Applied Biosystems™ for the microarray discovery study and MiRNeasy FFPE Kit from Qiagen for the RT-qPCR validation study. Both RNA extraction protocols were performed according to the manufacturer’s instructions. RNA quantity and purity were measured using the NanoDrop spectrophotometer (ThermoFisher Scientific) and Qubit RNA HS Assay kit (Life Technologies). Total RNA samples were stored at −80°C until further use.

MiRNA microarray

Expression levels of miRNAs in the discovery study samples were assessed using GeneChip® microarray and Flashtag™ Bundle (Affymetrix, CA, USA). Labeling for miRNAs with FlashTag Biotin HSR was performed with samples containing 130ng of total RNA in 8μl of nuclease free water according to the manufacturer’s instructions. Samples were then hybridized to GeneChipTM miRNA 4.0 microarrays. Afterwards the miRNA microarray chips were washed and stained using the Genechip Fluidics Station 450. Subsequently miRNA microarray chips were scanned using an Affymetrix GeneChip 7G scanner.

RT-qPCR

Primer design.

Sequences for RT-qPCR primers for miR-182-5p, miR-451a, miR-628-5p, miR-1298-3p, miR-4793-3p and snoU13 were obtained from miRbase (https://www.mirbase.org/) for miRNAs and Ensembl (http://www.ensembl.org/) for the snoRNA. Forward and reverse primers were designed using miRprimer software [31]. Primer3 version 4.1.0 (https://primer3.ut.ee/) was used for the design of primers for snoRNAs and RN7SL, with default settings and a Product Size Range of 40-200bp. The primers used in the custom panel are listed in S1 File. Primers were purchased from IDT as standard small-scale DNA oligos with desalting (IDTDNA).

cDNA synthesis and qPCR reaction.

cDNA synthesis was performed as described in Balcells et al. [32], with the following reagents: 10ng of total RNA, Poly-A polymerase (NEB: M0276L), MulV RT enzyme (NEB M0253L) and dNTPs (NEB N0447L). The cDNA was diluted 10-fold for the RT-qPCR. The qPCR reaction was performed in RealQ Plus 2x Master Mix Green (without Rox, Ampliqon, A323406), with cycling conditions: 1x 95°C for 15 minutes, 40 cycles of (95°C for 30sec and 60°C for 30sec) using a Light Cycler 480 system (Roche).

Statistical analysis

Microarray data analysis.

Original datafiles were retrieved from the command console as.CEL files and further data analysis was performed in RStudio with R version 4.2.2. MiRNA expression among different samples were preprocessed using robust multichip average (RMA) algorithm, including background subtraction, quantile normalization and summarization [33,34]. Lowly expressed probe sets were removed, and batch correction was performed using ComBat batch correction [35]. Probe sets were filtered for species homo sapiens. For annotation, pd.mirna.4.0 package [36] was used. Subsequently linear modeling was performed to find the differentially expressed miRNAs between IBDU-CDs and IBDU-UCs using limma package [37]. Probe sets were sorted according to the Benjamini-Hochberg adjusted-p-values (adjusted for multiple testing) and fold changes in expression values. The AUC (area under the receiver operating characteristic curve (ROC)) was calculated as a measure of discrimination, and the adjusted p values less than 0.05 were considered significant. Multivariate analysis including age, sex, disease duration, and sample location were performed. Significant probe sets were filtered for MirGeneDB 2.1 database [38,39] for further RT-qPCR based validation, except miR-4793-3p and ENSG00000239080 (snoU13). Those two probe sets were included for further RT-PCR validation as miR-4793-3p was already a common potential candidate from the discovery study in both analysis 1 and 2, and snoU13 had AUC values > 80% for both transcript forms. Microarray expression data is uploaded into the gene expression omnibus with series name GSE228622.

MiRNA differential expression from microarray data were analyzed as described above in two different set of samples, such as analysis 1 with all IBDU samples (12 IBDU-CDs, 15 IBDU-UCs, and 8 IBDU-IBDUs) and analysis 2 with only adult IBDU (9 IBDU-CDs, 8 IBDU-UCs, and 5 IBDU-IBDUs) samples. Analysis with only paediatric samples was not performed, due to the limited number of samples in this group.

RT-qPCR data analysis.

Melting curve analysis and late Cq values were used to filter for assays with missing detection. This was the case for miR-628-5p, miR-1298-3p and miR-4793-3p, which were below detection limit and hence they were excluded from further analysis. RT-qPCR data was analyzed using the 2–(ΔCt) method [40]. Primer pairs for miR-27a-3p, miR-16-5p, miR-191-5p, SNORD48 and RN7SL were used as reference assays according to previous literatures and manufacture’s recommendation [4143]. The geometric mean of all the reference probe sets were used to yield relative fold expression levels of the 7 different probe sets [44]. All statistical analysis for RT-qPCR data were performed using R. AUC values for discriminating CD from UC, using the selected sncRNAs expression values from RT-qPCR were calculated for all samples. Different generalized linear models (GLM) were created using GLM function in R and different combinations of factors such as age at the time of diagnosis, sex, and relative expression values of miR-182-5p, miR-451a, and/or snoU13. To identify which of the miRNAs could discriminate IBDU-CD from IBDU-UC, we trained the GLM models in the RT-qPCR based expression data. The AUC method was used to summarize the prediction’s sensitivity and specificity. Genes which have AUC values close to 1 (100%) are more effective at discriminating between CD and UC patients.

Results

A total of 43 archived mucosal biopsies from IBD patients are included and the patient characteristics are summarized in Table 1. Note that 35 patients (13 paediatric patients and 22 adult patients) were diagnosed as IBDU and later reclassified into either CD (n = 12), UC(n = 15), or remained as IBDU (n = 8). A graphical representation of the sample collection and workflow of the study is shown in Fig 1.

thumbnail
Fig 1. Traditional IBD diagnosis and classification.

Patient cohort selection and workflow of the study are highlighted in yellow boxes.

https://doi.org/10.1371/journal.pone.0297353.g001

thumbnail
Table 1. Patient characteristics of the discovery study and data validation cohort (subset of discovery cohort).

https://doi.org/10.1371/journal.pone.0297353.t001

Discovery study

Microarray analysis was performed on RNA isolated from endoscopic pinch biopsies of the whole sample cohort. 306 probe sets were differentially expressed between IBDU-CD and IBDU-UC (Fig 2A). We found six probe sets, including miR-182-5p, miR-3176, miR-483-5p, miR-4462, miR-4793-3p, and miR-668-5p with differences of more than one log fold change in expression between IBDU-CD and IBDU-UC. Expression of miR-4462, miR-483-5p, miR-4793-3p, and miR-668-5p was found to be lower (P<0.05), whereas expression of miR-182-5p and miR-3176 was found to be higher in IBDU-CD compared to IBDU-UC. Fig 2B shows a heat map presenting the expression values of the six probe sets. The IBDU-CD and IBDU-UC samples gathered into separate clusters in the principal component analysis (PCA) plot (Fig 2C) based on the expression of the six probe sets. When plotted with other samples, eight of the IBDU-IBDU samples showed a tendency to be either IBDU-CD, IBDU-UC, or IBDU-IBDU (Fig 2D). Within the IBDU-UC cluster, samples were also grouped as adult or paediatric, indicating the need for additional analysis based on age at diagnosis.

thumbnail
Fig 2. Differential probe set expression analysis between IBDU-CD and IBDU-UC samples.

(A) A summary plot showing probe sets highly expressed or lowly expressed in IBDU-CD compared to IBDU-UC represented as blue and red dots, respectively. Probe sets with -1> log FC <1 is highlighted as larger dots and respective names are indicated. (B) Heatmap showing microarray-based expression of 6 probe sets. Age group, duration of IBD, tissue location, sex and disease type are shown as colored bars at the top of the heatmap. Expression levels are normalized to row means. (C) PCA plot showing IBDU-CD and IBDU-UC clustered under unsupervised clustering based the expression levels of the 6 probe sets. (D) PCA plot with IBDU-CD, IBDU-UC and IBDU-IBDU samples based the expression levels of the 6 probe sets. Different symbols represent different age groups among the samples.

https://doi.org/10.1371/journal.pone.0297353.g002

In adult samples (n = 22), differential expression analysis revealed 499 probe sets with altered expression (adjusted P value <0.05) between IBDU-CD and IBDU-UC (Fig 3A and 3B). Among the differentially expressed probe sets, 10 of them (miR-1298-3p, x st ENSG00000239080, st ENSG00000239080 (snoU13), miR-451a, miR-628-5p, miR-4793-3p, miR-1273d, miR-3619-5p, U15B and hsa-miR-548x-3p) had a log fold change of -1<log FC>2 and were selected for further analysis. Unsupervised sample clustering based on the expression levels of the 10 probe sets revealed clustering of IBDU-CD and IBDU-UC samples (Fig 3C). The IBDU-IBDU samples showed a tendency towards either being in the IBDU-CD or IBDU-UC clusters (Fig 3D).

thumbnail
Fig 3. Differential probe set expression analysis between adult IBDU-CD and IBDU-UC samples.

(A) Summary plot showing probe sets highly expressed or lowly expressed in IBDU-CD adults compared to IBDU-UC adults represented as blue and red dots respectively. Probe sets with -1> log FC <2 is highlighted as larger dots and respective names are indicated. (B) Heatmap showing microarray-based expression of the 10 probe sets. Duration of IBD, tissue location, sex and disease type are shown as colored bars at the top of the heatmap. Expression levels are normalized to row means. (C) PCA plot showing IBDU-CD and IBDU-UC clustered under unsupervised clustering based the expression levels of the 10 probe sets. (D) PCA plot with IBDU-CD, IBDU-UC and IBDU-IBDU samples based the expression levels of the 10 probe sets.

https://doi.org/10.1371/journal.pone.0297353.g003

IBDU samples vs definite CD and UC diagnosed samples.

In the microarray dataset, we identified a list of probe sets that were differentially expressed between IBDU-CD and IBDU-UC samples from analyses 1 (with all samples) and 2 (with only adult samples); see also Table 2. Probe sets for miR-3176, miR-182-5p, miR-451a, miR-1298-3p, miR-548x-3p, and snoU13 had AUCs of above 80% in predicting IBDU-CD and IBDU-UC. Samples from patients who had been diagnosed as CD or UC at the time of disease onset were also evaluated using probe set candidates derived from analyses 1 and 2. The best candidate probe sets for differentiating IBDU-CD and IBDU-UC samples had relatively low AUC values in CD and UC sample discrimination. As example, miR-3176 had an AUC of 85.6% for differentiating between IBDU-CD and IBDU-UC, but between definite CD and UC diagnoses, the AUC was 68.8%. Similar trends were observed for probe sets miR-182-5p and miR-451a. Probe sets for miR-1298-3p, miR-548x-3p, and snoU13 showed AUCs above 80.0% in differentiating both IBDU-CD vs IBDU-UC and CD vs UC, when considered as individual probe sets (Table 2).

thumbnail
Table 2. Results of analysis 1 and 2 based on microarray expression values of differentially expressed probe sets between IBDU-CD and IBDU-UC samples.

https://doi.org/10.1371/journal.pone.0297353.t002

Validation study

After filtering for MirGeneDB 2.1 database (miR-182-5p, miR-451a, miR-628-5p, miR-1298-3p, miR-4793-3p and snoU13), six sncRNAs from analysis 1 and 2 were selected for further RT-qPCR based validation in a subset of 32 samples used in the discovery study. Relative expression of snoU13 was found to be significantly higher in adult IBDU-CD samples compared to adult IBDU-UC samples (median, IBDU-CD: 0.037 and IBDU-UC: 0.006, P = 0.028, Fig 4A), thus, in agreement with their relative expression levels in the microarray analysis. Expression levels of miR-182-5p, miR-451a, and snoU13 in IBDU-CD and IBDU-UC samples are shown in Fig 4B. Based on the relative expression values of miR-182-5p, miR-451a, and snoU13 along with the age at diagnosis and sex of the patients, a generalized linear modelling (GLM) model was created and tested on a set of 7 IBDU-IBDU cases. The AUC value for discriminating IBDU-CD and IBDU-UC using the GLM model was 78.57% (95% CI: 60–97) (Fig 5). Expression of miR-628-5p, miR-1298-3p, and miR-4793-3p were found to be below the detection limit of the RT-qPCR method and excluded from the analysis.

thumbnail
Fig 4. Expression levels determined by RT-qPCR.

Box plots showing RT-qPCR based relative expression levels of miR-182-5p, miR-451a, and ENSG00000239080 (SnoU13) in all IBDU-CD and IBDU-UC samples (A, B and C) and among only adult IBDU-CD and IBDU-UC samples (D, E and F).

https://doi.org/10.1371/journal.pone.0297353.g004

thumbnail
Fig 5. A model based on the short noncoding RNA’s expression, and the age and sex of the patient for prediction of CD and UC development from IBDU.

(A) PCA plot showing samples clustered based on the model into IBDU-CD (green) and IBDU-UC (pink). IBDU-IBDU samples shown in blue color show a tendency towards IBDU-CD or IBDU-UC. (B) ROC plot showing prediction based on 3 individual candidate’s expression and the model based on the relative expression values of miR-182-5p, miR-451a, and ENSG00000239080 (SnoU13) along with the age at diagnosis and sex of the patients.

https://doi.org/10.1371/journal.pone.0297353.g005

Discussion

IBDU can be perceived as a cumbersome diagnosis that can leave the patient in a limbo with an unsettled diagnosis that may turn into CD or UC. It also leaves the clinician with a significant tentative therapeutic strategy. Therefore, molecular biomarkers that, could help predicting a definite diagnosis at disease onset are of significant interest. Many studies on IBD have reported molecular profiles that can differentiate CD from UC [10,45]. However, those profiles have failed when applied to predicting CD or UC among IBDU cases [4648], probably because the studies were based on definite CD and UC cases. In this study, we based our biomarker discovery analysis on IBDU patients with well-annotated follow-up. In our discovery analysis, we found several sncRNA transcripts with high prediction rates some of which were validated in RT-qPCR analyses. This resulted in an RT-qPCR-based three-gene-profile composed of miR-182-5p, miR-451a, and snoU13, together with the age and sex of the patient with an AUC value of 79%. In a clinical setting, the three-gene RT-qPCR-based GLM model, would thus help to sub-classify almost 4 out of 5 IBD patients diagnosed with IBDU, who would then receive the appropriate treatment based on their correct diagnosis avoiding years of delay.

We included pinch biopsy material from IBDU patients who were followed for a minimum of three years and up to 23 years. 77% of them were reclassified as CD or UC during the follow-up period. It has been estimated that 80% of IBDU cases are reclassified as CD or UC within 8 years of disease course [4,49], thus, our IBDU patient cohort appears representative for IBDU cases. Considering that reclassification occurred for 77% of the patients also indicates that the remaining 23% may still encounter reclassification. It has been shown that a majority will remain as IBDU through their lifespan [4,50].

The sncRNAs identified to have significant predictive value in IBDU samples were also tested for their ability to discriminate CD from UC. In the microarray-based analysis, among the interesting candidates, miRNAs miR-1298-3p and miR-548x-3p, and the snoRNA snoU13 showed high discriminatory AUC values among adult IBDU cases, and a high prediction rate in discrimination of CD from UC. However, only the snoRNA could be detected and validated using RT-qPCR. In our three-gene RT-qPCR-based GLM model, we included miR-182-5p and miR-451a, which were detectable by RT-qPCR, and that showed predictive AUC values > 80% in the microarray analysis. As mentioned above, the GLM model based on the three gene-targets, resulted in an AUC of 79% in the prediction of IBDUs to turn into CD or UC, however, the GLM model failed to discriminate definite diagnosis of CDs from that of UCs. Our data suggest that dynamic changes of the molecular profiles of the CD and UC patients during the development of their disease may have affected the output of our pre-determined expression profile. On the other hand, the IBDU early disease stage may express a distinct molecular profile that is linked to only the IBDU diagnosis. We hypothesize that the molecular signatures of CD and UC can evolve over time among patients initially diagnosed with IBDU, and thus may exhibit distinct gene expression profiles at different stages during the disease course. Indeed, we noted that IBDU samples with no apparent phenotype of IBDU-CD or IBDU-UC have a distinct molecular profile different from IBD cases with definite phenotypes of CD or UC, thus, supporting the assumption that the IBDU diagnosis comprise a particular entity different from IBDU-CD and IBDU-UC, here called IBDU-IBDU.

In the microarray analysis we found that the AUC values for each individual candidate probe set such as miR-182-5p, miR-451a and snoU13 were having AUC values above 80% in discriminating IBDU-CD from IBDU-UC, whereas the individual AUC values based on RT-qPCR expression values were reduced to below 70%. A possible explanation for this could be that in the validation using RT-qPCR method the expression values are normalized to the geometrical mean of a set of 5 reference candidates to reduce variation related to the technique.

Earlier studies have shown an age dependent variation in phenotypes for several diseases including IBD [5153]. In their review, Ruel et al. stated that the characteristics and natural history of IBD appear to vary depending on the age of onset [54]. Also, phenotypic expression profiles of late-onset IBD compared with those in the younger population were found to be different [55,56]. Therefore, the age dependent clustering of samples in our discovery study was not surprising. Age dependent expression differences of miRNAs and snoRNAs have also been reported by others [57,58].

Biological aspects of sncRNA target candidates

IBD is an inflammatory condition that is believed to be caused by compromised barrier function, where the epithelial barrier is penetrated by bacterial influx, and un-balanced immune response that insufficiently fight the infection. Generally, CD is characterized by inflammation throughout the gastrointestinal tract and by transmural inflammation, whereas UC is characterized by relapsing and remitting mucosal inflammation extending from the rectum to proximal segments of the colon. However, other parameters also contribute to the characteristics of the two diseases. Changes in molecular expression in IBD may be related to one or more of these processes. Based on our microarray data, the three target genes, miR-182-5p, miR-451a, and snoU13, in our GLM model, were all higher in IBDU-CD compared to IBDU-UC.

MiR-182-5p is one of three miRNAs in the miR-183/96/182 cluster. miR-182 has been reported to be strongly induced in B cells [59]. In miR-182 deficient mice, miR-182 was found to be involved in antibody response at early time-points during immunization when challenged with a T-cell-dependent antigen and suggested to be responsible for defective extrafollicular response [60]. In DSS-induced UC in mice, inhibition of miR-182-5p was reported to ameliorate intestinal barrier dysfunction [61]. One of the interesting targets of miR-182-5p in the relation to IBD, is Claudin-2, which is one of the tight junction proteins downregulated in inflammatory bowel disease [62]. Taken together, elevated miR-182-5p levels in IBD may contribute to both an unbalanced immune response and to disrupted epithelial barrier function.

MiR-451a has been shown to serve as a blood-derived biomarker in a variety of conditions, including colorectal cancer (CRC) and arthritis [63,64]. In the study by Juzenas et al., [65] of miRNAs in human peripheral blood, miR-451a was found most abundant in CD235a positive cells, which are considered erythrocytes, suggesting that the miR-451 levels may be directly related to bleeding or inflammation. Moret-Tatay et al. developed a panel of plasma miRNAs that included miR-451a to identify patients at high post-operative recurrence risk among CD [66]. In a study of bone marrow derived macrophages from miR-451 knock-out mice, it was reported that miR-451 and reactive oxygen species (ROS) functionally crosstalk to regulate macrophage oxidant stress [67]. Our finding that miR-451a is higher in IBDU-CD than IBDU-UC may reflect different inflammatory stages of the disease in terms of presence of erythrocytes, ROS, and macrophages.

The third candidate in our panel is the snoRNA with the Ensembl gene identifier ENSG00000239080 (snoU13), which is also known as LOC124905271 or small nucleolar RNA U13 [68]. SnoU13 is poorly described in the literature, but has been found to contribute to biomarker profiles [69,70]. Yang et al. found that the expression of snoRA15, snoRA41 and snoRD33 was upregulated in UC and in CRC tissues compared with matched non-cancerous tissues, with an increasing trend from healthy control to UC and CRC [27]. Exploring the functions of snoRNAs in IBD will be needed to better understand their role.

The presented study is exploratory with a limited number of patient cases. The unique sample set with numerous inclusion criteria limits the number of samples that were accessible to investigate in this study. However, the IBDU cases included were carefully followed over time and allowed identification of differentially expressed sncRNA. The number of paediatric patients was particularly few, which prevented detailed analysis of the paediatric group. The remaining group of IBDU-IBDU patients may ultimately settle with a CD or UC diagnosis, however, this may still take years to occur. This of course directs our study towards the need for biomarkers that can helps reclassifying them to CD or UC or an independent group with unique characteristics that could be defined as IBDU group, and thereby aid in providing the optimal treatment for the patients.

Conclusions

In summary, we have developed a prediction model based on the expression of three sncRNAs, and age and sex of the patient that can assist the clinician in properly classifying IBDU patients into CD or UC at disease onset. Our study also suggests that true IBDU patients may have different pathogenic mechanisms compared to IBDU patients later diagnosed with CD or UC. Further research based on a larger cohort of real scenario of IBD patients could help in understanding the distinct molecular pathways of IBDU subgroups. Different factors such as age, gender, and other disease characteristics should be thoroughly investigated in a larger patient cohort. Future developments on diagnostic strategies and personalized therapies for IBDU and possible development towards CD or UC are necessary, which will plausibly introduce biomarkers such as miRNAs and snoRNAs as promising components to the analysis tool kit.

Supporting information

S1 File. The primer sequences for all the short non-coding RNAs used in the RT-qPCR validation.

https://doi.org/10.1371/journal.pone.0297353.s001

(PDF)

Acknowledgments

We would like to thank laboratory technicians Christina Grønhøj, Department of pathology, Herlev hospital and Trine Møller, Bioneer A/S for their valuable technical assistance.

References

  1. 1. Geboes K, Colombel J-F, Greenstein A, Jewell DP, Sandborn WJ, Vatn MH, et al. Indeterminate colitis: A review of the concept—Whatʼs in a name? Inflamm Bowel Dis. 2008;14: 850–857. pmid:18213696
  2. 2. Dignass A, Van Assche G, Lindsay JO, Lémann M, Söderholm J, Colombel JF, et al. The second European evidence-based Consensus on the diagnosis and management of Crohn’s disease: Current management. J Crohn’s Colitis. 2010;4: 28–62. pmid:21122489
  3. 3. Dignass A, Lindsay JO, Sturm A, Windsor A, Colombel JF, Allez M, et al. Second European evidence-based consensus on the diagnosis and management of ulcerative colitis Part 2: Current management. J Crohn’s Colitis. 2012;6: 991–1030. pmid:23040451
  4. 4. Tontini GE. Differential diagnosis in inflammatory bowel disease colitis: State of the art and future perspectives. World J Gastroenterol. 2015;21: 21. pmid:25574078
  5. 5. Rinawi F, Assa A, Eliakim R, Mozer-Glassberg Y, Nachmias Friedler V, Niv Y, et al. The natural history of pediatric-onset IBD-unclassified and prediction of Crohn’s disease reclassification: a 27-year study. Scand J Gastroenterol. 2017;52: 558–563. pmid:28128677
  6. 6. Carvalho RS, Abadom V, Dilworth HP, Thompson R, Oliva-Hemker M, Cuffari C. Indeterminate colitis: A significant subgroup of pediatric IBD. Inflamm Bowel Dis. 2006;12: 258–262. pmid:16633047
  7. 7. Geboes K, De Hertogh G. Indeterminate colitis. Inflamm Bowel Dis. 2003;9: 324–331. pmid:14555917
  8. 8. Winter DA, Karolewska-Bochenek K, Lazowska-Przeorek I, Lionetti P, Mearin ML, Chong SK, et al. Pediatric IBD-unclassified is less common than previously reported; Results of an 8-year audit of the EUROKIDS registry. Inflamm Bowel Dis. 2015;21: 2145–2153. pmid:26164665
  9. 9. James JP, Riis LB, Malham M, Høgdall E, Langholz E, Nielsen BS. MicroRNA Biomarkers in IBD-Differential Diagnosis and Prediction of Colitis-Associated Cancer. Int J Mol Sci. 2020;21: 7893. pmid:33114313
  10. 10. James JP, Nielsen BS, Christensen IJ, Langholz E, Malham M, Poulsen TS, et al. Mucosal expression of PI3, ANXA1, and VDR discriminates Crohn’s disease from ulcerative colitis. Sci Rep. 2023;13: 18421. pmid:37891214
  11. 11. Iboshi Y, Nakamura K, Ihara E, Iwasa T, Akiho H, Harada N, et al. Multigene Analysis Unveils Distinctive Expression Profiles of Helper T-cell–related Genes in the Intestinal Mucosa that Discriminate Between Ulcerative Colitis and Crohnʼs Disease. Inflamm Bowel Dis. 2014;20: 1. pmid:24739631
  12. 12. Wu F, Dassopoulos T, Cope L, Maitra A, Brant SR, Harris ML, et al. Genome-wide gene expression differences in Crohnʼs disease and ulcerative colitis from endoscopic pinch biopsies: Insights into distinctive pathogenesis. Inflamm Bowel Dis. 2007;13: 807–821. pmid:17262812
  13. 13. Brand S. Crohn’s disease: Th1, Th17 or both? The change of a paradigm: new immunological and genetic insights implicate Th17 cells in the pathogenesis of Crohn’s disease. Gut. 2009;58: 1152–1167. pmid:19592695
  14. 14. Hampe J, Schreiber S, Shaw SH, Lau KF, Bridger S, Macpherson AJS, et al. A genomewide analysis provides evidence for novel linkages in inflammatory bowel disease in a large European cohort. Am J Hum Genet. 1999;64: 808–816. pmid:10053016
  15. 15. Mcgovern D, Kugathasan S, Cho JH. Genetics of Inflammatory Bowel Diseases HHS Public Access. Gastroenterology. 2015;149: 1163–1176. pmid:26255561
  16. 16. Wu F, Zhang S, Dassopoulos T, Harris ML, Bayless TM, Meltzer SJ, et al. Identification of microRNAs associated with ileal and colonic Crohn’s disease. Inflamm Bowel Dis. 2010;16: 1729–1738. pmid:20848482
  17. 17. Stylianou E. Epigenetics: the fine-tuner in inflammatory bowel disease? Curr Opin Gastroenterol. 2013;29: 370–377. pmid:23743674
  18. 18. Mukherji S, Ebert MS, Zheng GXY, Tsang JS, Sharp PA, Van Oudenaarden A. MicroRNAs can generate thresholds in target gene expression. Nat Genet. 2011;43: 854. pmid:21857679
  19. 19. Zwiers A, Kraal L, van de Pouw Kraan TCTM, Wurdinger T, Bouma G, Kraal G. Cutting Edge: A Variant of the IL-23R Gene Associated with Inflammatory Bowel Disease Induces Loss of MicroRNA Regulation and Enhanced Protein Production. J Immunol. 2012;188: 1573–1577. pmid:22262659
  20. 20. Mitchell PS, Parkin RK, Kroh EM, Fritz BR, Wyman SK, Pogosova-Agadjanyan EL, et al. Circulating microRNAs as stable blood-based markers for cancer detection. Proc Natl Acad Sci. 2008;105: 10513–10518. pmid:18663219
  21. 21. Ghafouri-Fard S, Eghtedarian R, Taheri M. The crucial role of non-coding RNAs in the pathophysiology of inflammatory bowel disease. Biomed Pharmacother. 2020;129: 110507. pmid:32768980
  22. 22. Ambros V. microRNAs: Tiny Regulators with Great Potential. Cell. 2001;107: 823–826. pmid:11779458
  23. 23. Lee RC, Feinbaum RL, Ambros V. The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993;75: 843–854. pmid:8252621
  24. 24. Makarova IA, Kramerov DA. Small nucleolar RNAs. Mol Biol (Mosk). 2007;41: 246–259. pmid:17514894
  25. 25. Prestayko AW, Tonato M, Busch H. Low molecular weight RNA associated with 28 s nucleolar RNA. J Mol Biol. 1970;47: 505–515. pmid:5418169
  26. 26. Shi J, Zhou T, Chen Q. Exploring the expanding universe of small RNAs. Nat Cell Biol. 2022;24: 415–423. pmid:35414016
  27. 27. Yang X, Li Y, Li L, Liu J, Wu M, Ye M. SnoRNAs are involved in the progression of ulcerative colitis and colorectal cancer. Dig Liver Dis. 2017;49: 545–551. pmid:28110922
  28. 28. Levine A, Koletzko S, Turner D, Escher JC, Cucchiara S, de Ridder L, et al. ESPGHAN revised porto criteria for the diagnosis of inflammatory bowel disease in children and adolescents. J Pediatr Gastroenterol Nutr. 2014;58: 795–806. pmid:24231644
  29. 29. Langholz E. Ulcerative colitis. An epidemiological study based on a regional inception cohort, with special reference to disease course and prognosis. Dan Med Bull. 1999;46: 400–415. pmid:10605619
  30. 30. Munkholm P. Crohn’s disease—occurrence, course and prognosis. An epidemiologic cohort-study. Dan Med Bull. 1997;44: 287–302. pmid:9233548
  31. 31. Busk PK. A tool for design of primers for microRNA-specific quantitative RT-qPCR. BMC Bioinformatics. 2014;15: 1–9. pmid:24472427
  32. 32. Balcells I, Cirera S, Busk PK. Specific and sensitive quantitative RT-PCR of miRNAs with DNA primers. BMC Biotechnol. 2011;11. pmid:21702990
  33. 33. Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3. pmid:16646809
  34. 34. Smyth GK, Ritchie ME, Law CW, Alhamdoosh M, Su S, Dong X, et al. RNA-seq analysis is easy as 1-2-3 with limma, Glimma and edgeR. F1000Research. 2018;5: 1–30. pmid:27441086
  35. 35. Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD, Kelso J. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinforma Appl NOTE. 2012;28: 882–883. pmid:22257669
  36. 36. Carvalho B. pd.mirna.4.0: Platform Design Info for Affymetrix miRNA-4_0. R package version 3.12.0. 2015.
  37. 37. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43: e47–e47. pmid:25605792
  38. 38. Fromm B, Billipp T, Peck LE, Johansen M, Tarver JE, King BL, et al. A Uniform System for the Annotation of Vertebrate microRNA Genes and the Evolution of the Human microRNAome. Annu Rev Genet. 2015;49: 213–242. pmid:26473382
  39. 39. Fromm B, Domanska D, Høye E, Ovchinnikov V, Kang W, Aparicio-Puerta E, et al. MirGeneDB 2.0: the metazoan microRNA complement. Nucleic Acids Res. 2020;48: D132–D141. pmid:31598695
  40. 40. Schmittgen TD, Livak KJ. Analyzing real-time PCR data by the comparative CT method. Nat Protoc. 2008;3: 1101–1108. pmid:18546601
  41. 41. Newie I, Søkilde R, Persson H, Grabau D, Rego N, Kvist A, et al. The HER2-Encoded miR-4728-3p Regulates ESR1 through a Non-Canonical Internal Seed Interaction. Bardoni B, editor. PLoS One. 2014;9: e97200. pmid:24828673
  42. 42. Buonpane C, Ares G, Benyamen B, Yuan C, Hunter CJ. Identification of suitable reference microRNA for qPCR analysis in pediatric inflammatory bowel disease. Physiol Genomics. 2019;51: 169–175. pmid:30978148
  43. 43. Liu L-L, Zhao H, Ma T-F, Ge F, Chen C-S, Zhang Y-P. Identification of Valid Reference Genes for the Normalization of RT-qPCR Expression Studies in Human Breast Cancer Cell Lines Treated with and without Transient Transfection. St-Pierre Y, editor. PLoS One. 2015;10: e0117058. pmid:25617865
  44. 44. Chekhonin E, Popov E, Popov Y, Gabova A, Romushkevich R, Spasennykh M, et al. High-Resolution Evaluation of Elastic Properties and Anisotropy of Unconventional Reservoir Rocks via Thermal Core Logging. Rock Mech Rock Eng. 2018;51: 2747–2759.
  45. 45. von Stein P, Lofberg R, Kuznetsov N V., Gielen AW, Persson J, Sundberg R, et al. Multigene Analysis Can Discriminate Between Ulcerative Colitis, Crohn’s Disease, and Irritable Bowel Syndrome. Gastroenterology. 2008;134: 1869–1881. pmid:18466904
  46. 46. Von Stein P. Inflammatory bowel disease classification through multigene analysis: Fact or fiction? Expert Review of Molecular Diagnostics. 2009. pp. 7–10. pmid:19099342
  47. 47. Janczewska I, Kapraali M, Saboonchi F, Nekzada Q, Wessulv Å, Khoshkar J, et al. Clinical application of the multigene analysis test in discriminating between ulcerative colitis and Crohn’s disease: A retrospective study. Scand J Gastroenterol. 2012;47: 162–169. pmid:22229803
  48. 48. Bjerrum JT, Nyberg C, Olsen J, Nielsen OH. Assessment of the validity of a multigene analysis in the diagnostics of inflammatory bowel disease. J Intern Med. 2014;275: 484–493. pmid:24206446
  49. 49. Meucci G. What is the incidence, prevalence, and natural history of indeterminate colitis? Inflamm Bowel Dis. 2008;14: S159–S160. pmid:18816765
  50. 50. Kellermann L, Riis LB. A close view on histopathological changes in inflammatory bowel disease, a narrative review. Dig Med Res. 2021;4: 3–3.
  51. 51. Cheng J, Tang X, Yan X, Huang G, Luo S, Zhou Z, et al. Latent autoimmune diabetes in youth shows greater autoimmunity than latent autoimmune diabetes in adults: Evidence from a nationwide, multicenter, cross-sectional study. Pediatr Diabetes. 2022;23: 578–587. pmid:35451144
  52. 52. Theodorakopoulou E, Yiu ZZN, Bundy C, Chularojanamontri L, Gittins M, Jamieson LA, et al. Early- and late-onset psoriasis: a cross-sectional clinical and immunocytochemical investigation. Br J Dermatol. 2016;175: 1038–1044. pmid:27459949
  53. 53. James JP, Nielsen BS, Langholz E, Malham M, Høgdall E, Riis LB. Estimating tissue-specific TNF mRNA levels prior to anti-TNFα treatment may support therapeutic optimisation in IBD patients. Scand J Gastroenterol. 2023; 1–9. pmid:37246424
  54. 54. Ruel J, Ruane D, Mehandru S, Gower-Rousseau C, Colombel J-F. IBD across the age spectrum—is it the same disease? Nat Rev Gastroenterol Hepatol. 2014;11: 88–98. pmid:24345891
  55. 55. Guariso G, Gasparetto M, Visonà Dalla Pozza L, D’Incà R, Zancan L, Sturniolo G, et al. Inflammatory Bowel Disease Developing in Paediatric and Adult Age. J Pediatr Gastroenterol Nutr. 2010;51: 698–707. pmid:20639778
  56. 56. Freeman HJ. Comparison of longstanding pediatric-onset and adult-onset Crohn’s disease. J Pediatr Gastroenterol Nutr. 2004;39: 183–186. pmid:15269625
  57. 57. Malham M, James JP, Jakobsen C, Hoegdall E, Holmstroem K, Wewer V, et al. Mucosal microRNAs relate to age and severity of disease in ulcerative colitis. Aging (Albany NY). 2021;13: 6359–6374. pmid:33647883
  58. 58. Steinbusch MMF, Fang Y, Milner PI, Clegg PD, Young DA, Welting TJM, et al. Serum snoRNAs as biomarkers for joint ageing and post traumatic osteoarthritis. Sci Rep. 2017;7. pmid:28252005
  59. 59. Pucella JN, Yen W-F, Kim M V., van der Veeken J, Socci ND, Naito Y, et al. miR-182 Is Largely Dispensable for Adaptive Immunity: Lack of Correlation between Expression and Function. J Immunol. 2015;194: 2635–2642. pmid:25672759
  60. 60. Li YF, Ou X, Xu S, Jin ZB, Iwai N, Lam KP. Loss of miR-182 affects B-cell extrafollicular antibody response. Immunology. 2016;148: 140–149. pmid:26849109
  61. 61. Xu Y, Yang J, Chen X, Deng J, Gong H, Li F, et al. MicroRNA-182-5p aggravates ulcerative colitis by inactivating the Wnt/β-catenin signaling pathway through DNMT3A-mediated SMARCA5 methylation. Genomics. 2022;114: 110360. pmid:35378241
  62. 62. Zeissig S, Bürgel N, Günzel D, Richter J, Mankertz J, Wahnschaffe U, et al. Changes in expression and distribution of claudin 2, 5 and 8 lead to discontinuous tight junctions and barrier dysfunction in active Crohn’s disease. Gut. 2007;56: 61–72. pmid:16822808
  63. 63. Zhang Z, Zhang D, Cui Y, Qiu Y, Miao C, Lu X. Identification of microRNA-451a as a Novel Circulating Biomarker for Colorectal Cancer Diagnosis. Biomed Res Int. 2020;2020. pmid:32908896
  64. 64. Li J, Gao X, Zhu W, Li X. Integrative Analysis of the Expression of microRNA, Long Noncoding RNA, and mRNA in Osteoarthritis and Construction of a Competing Endogenous Network. Biochem Genet. 2022;60: 1141–1158. pmid:34796409
  65. 65. Juzenas S, Venkatesh G, Hübenthal M, Hoeppner MP, Du ZG, Paulsen M, et al. A comprehensive, cell specific microRNA catalogue of human peripheral blood. Nucleic Acids Res. 2017;45: 9290–9301. pmid:28934507
  66. 66. Moret-Tatay I, Cerrillo E, Hervás D, Iborra M, Sáez-González E, Forment J, et al. Specific Plasma MicroRNA Signatures in Predicting and Confirming Crohn’s Disease Recurrence: Role and Pathogenic Implications. Clin Transl Gastroenterol. 2021;12: e00416. pmid:34695034
  67. 67. Ranjan R, Lee YG, Karpurapu M, Syed MA, Chung S, Deng J, et al. p47phox and reactive oxygen species production modulate expression of microRNA-451 in macrophages. Free Radic Res. 2015;49: 25–34. pmid:25287330
  68. 68. Tyc K, Steitz JA. U3, U8 and U13 comprise a new class of mammalian snRNPs localized in the cell nucleolus. EMBO J. 1989;8: 3113–3119. pmid:2531075
  69. 69. Huang R, Liao X, Li Q. Integrative genomic analysis of a novel small nucleolar RNAs prognostic signature in patients with acute myelocytic leukemia. Math Biosci Eng. 2022;19: 2424–2452. pmid:35240791
  70. 70. Di Pietro V , O’Halloran P, Watson CN, Begum G, Acharjee A, Yakoub KM, et al. Unique diagnostic signatures of concussion in the saliva of male athletes: The Study of Concussion in Rugby Union through MicroRNAs (SCRUM). Br J Sports Med. 2021;55: 1395–1404. pmid:33757972