Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A 7-Gene Signature Depicts the Biochemical Profile of Early Prefibrotic Myelofibrosis

Abstract

Recent studies have shown that a large proportion of patients classified as essential thrombocythemia (ET) actually have early primary prefibrotic myelofibrosis (prePMF), which implies an inferior prognosis as compared to patients being diagnosed with so-called genuine or true ET. According to the World Health Organization (WHO) 2008 classification, bone marrow histology is a major component in the distinction between these disease entities. However, the differential diagnosis between them may be challenging and several studies have not been able to distinguish between them. Most lately, it has been argued that simple blood tests, including the leukocyte count and plasma lactate dehydrogenase (LDH) may be useful tools to separate genuine ET from prePMF, the latter disease entity more often being featured by anemia, leukocytosis and elevated LDH. Whole blood gene expression profiling was performed in 17 and 9 patients diagnosed with ET and PMF, respectively. Using elevated LDH obtained at the time of diagnosis as a marker of prePMF, a 7-gene signature was identified which correctly predicted the prePMF group with a sensitivity of 100% and a specificity of 89%. The 7 genes included MPO, CEACAM8, CRISP3, MS4A3, CEACAM6, HEMGN, and MMP8, which are genes known to be involved in inflammation, cell adhesion, differentiation and proliferation. Evaluation of bone marrow biopsies and the 7-gene signature showed a concordance rate of 71%, 79%, 62%, and 38%. Our 7-gene signature may be a useful tool to differentiate between genuine ET and prePMF but needs to be validated in a larger cohort of “ET” patients.

Introduction

The Philadelphia-negative myeloproliferative neoplasms (MPNs) include essential thrombocythemia (ET), polycythemia vera (PV) and primary myelofibrosis (PMF) [1]. Recent studies have shown that a large proportion of patients classified as ET actually have early primary prefibrotic myelofibrosis (prePMF) [214]. Since prognosis and likely in the future also treatment of these entities are different, it is important to distinguish between them [8,11]. According to the WHO 2008 classification, bone marrow histology is a major component in the distinction between ET and prePMF [214]. However, the differential diagnosis between these two entities may be challenging and some studies have not been able to distinguish between them [1516]. Most lately, it has been argued that simple blood tests–the hemoglobin (Hb) concentration, the leukocyte count and plasma lactate dehydrogenase (LDH)—may be useful tools to separate genuine ET from early prePMF, the latter disease entity more often being featured by anemia, leukocytosis and elevated LDH [12]. Genome wide expression profiling data has been used to perform classification in patients with ET, PV and PMF [1719]. However, to our knowledge, no study has applied a supervised classification model to distinguish between genuine ET and early prePMF. Herein, we report that a simple gene signature–composed of a few genes–in our whole blood gene expression profiling design [2026] is able to separate true ET from early prePMF.

Materials and Methods

Seventeen patients with ET were enrolled in the study. All patients were diagnosed according to the World Health Organization (WHO) 2008 criteria [27] and followed in two institutions in Denmark. Most patients were on cytoreductive therapy, which for the large majority included hydroxyurea (HU). The study was performed in accordance with the Helsinki declaration and was approved by the local ethics committee.

Samples were collected in Paxgene tubes (Preanalytix, Hombrechtikon, Switzerland) and stored at room temperature for 24 hours, then at -20°C for minimum one day, and finally transferred to a -80°C freezer. The Paxgene Blood RNA kit (Qiagen, Franklin Lakes, NJ, USA) was applied to isolate total RNA from whole blood. Quantity of RNA was assessed with the NanoDrop spectrophotometer ND-8000 (NanoDrop Technologies), and RNA integrity was determined with the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA). 300 ng of purified RNA was converted to biotin-labeled amplified RNA (aRNA) using the Message-AmpTM III RNA amplification kit (Ambion, Austin, TX). Fragmented aRNA was hybridized to Affymetrix HG-U133 Plus 2.0 microarray chips (Affymetrix, Santa Clara, CA, USA).

Data preprocessing and statistical analysis of microarrays were done in the R statistical software (www.r-project.org). The robust multi-array average expression measure (rma) was used to perform background correction, normalization, and expression index calculation [28]. Only perfect match probes were used for data analysis. Data are available from Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo; accession no. GSE26049).

Gene expression values for all microarray chips were uploaded to DNA-chip Analyzer (dChip) software [29] and filtering of genes with high variation across all samples was performed. To uncover unknown patterns in the filtered data set, an unsupervised method, principal component analysis (PCA) was applied to the filtered gene list.

Plasma lactate dehydrogenase (LDH) and the leukocyte count at the time of diagnosis were used to divide patients into subgroups with different risks for a specific outcome. Because measurement of plasma LDH in all patients was done using different references, the LDH value required dichotomization into either normal range (genuine ET) or elevated range (prePMF), taking into account the assumption that the large majority of patients with genuine ET have normal or only slightly elevated LDH, whereas patients with prePMF exhibit elevated LDH values. Likewise, patients with leukocyte counts below or above 10.0 x 109/L were assigned to the group of patients with genuine ET or prePMF, respectively. The patients in the genuine ET or prePMF groups are defined as having either low or high risk of developing post-ET myelofibrosis or overt PMF, respectively.

Microarray data were used to develop a supervised classification model to predict the outcome of ET patients. Whole blood from patients with ET and PMF and from healthy controls were used for gene expression profiling. Only 1 of 9 patients with PMF received treatment. Comparing gene expression in PMF patients vs. controls, the 12 genes MPO, CEACAM8, ELA2, DEFA4, OLFM4, CRISP3, MS4A3, CEACAM6, CTSG, AZU1, HEMGN, and MMP8 are among the top 20 most significantly upregulated genes. None of these 12 genes is significantly deregulated in ET vs. controls. The remaining 8 top 20 genes are significantly deregulated in ET vs. controls and are therefore not considered relevant for classification [1925]. Genes deregulated in ET may not be predictive of fibrosis which is one of the diagnostic criteria for prePMF and PMF. Therefore, to separate patients with prePMF from patients with genuine ET, these 8 genes are not included in the gene signature.” In our previous paper [25], a distinct 5-gene signature consisting of AZU1, CTSG, ELA2, OLFM4, and DEFA4 identified patients with advanced MPN; however, in the present paper, we sought to identify a gene signature able to distinguish prePMF (PMF in early stage) from genuine ET. Therefore, to find a gene signature with classification performance in prePMF, these 5 genes were omitted from the 12-gene signature resulting in a 7-gene signature including MPO, CEACAM8, CRISP3, MS4A3, CEACAM6, HEMGN, and MMP8. Thus, the genes in the 7-gene signature are chosen based on gene expression results from patients with PMF vs. controls.

The performance of the 7-gene signature as a classifier in the ET patients was tested using support vector machine (SVM) and leave one out cross validation (LOOCV). Using LOOCV one sample is left out at a time and the rest of the samples is used for classification. Thus, the sample which is left out and classified is not involved in the training process. Gene expression values for the 17 ET patients for the selected 7 genes were uploaded to the SVM classification procedure in the statistical program R using the package e1071 to build a hyperplane to separate the genes in the data set with maximal margin. Classification and cross validation leaving one sample out at a time (LOOCV) were performed, a probability for prePMF was calculated, and the classification accuracy determined. The model strategy is outlined in Fig 1.

thumbnail
Fig 1. Outline of the classification procedure using SVM and LOOCV.

The classification model is built on the 7 genes. The optimal model was selected and classification accuracy determined.

https://doi.org/10.1371/journal.pone.0161570.g001

We also tried to perform supervised classification with SVM and LOOCV using the top 500 significant genes obtained from the t-test (genuine ET patients vs. prePMF patients) using both the leukocyte and LDH values obtained at the time of diagnosis to separate the genuine ET patients from the prePMF patients. The performance of the models was assessed from the accuracy values which were quite low, and the models may not be suitable for predicting development of prePMF (data not shown).

Bone marrow biopsies from the time of diagnosis from all 17 patients were evaluated by 4 hematopathologists with extensive experience in the field of MPN classification. Three bone marrow biopsies were not suitable for analysis and therefore excluded from the study. Furthermore, two of the 4 hematopathologists classified one of the ET bone marrow biopsies as unsuitable. Biopsies were fixed in formalin and embedded in paraffin. The sections were stained with hematoxylin and eosin and Gordon and Sweet for reticulin. The biopsies were evaluated strictly according to the WHO-defined morphological criteria [27]. The hematopathologists did not have access to any cytological material and was blinded to all clinical data except patient age. Five diagnoses were used for the histological evaluation: ET, PV, prePMF, overtPMF and “other”.

Literature research and gene ontology analysis were performed to describe the biological function of the 7 genes, and biological network analysis was done using the Igenuity® Pathway AnalysisTM Software (IPA, www.ingenuity.com). The IPA software includes the Ingenuity Knowledge Base (IKB) and the Global Molecular Network (GMN) [30]. IKB is a large curated database with a repository of known biological interactions and functional annotations created from peer-reviewed publications of individual studies of genes in human, mouse and rat. The GMN is a large network composed of thousands of genes and gene products that interact with each other. The global network algorithm uses the information from GMN and the user uploaded list of genes to create the optimal networks.

Ethics statement

The study was approved by The Regional Scientific Ethical Committees for Southern Denmark and was performed in accordance with the Helsinki Declaration. All patients provided written informed consent to participate in the study.

Results

The clinical parameters for all patients are summarized in Table 1.

Based on LDH values obtained at the time of diagnosis, 9 patients were assigned to the genuine ET group and 8 patients were assigned to the prePMF group. According to the LDH value, no difference in age or the JAK2V617F % between genuine ET and prePMF was observed. When patients were divided into groups according to the leukocyte value obtained at the time of diagnosis, 9 patients were assigned to the genuine ET group and 8 patients to the prePMF group; however, the composition of the patients in the LDH (Table 1) and leukocyte groups differed (S1 Table)

Of the 54,675 probe sets on the microarray, 1976 genes met the filtering criteria in dChip. Unsupervised PCA applied to the filtered gene list revealed a tendency for the prePMF patients to group together and the genuine ET patients to be more scattered (S1A and S1B Fig). These results suggest that the PCA may not be optimal to separate the two groups of patients.

To explore the performance of a supervised classification model using a predefined gene set, we selected the 7 genes that were among the top 20 most significantly upregulated genes in PMF and not significantly upregulated in ET and not included in the 5-gene signature [25]. The resulting genes included MPO, CEACAM8, CRISP3, MS4A3, CEACAM6, HEMGN, and MMP8 (S2 Table). In the classification model, the cut off limit for prePMF was set to 0.47 (8/17) which is the number of prePMF patients according to the LDH or leukocyte count divided by the total number of patients.

Using LDH as the biochemical variable, the resulting 7-gene signature correctly predicted 8 of 8 patients in the prePMF group (sensitivity 100%) and correctly classified 8 of 9 patients in the genuine ET group (specificity 89%) and showed a balanced accuracy of 94% (Fig 2).

thumbnail
Fig 2. Classification performance of the 7-gene signature in 17 ET patients using LOOCV and SVM.

The y-axis shows the probability of developing preMF. The x-axis shows the sample number. Red circles to the left of the vertical line represent patients in the prePMF group and blue circles to the right represent patients in the genuine ET group. Red circles above the horizontal line to the left are correctly predicted patients with prePMF and blue circles below the line to the right are correctly classified as genuine ET patients. Circles below the line to the left and above the line to the right are misclassified patients. Patients are divided according to the LDH value at the time of diagnosis. Balanced accuracy = 94%.

https://doi.org/10.1371/journal.pone.0161570.g002

A receiver operating characteristics (ROC) curve with confidence intervals is depicted in S2 Fig. The area under the ROC curve (AUC) is 0.958 showing an excellent separation of genuine ET and prePMF. Using the leukocyte value as the biochemical variable, the 7-gene signature resulted in 62% sensitivity and 78% specificity and showed a balanced accuracy of 70% (data not shown).

The performance of the 7-gene signature using LDH at the time of diagnosis as the biochemical variable was tested against the bone marrow evaluation from the hematopathologists. The concordance between the classification and the 4 hematopathologists was 71%, 79%, 62%, and 38% (Table 2).

thumbnail
Table 2. Concordance between classification and bone marrow evaluation.

https://doi.org/10.1371/journal.pone.0161570.t002

Functional analysis and literature research showed that the 7 genes are involved in key processes associated with the pathophysiology of MPNs (Table 3).

CRISP3, MMP8 and MPO are neutrophil granule proteins playing important roles in immune and inflammatory responses [20, 3134]. CEACAM6 and CEACAM8 are involved in cell adhesion, cellular invasiveness, angiogenesis, and inflammation [20, 3539]. MS4A3 is a hematopoietic cell cycle regulator [40], and HEMGN plays an important role in hematopoietic development and neoplasms and is also involved in differentiation and proliferation [4142].

To identify interactions of the 7 genes with each other and other genes, molecular network analysis was performed using IPA software. The resulting network consists of 34 genes both directly and indirectly connected to the 7 genes (Fig 3).

thumbnail
Fig 3. The figure illustrates the network from IPA software analysis.

The 7 genes are shown in red. The top functions of the network are inflammatory response, cellular movement and immune cell trafficking. Genes are represented as nodes and the relationship between genes are represented as edges. Solid lines with an arrow: acts on molecule; dashed lines: indirect interaction; solid line: direct interaction.

https://doi.org/10.1371/journal.pone.0161570.g003

The top functions of the genes in the resulting network were inflammatory response, cellular movement, and immune cell trafficking well known to be highly relevant in MPNs [2025]. Furthermore, NFKB and IL2, which act as hubs—i.e. interacting nodes (genes) with high network degrees assumed to correspond to essential gene products in the cell—in the network, are known to play central roles in inflammation and immune regulation [4345].

Discussion

Primary myelofibrosis is aligned along a biological continuum from the early neoplastic stage—prefibrotic myelofibrosis–to the advanced metastatic myelofibrosis stage–“myelofibrosis with myeloid metaplasia” (MMM) [46], which is characterized by bone marrow failure with severe anemia and huge splenomegaly due to extramedullary hematopoiesis with seeding of CD34+ positive cells in the spleen, liver and elsewhere [1,4748]. The egress of CD34+ cells from the stem cell niches into the circulation has been compared to the metastatic process in solid tumors [49], in which a large number of chemokines, cytokines and proteolytic enzymes are deeply involved [2326, 5053]. In this process–as well as in the progression of myelofibrosis from early cancer stage to the advanced cancer stage (MMM)–inflammation and immune deregulation is today considered of major importance [2123, 5357]. Thus, the MPNs have been described as “A Human Inflammation Model for Cancer Development” [55], in which chronic inflammation is considered to be the driving force for clonal evolution, increasing genomic instability and additional mutations with ensuing leukemic transformation [5356]. In recent years, several studies and reviews have supported the concept of chronic inflammation as a highly important factor for disease progression in MPNs as evidenced by a marked increase in several inflammatory cytokines [53]. Furthermore, whole blood transcriptional profiling studies have shown a massive deregulation of inflammation and immune genes and oxidative stress /antioxidative stress genes as well [2123,26].

The concept “Early Prefibrotic Myelofibrosis” has since 2008 been included in the WHO-classification of MPNs [27]. However, several studies have shown that this disease entity may be difficult to diagnose due to lack of consistency and reproducibility of classical bone marrow features in ET, PV and PMF [6,16] for reasons discussed elsewhere [13]. Thus, bone marrow histology has been criticized for insufficiency in discriminating between the 2 entities and inter-observer variability has been a concern [6]. Regarding genuine ET this entity is histopathologically featured by no or only slight increase in age-matched cellularity, no significant increase in granulo- and erythropoiesis, prominent large to giant mature megakaryocytes with hyperlobulated or deeply folded nuclei, dispersed or loosely clustered in the marrow space and no or very rarely minor increase in reticulin fibers [8]. On the contrary, early prePMF stage is among others characterized by a marked increase in age-matched cellularity with a pronounced proliferation of granulopoiesis and a reduction of erythroid precursors, prominent large to giant atypical megakaryocytes with hyperchromatic nuclei often arranged in dense clusters, and no or no significant increase in reticulin fibers [8]. Clinically, patients with genuine ET have–in general–no enlargement of the spleen, whereas patients with prePMF more often have an enlarged spleen [8,9,11,12]. Importantly, enlargement of the spleen in genuine ET and prePMF reflects extramedullary hematopoiesis due to egress of CD34+ cells and progenitors from the bone marrow into the circulation to seed preferentially in the spleen and liver. This process is intimately associated with an altered proteolytic micromilieau consequent to in vivo activation of granulocytes and release of their neutrophil granule contents, including e.g. elastase and other proteolytic enzymes [50,51]. This “metastatic “process may not only be driven by clonal activation of neutrophil granule release, elicited by e.g. the JAK2V617F-mutation per se [51] but also likely mediated by an inflamed bone marrow milieau [5356]. Thus, today chronic inflammation is considered to be the driving force behind clonal evolution in MPNs [5357]. Of note, we have previously shown that patients with PMF exhibit a distinct 5-gene signature composed of several genes of importance for neutrophil granule release (e.g. elastase, defensins) [25]. Furthermore, we have also shown that patients with MPNs are characterized by upregulation of a large number of inflammation and oxidative stress genes [2223,26] supporting the contention that chronic inflammation may have a major role in MPN pathogenesis [5357], including the process by which CD34+ cells migrate from the bone marrow to seed in the spleen and liver [2223,2526].

When considering some of the previous studies on genuine ET versus early prePMF, it remains a concern that several patients with genuine ET actually had an enlarged spleen and leukocytosis as well [89,11]. If strictly adhering to the histopathological description of genuine ET versus early prePMF, patients with genuine ET have no significant granulopoiesis in the bone marrow8 and accordingly should have no or no significant leukocytosis and no significant splenomegaly, when taking into account that splenic enlargement implies extramedullary hematopoiesis due to egress of CD34+ cells and progenitors from the bone marrow consequent to enhanced granulopoiesis with in vivo activation of granulocytes and ensuing release of neutrophil granule contents [5051]. Regardless, a proportion of genuine ET patients in these studies had leukocytosis, significant increase in circulating CD34+ cells and enlarged spleens (the latter accounting for about 15–20%) [89,11] but still major findings included significant differences for leukocyte count, platelet count, serum LDH, circulating CD34 cell count, incidence of palpable splenomegaly, and frequency of grade 1 BM fibrosis–all variables being greater in early/prePMF than ET [9].

Our present study adds highly important novel information to the above findings. Firstly, it has unraveled a unique gene signature composed of 7 genes (MPO, CEACAM8, CRISP3, MS4A3, CEACAM6, HEMGN, and MMP8), which are capable of distinguishing genuine ET from prePMF with a high sensitivity (100%), specificity (89%) and balanced accuracy (94%) when using LDH at the time of diagnosis as the biochemical variable. By contrast, using the leukocyte value at the time of diagnosis as the biochemical variable, the 7-gene signature resulted in a much lower sensitivity (62%), specificity (78%) and balanced accuracy (70%). Secondly, when testing the performance of the 7-gene signature using LDH at the time of diagnosis as the biochemical variable against the bone marrow evaluation from the hematopathologists, the concordance between the classification and the hematopathologists was 71%, 79%, 62%, and 38%. Thirdly, the 7 genes are involved in key processes associated with the pathophysiology of MPNs, including inflammation and immune deregulation. Thus, CRISP3, MMP8 and MPO are neutrophil granule proteins that play very important roles in immune and inflammatory responses [20,3134]; CEACAM6 and CEACAM8 are involved in cell adhesion, cellular invasiveness, angiogenesis, and inflammation [20,3539] and MS4A3 is a hematopoietic cell cycle regulator [40]. The HEMGN gene plays an important role in hematopoietic development and neoplasms and is also involved in differentiation and proliferation [4142]. Fourthly, molecular network analysis disclosed 34 genes which both directly and indirectly were connected to the 7 genes. Highly interestingly, the top functions of the genes were inflammatory response, cellular movement, and immune cell trafficking–all considered of utmost importance in the pathogenesis and pathophysiology of MPNs [2026]. To this end, NFKB and IL2 –in the network acting as hubs i.e. interacting nodes with high network degrees assumed to correspond to essential gene products in the cell, are known to play central roles in inflammation and immune regulation [4345].

Fifthly, plasma LDH was found to associate much better to our 7-gene signature than the leukocyte count, reflecting plasma LDH to be a more robust biochemical marker of incipient early myelofibrotic transformation than the leukocyte count. The design of our study does not allow us to explore this interesting observation in depth. However, it is intriguing to speculate if elevated plasma LDH levels in prePMF might be indicative of a more enhanced myeloproliferation and turnover of malignant cells than in patients with genuine ET. This interpretation is supported by the inferior survival of early prePMF patients as compared to those patients with genuine ET [9]. It should be noted that a recent study on the value of LDH in distinguishing PMF from the other MPNs displayed serum LDH to be of limited clinical utility [58]. Although serum LDH levels were higher in patients with PMF, levels were also increased in the majority of patients with ET and PV with significant overlap between the subgroups. Serum LDH correlated positively with higher leukocyte and platelet counts, and disease duration in PMF, suggesting LDH to be a biomarker for disease bulk and/or cellular proliferation. It was concluded that LDH lacks specificity for PMF–an observation consistent with the concept of a biological and phenotypic continuum between ET, PV and PMF [22,23,46,58]. In this perspective, our findings of LDH being a better marker than the leukocyte count to associate with our 7-gene signature of inflammation, immune deregulation, proliferation, angiogenesis and cell adhesion in the discrimination between the two entities are only supportive. Thus, elevated LDH may also indicate defective erythropoiesis with concurrent intramedullary and extramedullary hemolysis which are prone to occur in PMF-patients [5967], in particular in the early myelofibrosis stage, where immune deregulation seems to be more prevalent [59,6162,66]. Furthermore, the earliest stage in this biological continuum–genuine ET—also includes “triple-negative” ET patients with no JAK2V617F, CALR or MPL mutations and an exceedingly favourable prognosis, suggesting that these patients may have a less severe form of ET or, alternatively, that some of these “ET” patients may actually not have any overt MPN but rather thrombocytosis associated with a more benign process [6869]. By contrast, patients with early prePMF are already along the road towards “myelofibrosis with myeloid metaplasia” as assessed by more patients with anemia, leukocytosis, elevated LDH and palpable spleens [89,1112,46].

Our study has limitations and strengths. Firstly, it should be taken into account that our patients were not newly diagnosed but investigated at different time points after diagnosis. However, both LDH and leukocyte count were obtained at the time of diagnosis. Secondly, many patients were treated with HU at the time of microarray analysis, which potentially might have impacted the ‘‘global gene signature”. However, in our previous whole blood gene expression studies of MPN-patients on HU treatment, several genes were significantly and increasingly deregulated from the early MPN stage (ET/PV) to the advanced myelofibrosis stage, indicating that HU treatment does not seem to have a major impact upon the different genes being investigated (e.g. inflammation and immune genes) [2223,26]. Moreover, the 7-gene signature is developed on genes highly significantly deregulated in patients with PMF vs. control subjects with only 1 of 9 PMF patients receiving treatment. Thirdly, we used whole blood instead of e.g. isolated granulocytes, CD34+ cells and mononuclear cells, which have been used in many previous transcriptional profiling studies in MPNs. In the context of exploring distinct genes of value in the discrimination of genuine ET versus prePMF, we believe that such a discriminatory gene expression profile may be more reliably obtained from transcriptional profiling of whole blood than if isolated cell types were analysed. Indeed, our previous studies of whole blood transcriptional profiling are strongly supportive, since these studies–in addition to confirming gene signatures obtained by analyzing single cells by others but generally with much stronger signals–also have unravelled deregulation of several genes which may be involved in disease pathogenesis and progression [20,2126]. As being underscored elsewhere [26], our approach certainly also makes sense when considering that the ‘‘tumor tissue” being studied–whole blood–is composed of clonal ‘‘tumor” cells (being both myeloid cells, platelets, B-cells and T-cells) and non-clonal cells, including immune cells, which are all activated due to e.g. chronic inflammation and oxidative stress which are considered of crucial importance in MPN-disease pathogenesis [5357]. In this regard, whole blood transcriptional profiling is actually very similar to all other studies of gene signatures in tumor tissue. Accordingly, in the context of studying powerful gene signatures in the discrimination between genuine ET and early prePMF, our approach with whole blood transcriptional profiling is more a strength than a limitation. Fourthly, our study included a limited number of ET patients. However, the fact that we were able to isolate a 7-gene signature using sophisticated and highly informative gene methodologies in such a small cohort of patients is supportive of our gene signature possessing a high discriminatory power. However, this observation needs to be demonstrated and validated in larger cohorts of ET-patients using our whole blood gene expression profiling design.

In conclusion, we have for the first time shown that a simple 7-gene signature—composed of genes involved in inflammation, immune regulation, cell proliferation, angiogenesis and cell adhesion—is able to discriminate between genuine ET and early prePMF. Considering the concern in regard to reproducibility of histopathological findings to discriminate between these disease entities [6], further studies are urgently needed which by other methodologies aim at identifying and distinctly separating these subsets of MPN-patients. In this regard, a recent study by Pahl et al is of particular importance, showing quantitative NF-E2 immunohistochemistry of bone marrow biopsies of MPN patients with thrombocytosis to be of value in distinguishing ET from early prePMF [70]. Studies are in progress to validate our results in larger cohorts of “ET” patients with concomitant NF-E2 immunohistochemistry on bone marrow biopsies. Hopefully, this combinatorial approach using whole blood gene expression profiling together with bone marrow NF-E2 immunohistochemistry may yield novel important information to be included in decision making on treatment and prognosis of patients with ET and prePMF.

Supporting Information

S1 Fig. Principal component analysis on the 17 ET patients.

The analysis was performed on the 1976 genes with highest variation across all samples. Blue circles: genuin ET; red circles: pre-MF. (A) Patients are divided according to the LDH value. (B) Patients are divided according to the leukocyte value.

https://doi.org/10.1371/journal.pone.0161570.s001

(TIF)

S2 Fig. Receiver operator characteristics (ROC) curve.

The ROC curve shows the performance of the 7-gene signature. The area under the ROC curve (AUC) is 0.958 showing an excellent separation of genuine ET and prePMF.”

https://doi.org/10.1371/journal.pone.0161570.s002

(TIF)

S1 Table. Patients are divided into the genuine ET or pre-PMF group according to the leukocyte value.

BU = busulphan; ANA = anagrelide; HU = hydroxyurea; IFN = interferon-alpha. Age and V617F are mean and range at the time of blood sampling for gene expression profiling studies. No significant difference in age and V617F % was observed. Leukocyte values at the time of diagnosis.

https://doi.org/10.1371/journal.pone.0161570.s003

(DOCX)

S2 Table. Expression profiling of the 7 genes in patients with PMF.

Fold changes and false discovery rates (FDR) for the 7 genes in whole blood from patients with PMF compared to controls. FC = false discovery rate; FDR = false discovery rate.

https://doi.org/10.1371/journal.pone.0161570.s004

(XLSX)

Acknowledgments

We are indebted to our patients who allowed drawing and analyzing blood samples.

Author Contributions

  1. Conceived and designed the experiments: HCH VS MT TAK JT ABM KN.
  2. Performed the experiments: HCH VS ABM JT ME IS HB MB LK.
  3. Analyzed the data: HCH VS MT TAK MB.
  4. Contributed reagents/materials/analysis tools: NB RDS TSL CR MKJ OWB HCH.
  5. Wrote the paper: HCH ABM VS.
  6. Crtical revision of the manuscript and final approval of the version to be published: HCH VS MB TSL MT CR ABM HB IS LK ME RDS TAK OWB NB MKJ JT KN.

References

  1. 1. Campbell PJ, Green AR. Mechanisms of disease. The myeloproliferative disorders. N Engl J Med 2006;355:2452–66. pmid:17151367
  2. 2. Thiele J, Kvasnicka HM, Zankovich R, Diehl V. Relevance of bone marrow features in the differential diagnosis between essential thrombocythemia and early stage idiopathic myelofibrosis. Haematologica. 2000;85(11):1126–34. pmid:11064463
  3. 3. Thiele J, Kvasnicka HM, Zankovich R, Diehl V. Clinical and morphological criteria for the diagnosis of prefibrotic idiopathic (primary) myelofibrosis. Ann Hematol. 2001;80(3):160–5. pmid:11320901
  4. 4. Thiele J, Kvasnicka HM, Zankovich R, Diehl V. Early-stage idiopathic (primary) myelofibrosis—current issues of diagnostic features. Leuk Lymphoma. 2002;43(5):1035–41. pmid:12148883
  5. 5. Kvasnicka HM, Thiele J. The impact of clinicopathological studies on staging and survival in essential thrombocythemia, chronic idiopathic myelofibrosis, and polycythemia rubra vera. Semin Thromb Hemost. 2006;32(4 Pt 2):362–71. pmid:16810612
  6. 6. Wilkins BS, Erber WN, Bareford D, Buck G, Wheatley K, East CL, et al. Bone marrow pathology in essential thrombocythemia: interobserver reliability and utility for identifying disease subtypes. Blood. 2008;111(1):60–70. pmid:17885079
  7. 7. Thiele J, Kvasnicka HM. The 2008 WHO diagnostic criteria for polycythemia vera, essential thrombocythemia, and primary myelofibrosis. Curr Hematol Malig Rep. 2009;4(1):33–40. pmid:20425436
  8. 8. Thiele J, Kvasnicka HM, Müllauer L, Buxhofer-Ausch V, Gisslinger B, Gisslinger H. Essential thrombocythemia versus early primary myelofibrosis: a multicenter study to validate the WHO classification. Blood. 2011;117(21):5710–8. pmid:21447832
  9. 9. Barbui T, Thiele J, Passamonti F, Rumi E, Boveri E, Ruggeri M, et al. Survival and disease progression in essential thrombocythemia are significantly influenced by accurate morphologic diagnosis: an international study. J Clin Oncol. 2011;29(23):3179–84. pmid:21747083
  10. 10. Finazzi G, Carobbio A, Thiele J, Passamonti F, Rumi E, Ruggeri M, et al. Incidence and risk factors for bleeding in 1104 patients with essential thrombocythemia or prefibrotic myelofibrosis diagnosed according to the 2008 WHO criteria. Leukemia. 2012;26(4):716–9. pmid:21926959
  11. 11. Barbui T, Thiele J, Carobbio A, Passamonti F, Rumi E, Randi ML, et al. Disease characteristics and clinical outcome in young adults with essential thrombocythemia versus early/prefibrotic primary myelofibrosis. Blood. 2012;120(3):569–71. pmid:22700720
  12. 12. Carobbio A, Finazzi G, Thiele J, Kvasnicka HM, Passamonti F, Rumi E, et al. Blood tests may predict early primary myelofibrosis in patients presenting with essential thrombocythemia. Am J Hematol. 2012;87(2):203–4. pmid:22237692
  13. 13. Barbui T, Thiele J, Vannucchi AM, Tefferi A. Problems and pitfalls regarding WHO-defined diagnosis of early/prefibrotic primary myelofibrosis versus essential thrombocythemia. Leukemia. 2013;27(10):1953–8. pmid:23467025
  14. 14. Rupoli S, Goteri G, Picardi P, Micucci G, Canafoglia L, Scortechini AR, et al. Thrombosis in essential thrombocytemia and early/prefibrotic primary myelofibrosis: the role of the WHO histological diagnosis. Diagn Pathol. 2015;10:29. pmid:25885405
  15. 15. Hussein K, Bock O, Theophile K, von Neuhoff N, Buhr T, Schlué J, et al. JAK2(V617F) allele burden discriminates essential thrombocythemia from a subset of prefibrotic-stage primary myelofibrosis. Exp Hematol. 2009;37(10):1186–93. pmid:19616600
  16. 16. Brousseau M, Parot-Schinkel E, Moles MP, Boyer F, Hunault M, Rousselet MC. Practical application and clinical impact of the WHO histopathological criteria on bone marrow biopsy for the diagnosis of essential thrombocythemia versus prefibrotic primary myelofibrosis. Histopathology. 2010;56(6):758–67. pmid:20546341
  17. 17. Berkofsky-Fessler W, Buzzai M, Kim M K-H, Fruchtman S, Najfeld V, Min DJ, et al. Transcriptional profiling of Polycythemia Vera identifies gene expression patterns both dependent and independent from the action of JAK2V617F. Clin Cancer Res. 2010;16:4339–4352. pmid:20601445
  18. 18. Guglielmelli P, Zini R, Bogani C, Salati S, Pancrazzi A, Bianchi E, et al. Molecular Profiling of CD34+ cells in idiopathic myelofibrosis identifies a set of disease-associated genes and reveals the clinical significance of wilms’ tumor gene 1 (WT1). Stem Cells 2007;25:165–173. pmid:16990584
  19. 19. Gnatenko DV, Zhu W, Xu X, Samuel ET, Monaghan M, Zarrabi MH, et al. Class Prediction models of thrombocytosis using genetic biomarkers. Blood. 2010;115:7–14. pmid:19773543
  20. 20. Hasselbalch HC, Skov V, Larsen TS, Thomassen M, Riley CH, Jensen MK, et al. High expression of carcinoembryonic antigen-related cell adhesion molecule (CEACAM) 6 and 8 in primary myelofibrosis. Leuk Res. 2011;35(10):1330–4. pmid:21470677
  21. 21. Skov V, Riley CH, Thomassen M, Larsen TS, Jensen MK, Bjerrum OW, et al. Whole blood transcriptional profiling reveals significant down-regulation of human leukocyte antigen class I and II genes in essential thrombocythemia, polycythemia vera and myelofibrosis. Leuk Lymphoma. 2013;54(10):2269–73. pmid:23302045
  22. 22. Skov V, Larsen TS, Thomassen M, Riley , Jensen MK, Bjerrum OW, et al. Whole-blood transcriptional profiling of interferon-inducible genes identifies highly upregulated IFI27 in primary myelofibrosis. Eur J Haematol. 2011;87:54–60. pmid:21447007
  23. 23. Skov V, Thomassen M, Riley CH, Jensen MK, Bjerrum OW, Kruse TA, et al. Gene expression profiling with principal component analysis depicts the biological continuum from essential thrombocythemia over polycythemia vera to myelofibrosis. Exp Hematol. 2012;40(9):771–780. pmid:22659388
  24. 24. Skov V, Larsen TS, Thomassen M, Riley CH, Jensen MK, Bjerrum OW, et al. Increased gene expression of histone deacetylases in patients with Philadelphia-negative chronic myeloproliferative neoplasms. Leuk Lymphoma. 2012;53(1):123–129. pmid:21806350
  25. 25. Hasselbalch HC, Skov V, Larsen TS, Thomassen M, Riley CH, Jensen MK, et al. Transcriptional Profiling of Whole Blood Identifies a Unique 5-Gene Signature for Myelofibrosis and Imminent Myelofibrosis Transformation. PLoS One. 2014;9(1):e85567. pmid:24454890
  26. 26. Hasselbalch HC, Thomassen M, Riley CH, Kjær L, Larsen TS, Jensen MK, et al. Whole blood transcriptional profiling reveals deregulation of oxidative and antioxidative defence genes in myelofibrosis and related neoplasms. potential implications of downregulation of Nrf2 for genomic instability and disease progression. PLoS One. 2014;9(11):e112786. pmid:25397683
  27. 27. Swerdlow SH, Campo E, Harris NL, Jaffe ES, Pileri SA, Stein H, et al. International Agency for Research on Cancer. WHO Classification of Tumours of Haematopoietic and Lymphoid Tissue, Fourth Edition. Geneva, Switzerland: World Health Organization; 2008.
  28. 28. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, et al: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 2003;4:249–64. pmid:12925520
  29. 29. Li C, Wong WH: Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application. Genome Biol 2001;2:1–11
  30. 30. Barabasi AL, Oltvai ZN: Network biology: understanding the cell's functional organization. Nat Rev Genet 2004;5:101–113. pmid:14735121
  31. 31. Borregaard N, Cowland JB. Granules of the human neutrophilic polymorphonuclear leukocyte. Blood 1997; 89: 3503–3521. pmid:9160655
  32. 32. Borregaard N, Theilgaard-Monch K, Sorensen OE, Cowland JB. Regulation of human neutrophil granule protein expression. Curr Opin Hematol 2001; 8: 23–27. pmid:11138622
  33. 33. Theilgaard-Monch K, Jacobsen LC, Borup R, Rasmussen T, Bjerregaard MD, Nielsen FC, et al. The transcriptional program of terminal granulocytic differentiation.Blood.2005; 105: 1785–1796. pmid:15514007
  34. 34. Hager M, Cowland JB, Borregaard N. Neutrophil granules in health and disease. J Intern Med 2010; 268: 25–34. pmid:20497300
  35. 35. Benchimol S, Fuks A, Jothy S, Beauchemin N, Shirota K, Stanners CP. Carcinoembryonic antigen, a human tumor marker, functions as an intercellular adhesion molecule. Cell 1989;57:327–34. pmid:2702691
  36. 36. Duxbury MS, Ito H, Benoit E, Ashley SW, Whang EE. CEACAM6 is a determinant of pancreatic adenocarcinoma cellular invasiveness. Br J Cancer. 2004 4;91(7):1384–90. pmid:15316565
  37. 37. Duxbury MS,Matros E, Clancy T, Bailey G, Doff M, Zinner MJ, et al. CEACAM6 is a novel biomarker in pancreatic adenocarcinoma and PanIN lesions. Ann Surg 2005;241(3):491–6. pmid:15729073
  38. 38. Kuespert K, Pils S, Hauck CR. CEACAMs: their role in physiology and pathophys iology. Curr Opin Cell Biol 2006;18(5):565–71. pmid:16919437
  39. 39. Maraqa L, Cummings M, Peter MB, Shaaban AM, Horgan K, Hanby AM, et al. Carcinoembryonic antigen cell adhesion molecule 6 predicts breast cancer recurrence following adjuvant tamoxifen. Clin Cancer Res 2008;14(2):405–11. pmid:18223215
  40. 40. Donato JL, Ko J, Kutok JL, Cheng T, Shirakawa T, Mao XQ, et al. Human HTm4 is a hematopoietic cell cycle regulator. J Clin Invest. 2002;109(1):51–8. pmid:11781350
  41. 41. Yang LV, Nicholson RH, Kaplan J, Galy A, Li L. Hemogen is a novel nuclear factor specifically expressed in mouse hematopoietic development and its human homologue EDAG maps to chromosome 9q22, a region containing breakpoints of hematological neoplasms. Mech Dev. 2001;104(1–2):105–11. pmid:11404085
  42. 42. Jiang J, Yu H, Shou Y, Neale G, Zhou S, Lu T, et al. Hemgn is a direct transcriptional target of HOXB4 and induces expansion of murine myeloid progenitor cells. Blood. 2010;116(5):711–9. pmid:20393131
  43. 43. Li Q, Verma IM. NF-kappaB regulation in the immune system. Nat Rev Immunol. 2002;2(10):725–734. pmid:12360211
  44. 44. Pikarsky E, Porat RM, Stein I, Abramovitch R, Amit S, Kasem S, et al. NF-kappaB functions as a tumour promoter in inflammation associated cancer. Nature. 2004;431(7007):461–466. pmid:15329734
  45. 45. Visekruna A, Volkov A, Steinhoff U. A key role for NF-κB transcription factor c-Rel in T-lymphocyte-differentiation and effector functions. Clin Dev Immunol. 2012;2012:239368. pmid:22481964
  46. 46. Barosi G, Rosti V, Bonetti E, Campanelli R, Carolei A, Catarsi P, et al. Evidence that prefibrotic myelofibrosis is aligned along a clinical and biological continuum featuring primary myelofibrosis.PLoS One. 2012;7(4):e35631. pmid:22536419
  47. 47. Tefferi A. Myelofibrosis with myeloid metaplasia. N Engl J Med 2000;342 (17):1255–1265. pmid:10781623
  48. 48. Barosi G, Viarengo G, Pecci A, Rosti V, Piaggio G, Marchetti M, et al. Diagnostic and clinical relevance of the number of circulating CD34(+) cells in myelofibrosis with myeloid metaplasia. Blood 2001;98(12):3249–3255. pmid:11719361
  49. 49. Hasselbalch HC. Myelofibrosis with myeloid metaplasia: the advanced phase of an untreated disseminated hematological cancer. Time to change our therapeutic attitude with early upfront treatment? Leuk Res 2009;33(1):11–18. pmid:18632152
  50. 50. Xu M, Bruno E, Chao J, Huang S, Finazzi G, Fruchtman SM, et al. Constitutive mobilization of CD34+ cells into the peripheral blood in idiopathic myelofibrosis may be due to the action of a number of proteases. Blood 2005;105(11):4805–4815.
  51. 51. Passamonti F, Rumi E, Pietra D, Della Porta MG, Boveri E, Pascutto C, et al. Relation between JAK2 (V617F) mutation status, granulocyte activation, and constitutive mobilization of CD34+ cells into peripheral blood in myeloproliferative disorders. Blood 2006;107(9):3676–3682. pmid:16373657
  52. 52. Lataillade JJ, Pierre-Louis O, Hasselbalch HC, Uzan G, Jasmin C, Martyré MC, et al. French INSERM and the European EUMNET Networks on Myelofibrosis.Does primary myelofibrosis involve a defective stem cell niche? From concept to evidence. Blood 2008;112(8):3026–3035. pmid:18669872
  53. 53. Hasselbalch HC. The role of cytokines in the initiation and progression of myelofibrosis. Cytokine Growth Factor Rev 2013;24(2):133–45. pmid:23415024
  54. 54. Hasselbalch HC. Perspectives on chronic inflammation in essential thrombocythemia, polycythemia vera, and myelofibrosis: is chronic inflammation a trigger and driver of clonal evolution and development of accelerated atherosclerosis and second cancer? Blood 2012;119(14):3219–25. pmid:22318201
  55. 55. Hasselbalch HC. Chronic Inflammation as a Promoter of Mutagenesis in Essential Thrombocythemia, Polycythemia Vera and Myelofibrosis. A Human Inflammation Model for Cancer Development? Leuk Res 2013;37(2):214–20. pmid:23174192
  56. 56. Hasselbalch HC. A Role of NF-E2 in Chronic Inflammation and Clonal Evolution in Essential Thrombocythemia, Polycythemia Vera and Myelofibrosis? Leuk Res 2013;38(2):263–6. pmid:23932394
  57. 57. Hermouet S, Vilaine M. The JAK2 46/1 haplotype: a marker of inappropriate myelomonocytic response to cytokine stimulation, leading to increased risk of inflammation, myeloid neoplasm, and impaired defense against infection? Haematologica 2011;96(11):1575–9. pmid:22058280
  58. 58. Beer PA, Campbell PJ, Green AR. Comparison of different criteria for the diagnosis of primary myelofibrosis reveals limited clinical utility for measurement of serum lactate dehydrogenase. Haematologica. 2010:95(11):1960–3. pmid:20534696
  59. 59. Gordon BR, Coleman M, Kohen P, Day NK. Immunologic abnormalities in myelofibrosis with activation of the complement system. Blood 1981;58:904–9. pmid:6913410
  60. 60. Hasselbalch HC, Berild D, Hansen OP. Red-cell sensitization in myelofibrosis. Scand J Haematol. 1984;32(2):179–82. pmid:6701463
  61. 61. Hasselbalch HC, Nielsen H, Berild D, Kappelgaard E. Circulating immune complexes in myelofibrosis. Scand J Haematol. 1985;34(2):177–80. pmid:3871960
  62. 62. Ester A, Feliu E, Aguilar JL, Jou JM, Martí M, Marín P, et al. Fagocitosis de complejos inmunes. Interes diagnostica y patogenetico en la mielofibrosis idiopatica. Sangre 1981;26:567–74.
  63. 63. Vellenga E, Mulder NH, The TH, Nieweg HO. A study of the cellular and humoral immune response in patients with myelofibrosis. Clin Lab Haematol 1982;4:239–46. pmid:7172606
  64. 64. Hasselbalch HC. Idiopathic myelofibrosis: a clinical study of 80 patients. Am J Hematol. 1990;34 (4):291–300. pmid:2195869
  65. 65. Barosi G, Magrini U, Gale RP. Does auto-immunity contribute to anemia in myeloproliferative neoplasms (MPN)-associated myelofibrosis? Leuk Res. 2010;34(9):1119–1120. pmid:20538336
  66. 66. Barcellini W, Iurlo A, Radice T, Imperiali FG, Zaninoni A, Fattizzo B, et al. Increased prevalence of autoimmune phenomena in myelofibrosis: relationship with clinical and morphological characteristics, and with immunoregulatory cytokine patterns. Leuk Res. 2013;37(11):1509–15. pmid:24080022
  67. 67. Barosi G. An immune dysregulation in MPN. Curr Hematol Malig Rep. 2014;9(4):331–9. pmid:25139710
  68. 68. Rotunno G, Mannarelli C, Guglielmelli P, Pacilli A, Pancrazzi A, Pieri L, et al. Impact of calreticulin mutations on clinical and hematological phenotype and outcome in essential thrombocythemia. Blood 2014;123(10):1552–1555. pmid:24371211
  69. 69. Koschmieder S, Mughal T, Hasselbalch HC, Barosi G, Valent P, Kiladjian JJ, et al. Myeloproliferative neoplasms and inflammation: Whether to target the malignant clone or the inflammatory process or both. Leukemia. 2016 May;30(5):1018–24. pmid:26854026
  70. 70. Aumann K, Frey AV, May AM, Hauschke D, Kreutz C, Marx JP, et al. Subcellular mislocalization of the transcription factor NF-E2 in erythroid cells discriminates prefibrotic primary myelofibrosis from essential thrombocythemia. Blood. 2013;122(1):93–9. pmid:23670178