Genomic profiling in hidradenitis suppurativa: InterOmics pipeline for DNA-RNA sequencing highlights HLA variants, keratin-associated mutations and extracellular matrix alterations as contributing factors to HS pathogenesis

Lucas André Cavalcanti Brandão; Ronald Rodrigues de Moura; Bruno Rodrigo Assunção; Cecilia Del Vecchio; Adamo Pio d’Adamo; Gudrun Ratzinger; Barbara Böckle; Nina Frischhut; Wolfram Jaschke; Matthias Schmuth; Muhammad Suleman; Angelo Valerio Marzano; Chiara Moltrasio; Paola Maura Tricarico; Sergio Crovella

doi:10.1371/journal.pone.0326458

Abstract

Hidradenitis suppurativa (HS) is a chronic autoinflammatory skin disorder with a complex genetic and molecular basis. To advance its characterization, we applied InterOmics, a novel bioinformatics pipeline integrating whole exome sequencing (WES) and RNA sequencing (RNA-seq), to saliva and skin biopsy samples from six HS patients. This approach enabled a comprehensive multiomics investigation, identifying disease-associated genetic variants and transcriptomic alterations. A key innovation of InterOmics is the Multiomics Variant Category, which classifies variants based on DNA and RNA data, capturing regulatory mechanisms such as allele-specific expression, RNA editing, nonsense-mediated decay, and gain-of-function mutations. Our findings highlight HLA gene variants and keratin-related mutations as potential contributors to HS pathogenesis. By bridging genomic and transcriptomic data, InterOmics enhances variant interpretation. This study underscores the power of multiomics-driven approaches in deciphering complex diseases, paving the way for precision medicine in HS.

Citation: Cavalcanti Brandão LA, Rodrigues de Moura R, Rodrigo Assunção B, Del Vecchio C, d’Adamo AP, Ratzinger G, et al. (2025) Genomic profiling in hidradenitis suppurativa: InterOmics pipeline for DNA-RNA sequencing highlights HLA variants, keratin-associated mutations and extracellular matrix alterations as contributing factors to HS pathogenesis. PLoS One 20(6): e0326458. https://doi.org/10.1371/journal.pone.0326458

Editor: Donatella Mentino, University of Bari Aldo Moro: Universita degli Studi di Bari Aldo Moro, ITALY

Received: October 5, 2024; Accepted: May 29, 2025; Published: June 20, 2025

Copyright: © 2025 Cavalcanti Brandão et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting Information files. However, the whole exome sequencing (WES) and RNA sequencing (RNA-seq) data generated in this study are available upon reasonable request. Due to ethical restrictions concerning human genomic data privacy, raw sequencing files cannot be publicly shared but can be accessed through a controlled request process. Researchers interested in obtaining the data should contact the corresponding author (sgrovella@qu.edu.qa) and provide documentation of institutional ethical approval for human genetic research. Data access will be granted upon agreement to comply with ethical and privacy regulations governing human genomic data sharing.

Funding: Our study has been supported by the EraPerMed 2018-137 BATMAN (Biomolecular Analyses for Tailored Medicine in Acne iNversa) project, by the project CNPq/MCTI Nº 10/2023 – UNIVERSAL from the Brazilian Council for Research and by the project RC16/18 (Ricerca Corrente) from IRCCS Burlo Garofolo, Trieste, Italy. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Hidradenitis suppurativa (HS) is a chronic, autoinflammatory skin disorder characterized by painful nodules, abscesses, and interconnected tunnels in apocrine gland-bearing areas. Affecting approximately 1–4% of the global population, HS has a complex pathogenesis involving both genetic predisposition and environmental triggers such as smoking, obesity, and mechanical friction [1]. Familial clustering, with about 35% of patients reporting a family history, suggests a strong hereditary component in disease susceptibility [2]. Despite advances in clinical management, the heterogeneous nature of HS and variable treatment responses highlight the need for a deeper molecular understanding to improve patient stratification and therapeutic strategies.

The rapid evolution of next-generation sequencing (NGS) technologies has significantly advanced our ability to profile the molecular mechanisms underlying complex diseases. Whole exome sequencing (WES) and whole genome sequencing (WGS) are now widely used to identify genetic abnormalities, including single nucleotide variants (SNVs) and copy number variations (CNVs), that contribute to disease pathogenesis [3]. On the other hand, RNA sequencing (RNA-seq) provides a dynamic snapshot of the transcriptome, offering insights into gene expression patterns, alternative splicing events, and post-transcriptional modifications that are crucial for understanding disease biology [4–7].

In the context of HS, transcriptomic analyses have largely focused on differentially expressed genes (DEGs) to identify dysregulated pathways involved in inflammation, tissue remodeling, and immune responses [8–11]. While DEG analysis remains valuable, recent advances in NGS technology have expanded RNA-seq applications beyond gene expression profiling, enabling the detection of genetic variants, transcript isoforms, and RNA editing events [12]. However, variant detection in RNA-seq data presents unique challenges. The variability in transcript isoforms, influenced by alternative splicing, makes it difficult to distinguish true genetic variants from isoform-specific alterations [13]. RNA editing, particularly adenosine-to-inosine (A-to-I) modifications, can also be misinterpreted as SNVs, complicating the analysis and interpretation of sequencing data [14]. Additionally, genes with low expression levels may have insufficient sequencing coverage, increasing background noise and making it more challenging to confidently call SNVs [15].

Given these challenges, integrating DNA and RNA sequencing data provides a more comprehensive and accurate molecular characterization of disease-associated variants. Here, we introduce InterOmics, a bioinformatics pipeline specifically designed to combine WES and RNA-seq data for a multiomic analysis of genetic and transcriptomic variations. InterOmics implements a systematic variant classification approach, linking genetic mutations with gene expression changes to uncover regulatory mechanisms that may drive disease phenotypes.

In this study, we applied InterOmics to DNA and RNA sequencing data from six HS patients, aiming to establish a structured framework for investigating the genetic basis of HS and its impact on transcriptomic profiles. By leveraging multiomic integration, we seek to identify novel molecular signatures associated with HS and explore potential pathways for personalized therapeutic interventions. This approach represents a significant step toward precision medicine applications in HS, providing a scalable model for studying other complex inflammatory diseases.

Materials and methods

Study design, patients and sequencing

This study was approved by the Ethical Committee “Comitato Etico Unico Regionale” (CEUR) of Friuli Venezia Giulia (FVG), Italy (RC 16/18, Prot. N.0001094 (14/01/2019), CEUR-2018-Sper-127-BURLO, IRCCS Burlo Garofolo). This Ethical Committee also covered the enrollment of patients from the European Consortium (Italy, France, Austria and Slovenia) for tailored diagnosis of patients affected by hidradenitis suppurativa, EraPerMed2018–137. The study included six unrelated Austrian patients (4 men and 2 women, median age 41.5 years, range 29–67 years) diagnosed with moderate-to-severe HS. All participants provided written informed consent prior to sample collection. All participants underwent saliva sampling and surgical removal of HS lesional areas through an 8 mm punch biopsy (Kai Medical, Seki City, Gifu, Japan), in January 2024, at the Department of Dermatology, Venereology, and Allergology, Medical University of Innsbruck, Austria.

The mRNA was extracted from lesional skin biopsies of the enrolled patients using the RNeasy Lipid Tissue Mini Kit (Qiagen, Milano, Italy) according to the manufacturer’s instructions. Genomic DNA was then extracted from saliva samples with the Oragene-DNA (Oragene®, Ottawa, Canada) kit, also following the manufacturer’s guidelines. Both DNA and mRNA samples were sent to Macrogen Europe (Amsterdam, Netherlands) for sequencing.

The Exome Sequencing Analysis achieved an average coverage of 150x, employing the Illumina® SureSelect Human V7 Kit (San Diego, CA, USA) for library preparation and sequencing on the Illumina® HiSeq 2500 System (San Diego, CA, USA). This process produced paired end reads of 150 base pairs in length. For RNA-seq, we sequenced an average of 60 million paired-end reads using the Illumina® TruSeq Stranded Library. The quality of both sequencing outputs was assessed using FastQC on the raw sequencing files (fastq.gz format), evaluating parameters such as average read length, quality scores for reads and bases, and the presence of adapters.

Whole Exome Sequencing (WES)

The WES reads were analyzed following the GATK Best Practices of the Broad Institute [16]. In summary, residual adapters, short reads (below 25 base pairs) and/or low-quality reads (Q < 20) were removed using TrimGalore (version 0.6.5; https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/). The remaining reads were aligned to the NCBI Genome Reference Consortium Human Build 38 (GRCh38) reference genome using the Burrows-Wheeler Aligner (BWA) algorithm [17]; marking and removal of duplicated reads, as well as base quality score recalibration, were carried out using Picard Tools v.2.7.0 (https://broadinstitute.github.io/picard/) and GATK v.4.1.2.0 [18].

RNA sequencing (RNA-Seq)

The reads from the raw sequencing files were processed using Trimmomatic v.0.39 [19] to trim Illumina adapters and to exclude reads shorter than 25 bases. The remaining reads were mapped to the National Center for Biotechnology Information (NCBI) human GRCh38 reference genome and sorted by coordinate using the STAR aligner v.2.7.6a [20]. The mapped reads were then counted with the gene quantification function, which assigns each mapped read to its respective gene, and the counts were normalized; both steps were performed with DESeq2 v.1.40.2 [21]. Gene and isoform expression levels were calculated for each individual using RSEM v.1.3.3 [22].

SNV and small insertion and deletion (Indel) detection

Variant calling was performed using Strelka2 v.2.9.10 [23]. SNVs and indels were filtered out using the following thresholds: QD < 2.0, FS > 60.0, MQ < 40.0, MQRankSum < −12.5, ReadPosRankSum < −8.0 and GQ < 20.0, for SNVs; and QD < 2.0, FS > 200.0, ReadPosRankSum < −20.0 and GQ < 20.0, for indels. Variant annotation was performed using ANNOVAR v.2019Oct24 [24].
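
The thresholds above can be read as a simple predicate over per-variant annotations. The following Python sketch is illustrative only and is not taken from the InterOmics code: it assumes each call has already been parsed into a dictionary keyed by the annotation names quoted in the text, that a variant is discarded when any single condition holds (the usual hard-filter convention), and that a missing annotation never triggers a filter. The function name `fails_hard_filter` is hypothetical.

```python
from typing import Mapping, Optional

# Thresholds quoted in the text: annotation -> (comparison, cutoff).
SNV_FILTERS = {
    "QD": ("<", 2.0),
    "FS": (">", 60.0),
    "MQ": ("<", 40.0),
    "MQRankSum": ("<", -12.5),
    "ReadPosRankSum": ("<", -8.0),
    "GQ": ("<", 20.0),
}
INDEL_FILTERS = {
    "QD": ("<", 2.0),
    "FS": (">", 200.0),
    "ReadPosRankSum": ("<", -20.0),
    "GQ": ("<", 20.0),
}


def fails_hard_filter(ann: Mapping[str, Optional[float]], is_indel: bool) -> bool:
    """Return True if the variant should be discarded.

    Assumption: one failing condition is enough to discard the variant,
    and annotations that are absent (None) are skipped.
    """
    filters = INDEL_FILTERS if is_indel else SNV_FILTERS
    for key, (op, cutoff) in filters.items():
        value = ann.get(key)
        if value is None:
            continue  # assumption: missing annotation does not trigger a filter
        if op == "<" and value < cutoff:
            return True
        if op == ">" and value > cutoff:
            return True
    return False
```

Under these assumptions, a call with FS = 100 would be discarded as an SNV (FS > 60.0) but retained as an indel (FS ≤ 200.0).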

Furthermore, we performed an ‘in silico reverse transcription’ (hereafter, the cDNA dataset) of the mapped RNA-seq BAM files, applying the same steps used for the WES analysis, including variant calling and annotation, with one additional step that splits N-CIGAR reads into multiple alignments after marking and removing duplicated reads.
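
To illustrate what splitting an N-CIGAR read means, the Python sketch below breaks a spliced alignment at each `N` (reference skip) operation and returns one segment per exon-aligned block. It is a simplified illustration, not the tool used in the pipeline: the function name `split_n_cigar` is hypothetical, positions are treated as plain integers, and refinements performed by production tools (for example clipping of overhangs or mapping-quality adjustment) are deliberately left out.

```python
import re
from typing import List, Tuple

# CIGAR operations as defined in the SAM format.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")
# Operations that advance the position on the reference.
REF_CONSUMING = set("MDN=X")


def split_n_cigar(pos: int, cigar: str) -> List[Tuple[int, str]]:
    """Split one spliced alignment into (reference_start, cigar) segments.

    Each N operation closes the current segment; the next segment starts
    on the reference right after the skipped region.
    """
    segments: List[Tuple[int, str]] = []
    seg_start = pos      # reference start of the segment being built
    ref_pos = pos        # running reference coordinate
    current: List[str] = []
    for length, op in CIGAR_OP.findall(cigar):
        n = int(length)
        if op == "N":
            if current:
                segments.append((seg_start, "".join(current)))
            current = []
            ref_pos += n
            seg_start = ref_pos
        else:
            current.append(f"{n}{op}")
            if op in REF_CONSUMING:
                ref_pos += n
    if current:
        segments.append((seg_start, "".join(current)))
    return segments
```

For example, a read starting at position 100 with CIGAR `50M1000N50M` yields two segments, one starting at 100 and one starting at 1150, each with CIGAR `50M`.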

Integrating WES and RNA-Seq data

We used GRCh38 as the reference genome for both whole exome sequencing (WES) and RNA sequencing (RNA-seq) alignment. This choice was made to balance consistency, transcriptome completeness and accurate variant detection: it keeps variant calling and transcript quantification consistent and provides a standardized reference point for comparing genetic and transcriptomic variations across patients. The pipeline generates a WES-derived variant profile by mapping sequencing reads to GRCh38 and identifying deviations from this reference genome in each individual.

In parallel, RNA-seq reads are aligned to GRCh38-based transcript annotations, allowing us to quantify gene expression and to identify transcriptomic variants such as allele-specific expression (ASE), i.e., variants that are heterozygous in the WES data but homozygous for the alternative allele in the RNA-seq data, and RNA editing (RNAe) events, i.e., variants that are homozygous for the alternative allele in the WES data and heterozygous in the RNA-seq data.
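
The two genotype-based definitions can be sketched as a comparison of the WES and RNA-seq genotypes at the same site. The Python example below is a minimal illustration under stated assumptions: genotypes are given as VCF-style GT strings for a diploid call, only sites with a single alternative allele are considered, and the function names `zygosity` and `classify_genotype_pair` are hypothetical rather than part of InterOmics.

```python
from typing import Optional


def zygosity(gt: str) -> str:
    """Map a diploid VCF-style GT string (e.g. '0/1', '1|1') to a zygosity label."""
    alleles = gt.replace("|", "/").split("/")
    if len(alleles) != 2 or "." in alleles:
        return "unknown"
    a, b = alleles
    if a == b == "0":
        return "hom_ref"
    if a == b:
        return "hom_alt"
    if "0" in (a, b):
        return "het"
    return "other"  # two different alternative alleles; not handled in this sketch


def classify_genotype_pair(wes_gt: str, rna_gt: str) -> Optional[str]:
    """Return 'ASE', 'RNAe' or None following the definitions in the text."""
    wes, rna = zygosity(wes_gt), zygosity(rna_gt)
    if wes == "het" and rna == "hom_alt":
        return "ASE"   # heterozygous in DNA, only the alt allele seen in RNA
    if wes == "hom_alt" and rna == "het":
        return "RNAe"  # homozygous alt in DNA, heterozygous in RNA
    return None
```

Sites where both datasets agree (for example heterozygous in both) fall into neither category and return `None`.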

With both WES and RNA-seq data, it is also possible to infer the occurrence of nonsense-mediated mRNA decay (NMD) [25] from gene expression. In essence, a gene is expected to be targeted by NMD if it carries a premature stop codon or a frameshift mutation and is less expressed, or not expressed, in the patient compared with a reference expression level for the same tissue. As the reference, we obtained the transcripts per million (TPM) values for skin not exposed to the sun from the GTEx database using the R “hpar” package v.1.48.0 [26].

Similarly, one can infer gain-of-function (GoF) mutations by selecting variants that are not intronic, intergenic, synonymous SNVs or frameshift indels and that lie in genes with medium or high expression in the patient, while the same gene is below the cutoff or less expressed in the reference expression dataset.
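
The two expression-based inferences (NMD and GoF) can be sketched as a comparison between the patient's TPM and the reference TPM for the same gene. The Python example below is a hedged illustration only. The TPM cutoffs that define the expression categories are placeholders, since the text does not state them; the consequence labels follow ANNOVAR-style naming as an assumption; and the annotation-based selection of GoF candidates is left to the caller through the hypothetical `gof_eligible` flag.

```python
from typing import Optional

# Placeholder TPM cutoffs (assumption): the article does not report the values used.
CATEGORY_CUTOFFS = [(1.0, "not_expressed"), (10.0, "low"), (100.0, "medium")]
RANK = {"not_expressed": 0, "low": 1, "medium": 2, "high": 3}

# Consequences treated as truncating (ANNOVAR-style labels, assumed).
TRUNCATING = {"stopgain", "frameshift deletion", "frameshift insertion"}


def expression_category(tpm: float) -> str:
    """Bin a TPM value into an expression category using the placeholder cutoffs."""
    for cutoff, label in CATEGORY_CUTOFFS:
        if tpm < cutoff:
            return label
    return "high"


def infer_expression_class(consequence: str, gof_eligible: bool,
                           patient_tpm: float, reference_tpm: float) -> Optional[str]:
    """Return 'NMD', 'GoF' or None for one variant.

    NMD: truncating variant in a gene less expressed in the patient than in the reference.
    GoF: eligible variant in a gene with medium/high patient expression that
         exceeds the reference category.
    """
    patient = RANK[expression_category(patient_tpm)]
    reference = RANK[expression_category(reference_tpm)]
    if consequence in TRUNCATING and patient < reference:
        return "NMD"
    if gof_eligible and patient >= RANK["medium"] and reference < patient:
        return "GoF"
    return None
```

Comparing categories rather than raw TPM values keeps the sketch close to the wording of the text ("less or not expressed", "medium or high expression"); a fold-change criterion would be an equally plausible design, but it is not described in the article.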

These four categories are classified as Multiomics Variants, since their assignment depends on both WES and RNA-seq data.

Results and discussion

The InterOmics workflow integrates genomic and transcriptomic data through a systematic three-stage process. In the first stage, raw sequencing files undergo quality control, cleaning, alignment, and annotation, establishing a foundation for reliable variant detection. The second stage organizes the processed data in a PostgreSQL database (v.16.0), enabling efficient data retrieval and analysis. The third stage implements the integration protocol, designed to obtain an individual multiomics variation profile (Fig 1).

Fig 1. InterOmics pipeline: A step-by-step approach to personalized genomic profiling.

The InterOmics pipeline begins with the collection of biological samples from the individual being tested. It then progresses to the simultaneous analysis of Whole Exome Sequencing (WES) and RNA Sequencing (RNA-seq). Following this parallel analysis, the data are integrated to identify Single Nucleotide Variants (SNVs), Copy Number Variants (CNVs), insertions and deletions (indels), and expression isoforms. This comprehensive approach culminates in the creation of a personalized genomic and transcriptomic profile for the individual, offering detailed insights into their unique genetic makeup. In the figure, the Multiomics Category variants are: allele-specific expression (ASE); predicted nonsense-mediated decay (NMD); RNA editing events (RNAe); and predicted gain-of-function (GoF).

https://doi.org/10.1371/journal.pone.0326458.g001

The integration of WES and RNA sequencing data yielded high-quality genomic information, with approximately 98% of reads successfully aligning to the reference genome. Table 1 shows the distribution of WES and RNA-seq variants in the HS patients.

Table 1. Variant counts according to NGS (WES and RNA-seq) findings in the six HS patients. The counts for the multiomics categories (ASE and RNAe) are also shown. WES: whole exome sequencing; RNA-seq: RNA sequencing; ASE: allele-specific expression; RNAe: RNA editing events; NMD: nonsense-mediated mRNA decay; GoF: gain-of-function variant.

https://doi.org/10.1371/journal.pone.0326458.t001

ASE and RNAe variants, though less numerous, could provide insights into gene regulation in HS [27]. We identified some ASE and RNAe variants in genes involved in inflammatory responses, especially in the HLA loci. S1 and S2 Tables in S1 File show all ASE and RNAe variants, respectively, whereas Tables 2 and 3 show selected ASE and RNAe variants, respectively, after applying the following filters:

excluding intronic, intergenic, upstream and downstream variants;
excluding synonymous SNVs;
excluding variants with minor allele frequency (MAF) > 1%.

Table 2. List of filtered allele-specific expression (ASE) variants, i.e., variants for which only the alternative (alt) allele was found to be expressed, among the patients.

https://doi.org/10.1371/journal.pone.0326458.t002

Table 3. List of filtered RNA editing (RNAe) variants, i.e., variants whose genotype is heterozygous in the RNA but homozygous for the alternative (alt) allele in the DNA, among the patients.

https://doi.org/10.1371/journal.pone.0326458.t003
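
The three filters applied to the ASE and RNAe variants (region, synonymous SNVs, and MAF > 1%) can be sketched as follows. This Python example is illustrative and makes several assumptions that are not stated in the article: each variant is a dictionary with hypothetical keys `region`, `exonic_func` and `maf`, the labels follow ANNOVAR-style naming, and variants with no reported population frequency are retained.

```python
from typing import Any, Iterable, List, Mapping

EXCLUDED_REGIONS = {"intronic", "intergenic", "upstream", "downstream"}
MAF_MAX = 0.01  # variants with MAF > 1% are excluded


def keep_variant(variant: Mapping[str, Any]) -> bool:
    """Apply the three filters described in the text to one annotated variant."""
    if variant.get("region") in EXCLUDED_REGIONS:
        return False
    if variant.get("exonic_func") == "synonymous SNV":
        return False
    maf = variant.get("maf")
    # Assumption: a missing MAF (variant absent from the frequency database) is kept.
    if maf is not None and maf > MAF_MAX:
        return False
    return True


def filter_variants(variants: Iterable[Mapping[str, Any]]) -> List[Mapping[str, Any]]:
    """Return only the variants that pass all filters, preserving input order."""
    return [v for v in variants if keep_variant(v)]
```

Retaining variants without a reported MAF is a design choice made here for the sketch; treating them as rare is common practice, but the article does not say how such variants were handled.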

A third insight obtained by integrating WES and RNA-seq was the inference of NMD. The list of all NMD variants can be found in S3 Table in S1 File, while Table 4 summarizes only the possible NMD-causing variants with MAF < 0.01. All five variants with MAF < 0.01 were heterozygous; two of them lie in genes directly associated with keratin formation (rs2857667 and rs200049107), and the others are in genes associated with cell metabolism.

Table 4. List of NMD variants among the patients with minor allele frequency (MAF) lower than 1%.

https://doi.org/10.1371/journal.pone.0326458.t004

The fourth category comprises the possible GoF variants. Table 5 shows the number of GoF variants per sample, while S4 Table in S1 File lists the 87 GoF variants with MAF < 0.01 found among the patients. Patient AUT108 carries one heterozygous GoF deletion in the 3’ UTR of the COL19A1 gene (rs36112821) and shows higher expression of this gene in the skin.

Table 5. Number of possible Gain-of-Function (GoF) variants among the samples, according to their location in the gene.

https://doi.org/10.1371/journal.pone.0326458.t005

Numerous frameworks have been developed for the joint analysis of DNA and RNA sequencing data, each tailored to specific study designs and objectives. Expression Quantitative Trait Loci (eQTL) analysis pipelines (e.g., Matrix eQTL, FastQTL) seek to associate genetic variants with transcript abundance but typically require very large sample sizes (n > 200) to achieve sufficient statistical power and to adjust for multiple testing and population structure confounders. Similarly, unsupervised multi-omics clustering frameworks like MOFA and iClusterPlus are highly effective for large heterogeneous cohorts, enabling the discovery of disease subtypes or novel biological pathways by extracting latent factors across omics layers.

However, these tools are not easily applicable to small-scale, clinical investigations due to their sample size requirements and underlying assumptions. InterOmics was explicitly designed to address this gap, providing a variant-centric framework that enables individual-level genomic-transcriptomic integration. Rather than performing association analyses or unsupervised clustering, InterOmics classifies genetic variants based on their transcriptional consequences—such as allele-specific expression, RNA editing, nonsense-mediated decay, or gain-of-function—within the same subject.

This focus allows InterOmics to be particularly useful for small sample-size studies typical of pilot projects, rare disease cohorts, or early translational research. It tolerates sample sizes that would render statistical association methods underpowered and instead emphasizes biologically interpretable integration designed to inform hypothesis generation and clinical decision-making. Therefore, InterOmics complements existing frameworks, offering a unique solution when the research context involves a limited number of deeply characterized individuals.

The integration of whole exome sequencing (WES) and RNA sequencing (RNA-seq) has advanced our ability to analyze both genetic variation and gene expression, yet the clinical application of multiomics strategies remains limited. In this study, we introduced InterOmics, a systematic pipeline designed to merge WES and RNA-seq data, providing a more comprehensive characterization of disease-associated variants in HS. Unlike widely used multiomic approaches such as iCluster and MOFA, which focus on clustering molecular profiles, InterOmics was specifically developed to integrate genomic and transcriptomic data within a biological context, enabling a more refined classification of variants and their regulatory impact.

A key innovation of this study is the Multiomics Variant Category system, which categorizes variants based on their detection in both DNA and RNA data, allowing the identification of functionally relevant alterations. Our results reveal that allele-specific expression (ASE) variants were predominantly found in HLA genes, supporting recent evidence that specific HLA alleles may modulate susceptibility to HS [28,29]. Additionally, RNA editing (RNAe) variants, particularly adenosine-to-inosine (A-to-I) conversions, were identified, further emphasizing post-transcriptional regulation in HS pathogenesis.

Furthermore, we identified nonsense-mediated mRNA decay (NMD) variants associated with keratin-related genes (KRT83 and KRTAP2–4). These variants introduce premature stop codons, likely impairing keratin protein synthesis, which may contribute to tissue remodeling and skin barrier dysfunction in HS. These findings also support the proposed concept of HS as an autoinflammatory keratinization disease (AiKD) [30].

Conversely, gain-of-function (GoF) variants, particularly in regulatory regions such as the 3’ UTR, were observed. Notably, a single base deletion (rs36112821) in COL19A1, a key collagen-related gene, was linked to altered transcript stability, reinforcing the role of extracellular matrix remodeling in HS [31].

Our findings highlight the importance of integrating WES and RNA-seq data to capture not only genetic mutations but also their transcriptional consequences. The high concordance between genetic and transcriptomic alterations observed in this study suggests that certain variants actively modulate gene expression, emphasizing their potential functional relevance in HS pathogenesis.

Despite these promising insights, some limitations must be acknowledged. The relatively small patient cohort restricts the generalizability of our findings, highlighting the need for larger validation studies. Additionally, while InterOmics provides a robust framework for variant classification, functional assays will be essential to confirm the biological impact of the identified variants and their relevance to HS pathogenesis.

Conclusions

This study demonstrates the power of multiomics integration in deciphering the genetic and transcriptomic landscape of HS. The InterOmics pipeline enabled a systematic classification of genomic and transcriptomic variants, revealing novel molecular signatures associated with HS pathogenesis. Our findings underscore the role of HLA variants, keratin-associated mutations, and extracellular matrix alterations in disease progression, reinforcing the utility of multiomic-driven approaches in complex inflammatory disorders.

While this study establishes a foundation for integrating DNA and RNA sequencing in HS research, further investigations are necessary to translate these findings into clinical applications. Expanding the patient cohort will be critical to distinguishing disease-driving variants from individual-specific variations. Additionally, functional validation studies are essential to confirm the biological significance of multiomic variants and their potential as biomarkers or therapeutic targets.

All in all, our findings highlight the transformative potential of multiomics approaches in understanding complex diseases like HS.

Supporting information

S1 File. S1 Table. Allele-specific expressed variants.

List of allele-specific expressed (ASE) variants with their location at both chromosomal and gene level, reference and alternate alleles, variant consequence, minor allele frequency (MAF), dbSNP (rs) ID, base quality and depth metrics, and transcripts per million (TPM) values for both GTEx skin (not exposed to sun) gene expression and the sample.

S2 Table. RNA editing. List of RNA editing (RNAe) variants with their location at both chromosomal and gene level, reference and alternate alleles, variant consequence, minor allele frequency (MAF), dbSNP (rs) ID, base quality and depth metrics, and transcripts per million (TPM) values for both GTEx skin (not exposed to sun) gene expression and the sample.

S3 Table. Nonsense-mediated mRNA decay. List of inferred nonsense-mediated messenger RNA (mRNA) decay (NMD) variants with their location at both chromosomal and gene level, reference and alternate alleles, variant consequence, minor allele frequency (MAF), dbSNP (rs) ID, base quality and depth metrics, and transcripts per million (TPM) values for both GTEx skin (not exposed to sun) gene expression and the sample. The expression categories are also shown.

S4 Table. Gain-of-function variants. List of inferred gain-of-function (GoF) variants with their location at both chromosomal and gene level, reference and alternate alleles, variant consequence, minor allele frequency (MAF), dbSNP (rs) ID, base quality and depth metrics, and transcripts per million (TPM) values for both GTEx skin (not exposed to sun) gene expression and the sample. The expression categories are also shown.

https://doi.org/10.1371/journal.pone.0326458.s001

(ZIP)

Acknowledgments

Fig 1 was designed using resources from canva.com. Reference management was carried out with Mendeley Reference Manager.

References

1. Chu Y-L, Yu S. Hidradenitis suppurativa: an understanding of genetic factors and treatment. Biomedicines. 2024;12(2):338. pmid:38397941
2. Moltrasio C, Tricarico PM, Romagnuolo M, Marzano AV, Crovella S. Hidradenitis suppurativa: a perspective on genetic factors involved in the disease. Biomedicines. 2022;10(8):2039. pmid:36009585
3. Zhang Y, Yu J, Xie X, Jiang F, Wu C. Application of genomic data in translational medicine during the big data era. Front Biosci (Landmark Ed). 2024;29(1):7. pmid:38287799
4. Smail C, Montgomery SB. RNA Sequencing in Disease Diagnosis. Annu Rev Genomics Hum Genet. 2024;25(1):353–67. pmid:38360541
5. Webb S, Haniffa M. Large-scale single-cell RNA sequencing atlases of human immune cells across lifespan: possibilities and challenges. Eur J Immunol. 2023;53(12):e2250222.
6. Huang D, Ma N, Li X, Gou Y, Duan Y, Liu B, et al. Advances in single-cell RNA sequencing and its applications in cancer research. J Hematol Oncol. 2023;16(1):98. pmid:37612741
7. Schreibing F, Anslinger TM, Kramann R. Fibrosis in pathology of heart and kidney: from deep RNA-sequencing to novel molecular targets. Circ Res. 2023;132(8):1013–33. pmid:37053278
8. Smirnov D, Konstantinovskiy N, Prokisch H. Integrative omics approaches to advance rare disease diagnostics. J Inherit Metab Dis. 2023;46(5):824–38. pmid:37553850
9. Jin C, Zhou D, Li J, Bi L, Li L. Single‑cell multi‑omics advances in lymphoma research (Review). Oncol Rep. 2023;50(4):184. pmid:37615192
10. Vasconcelos CFM, Ribas VT, Petrs-Silva H. Shared molecular pathways in glaucoma and other neurodegenerative diseases: insights from RNA-Seq analysis and miRNA regulation for promising therapeutic avenues. Cells. 2023;12(17):2155. pmid:37681887
11. Xiong J, Chen J, Guo Z, Zhang C, Yuan L, Gao K. A novel machine learning-based programmed cell death-related clinical diagnostic and prognostic model associated with immune infiltration in endometrial cancer. Front Oncol. 2023;13:1224071. pmid:37534256
12. Van Horebeek L, David M, Dedoncker N, Mallants K, Bijnens B, Goris A, et al. A targeted sequencing extension for transcript genotyping in single-cell transcriptomics. Life Sci Alliance. 2023;6(11):e202301971. pmid:37696578
13. Wang R, Helbig I, Edmondson AC, Lin L, Xing Y. Splicing defects in rare diseases: transcriptomics and machine learning strategies towards genetic diagnosis. Brief Bioinform. 2023;24(5):bbad284.
14. Chan JJ, Tabatabaeian H, Tay Y. 3’UTR heterogeneity and cancer progression. Trends Cell Biol. 2023;33(7):568–82. pmid:36372614
15. Pool A-H, Poldsam H, Chen S, Thomson M, Oka Y. Recovery of missing single-cell RNA-sequencing data with optimized transcriptomic references. Nat Methods. 2023;20(10):1506–15. pmid:37697162
16. Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;43(1110):11.10.1–11.10.33. pmid:25431634
17. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60. pmid:19451168
18. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303. pmid:20644199
19. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. pmid:24695404
20. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. pmid:23104886
21. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. pmid:25516281
22. Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 2011;12:323. pmid:21816040
23. Kim S, Scheffler K, Halpern AL, Bekritsky MA, Noh E, Källberg M, et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat Methods. 2018;15(8):591–4. pmid:30013048
24. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164. pmid:20601685
25. Kim Y-G, Kang H, Lee B, Jang H-J, Park J-H, Ha C, et al. A spectrum of nonsense-mediated mRNA decay efficiency along the degree of mutational constraint. Commun Biol. 2024;7(1):1461. pmid:39511375
26. Human protein atlas in R. Available from https://lgatto.github.io/hpar/.
27. Belkadi A, Thareja G, Halama A, Mahmoud Y, Jones D, Agnew S, et al. Identification of genetic variants controlling RNA editing and their effect on RNA structure stabilization. Eur J Hum Genet. 2020;28(12):1753–62. pmid:32651550
28. Bao A, Kollings J, Ma E, Manjunath J, D’Amiano A, Driscoll MS, et al. Association of human leukocyte antigen allelic variants with hidradenitis suppurativa across Fitzpatrick skin types: a cross-sectional analysis. J Am Acad Dermatol. 2025;92(3):567–9. pmid:39461503
29. Ocejo-Vinyals JG, Gonzalez-Gay MA, Fernández-Viña MA, Cantos-Mansilla J, Vilanova I, Blanco R, et al. Association of human leukocyte antigens class II variants with susceptibility to hidradenitis suppurativa in a Caucasian Spanish population. J Clin Med. 2020;9(10):3095. pmid:32992947
30. Akiyama M. Diseases categorized as autoinflammatory keratinization diseases (AiKDs), and their pathologies and treatments. Nagoya J Med Sci. 2024;86(1):1–15. pmid:38505726
31. Kashyap MP, Khan J, Sinha R, Jin L, Atigadda V, Deshane JS, et al. Advances in molecular pathogenesis of hidradenitis suppurativa: dysregulated keratins and ECM signaling. Semin Cell Dev Biol. 2022;128:120–9. pmid:35131152

