A subset of patients with monogenic disorders lacks disease causing mutations in the protein coding region of the corresponding gene. Here we describe a recurrent germline mutation found in two unrelated patients with complete androgen insensitivity syndrome (CAIS) generating an upstream open reading frame (uORF) in the 5’ untranslated region (5’-UTR) of the androgen receptor (AR) gene. We show in patient derived primary genital skin fibroblasts as well as in cell-based reporter assays that this mutation severely impacts AR function by reducing AR protein levels without affecting AR mRNA levels. Importantly, the newly generated uORF translates into a polypeptide and the expression level of this polypeptide inversely correlates with protein translation from the primary ORF of the AR thereby providing a model for AR-5′UTR mediated translational repression. Our findings not only add a hitherto unrecognized genetic cause to complete androgen insensitivity but also underline the importance of 5′UTR mutations affecting uORFs for the pathogenesis of monogenic disorders in general.
Citation: Hornig NC, de Beaufort C, Denzer F, Cools M, Wabitsch M, Ukat M, et al. (2016) A Recurrent Germline Mutation in the 5’UTR of the Androgen Receptor Causes Complete Androgen Insensitivity by Activating Aberrant uORF Translation. PLoS ONE 11(4): e0154158. https://doi.org/10.1371/journal.pone.0154158
Editor: Barbara Bardoni, CNRS UMR7275, FRANCE
Received: January 15, 2016; Accepted: April 8, 2016; Published: April 25, 2016
Copyright: © 2016 Hornig et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The study has been funded by the Medical Faculty of the Christian-Albrechts-University, CAU, Kiel, Germany (Forschungsförderung 2015 – Anschub to NH) and the German Research Council (Deutsche Forschungsgemeinschaft, DFG) (Ho2073/7-1/7-3 to PMH and AM 343/2-1/2-3 to OA). The authors thank the KinderKrebsInitiative Buchholz/Holm-Seppensen for providing infrastructure.
Competing interests: The authors have declared that no competing interests exist.
Post-transcriptional control of gene expression is considered an important mechanism for cells to quickly respond to internal and external stimuli. The untranslated regions (UTRs) at both ends of an mRNA play thereby an important role. While the 3′UTR has been extensively studied as a major place of regulation e.g. through microRNA mediated mRNA degradation and translational inhibition , translational regulation through the 5′UTR is a field that more recently came into focus. 5′UTRs are the sites where ribosomes enter and scan the mRNA for a suitable translational start codon at which the translation initiation complex becomes assembled. This scanning can be influenced by secondary structures of the mRNA as well as the presence of upstream open reading frames . The latter are cis-regulatory sequences in the 5’-UTR of eukaryotic mRNAs defined by a start codon and an in frame stop codon, located either upstream of or overlapping with the main initiation codon of the gene. uORFs can reduce translation of the downstream encoded protein by hindering ribosome progression or sequestering functional pre-initiation complexes . uORF mediated post-transcriptional regulation of gene expression has been associated with stress response and mutations changing the uORF have been implicated in the etiology of human diseases [3–6].
The 5′UTR of the androgen receptor has several features characteristic for translational regulation: its 1,115 nucleotides (nt) largely exceed the typical length of 100–220 nt across species and it is GC rich, thus prone to secondary structures. The AR belongs to the nuclear receptor superfamily of transcription factors. It is activated by its ligand dihydrotestosterone (DHT) which induces a conformational change of the AR and its translocation into the nucleus. Uncontrolled high AR-activity can lead to prostate cancer . Too low AR-activity on the other hand can cause the monogenic condition androgen insensitivity syndrome (AIS), defined by the inability of the androgen receptor to properly respond to androgens i.e. DHT . AR-activity is essential for male sex development and individuals with AIS although having a male chromosome set and normal male serum testosterone levels develop with a partial or complete female phenotype. AIS is a common cause of disorders of sex development (DSD) and can be divided in three major subgroups: complete AIS (CAIS) with no residual AR-function characterized by female external genitalia, partial AIS (PAIS) with a variably impaired AR causing incomplete virilisation, and mild AIS (MAIS) presenting with normal male external genitalia but infertility [9, 10]. In over 85% of cases, CAIS is explained by loss of function mutations affecting the coding region of the AR gene, while this applies only to roughly one-third of PAIS cases [11, 12].
Androgen signaling has been extensively studied mainly in prostate cancer. Comparatively little is known about the regulation of AR mRNA stability and translation. In 1994, Mizokami et al.  showed, using reporter assays, that parts of the AR-5′UTR are essential for induction of translation but not transcription. They hypothesized that mutations in the 5′UTR of the AR could be one of the reasons for androgen insensitivity due to reduced AR translation without affecting AR mRNA levels. However, up to now, no such mutation has been described.
Here we present two unrelated 46,XY girls diagnosed with CAIS but lacking mutations in the coding region of the AR. Genomic next generation sequencing (NGS) of the complete AR locus revealed the same c.-547C>T germline mutation in the AR-5’-UTR of both patients. We show both in vitro and in vivo that this mutation introduces a translated uORF in the 5′UTR of the AR and results in a reduced AR protein expression without affecting AR mRNA accumulation.
Two individuals presenting with complete androgen insensitivity but lacking mutations within the coding region of the AR
Patient 1 was born at term with female external genitalia. At the age of two weeks, an inguinal hernia on the left side was detected. Ultrasound examination suggested a testis within this hernia and a second testis at the internal orifice on the right. Lack of a uterus in combination with a blind ending vagina was detected and confirmed during surgical exploration. Chromosomal analysis revealed a 46,XY karyotype. Laboratory testing showed a massively elevated plasma testosterone concentration of 1,065 ng/dL (normal male range 14–363 ng/dL) together with an LH level of 2.2 IU/L (normal male range <0.3–2.5 IU/L). Therefore, the phenotype of the patient and the hormonal picture indicated the clinical diagnosis of CAIS. However, subsequent Sanger sequencing of the complete AR coding gene region including the intron-exon boundaries did not reveal any mutation. Further laboratory data are listed in Table 1.
Patient 2 was a term newborn with female external genitalia. In the second year of life, inguinal hernias occurred at both sides and were surgically repaired. At the age of six years, due to tomboyish behavior the mother insisted on chromosome analysis revealing a 46,XY karyotype. Subsequently the patient was transferred to a Pediatric Endocrinology Center. An hCG test (5,000 IU hCG/m2 i.m., once) was performed and indicated normal Leydig cell function with a rise of testosterone from 28 ng/dL to 238 ng/dL after 72 h. Surgical exploration confirmed the absence of a uterus and ovaries. Histology of gonadal biopsies showed immature testicular tissue with strong immunohistochemical expression of α1-inhibin. Therefore, the patient’s phenotype in conjunction with the laboratory data indicated the clinical diagnosis CAIS. However, no mutation was detected in the AR coding gene region including the intron-exon boundaries. During her most recent visit at the age of 13.6 years, she presented with a breast Tanner stage B4 and a plasma estradiol concentration of 18.9 ng/L (normal for B3 in 46,XX girls). No clinical signs of virilisation, i.e. clitoromegaly, were present despite a very high plasma testosterone of 693ng/dL (upper male range). Further laboratory data are listed in Table 2.
Identification of a 5′UTR mutation in the AR in both patients
We hypothesized that the CAIS-phenotype of the two patients could be due to a mutation in regulatory regions of the AR gene that usually escape routine Sanger sequencing. We therefore used an NGS approach to sequence the entire AR genomic locus in the two patients′ genital skin fibroblasts (GF) including the promoter, untranslated regions and the introns. In line with routine Sanger sequencing, NGS did neither reveal mutations in the AR coding sequence nor in the intron-exon boundaries. Instead, analysis for single nucleotide polymorphisms (SNP) and small insertion deletions (indels) of the NGS-data showed a hemizygous C to T mutation in both patients located in the 5′UTR of the AR mRNA (c.-547C>T; NM_000044.3; hg19) (Fig 1A and 1B, S1 and S2 Files). In order to exclude that the patients are related, we checked the SNP distribution in the analyzed region as well as the length of the polyglutamine repeat present in exon 1 which is located only 716 base pairs downstream the c.-547C>T mutation. SNP analysis revealed two different haplotypes (S1 and S2 Files). Furthermore the polyglutamine stretch in patient 1 consists of 24 glutamines compared to 19 glutamines in patient 2 indicating that the patients are indeed not related. We subsequently screened GF derived genomic DNA from further 25 foreskin controls without detecting the c.-547C>T mutation which was also absent in publicly available SNP databases including dbSNP and the 1000 Genome Project (hg19; release 68) The presence of the mutation was confirmed by Sanger sequencing in GF (Fig 1C) as well as peripheral blood of both patients indicating that the mutation is of germline rather than somatic origin. Consistently, the mutation was detected in a heterozygous constellation in the mothers of both patients (Fig 1D).
A) Schematic representation of the AR-5′UTRs and CDS. The position of the c.-547C>T mutation and the uORF are indicated. B) A Haloplex library spanning the coding region, introns, UTRs and up and downstream sequences of the AR genomic locus was prepared from DNA of the index patients′ GF and sequenced on a MiSeq benchtop sequencer. Analysis for single nucleotide polymorphisms (SNP) and small insertion deletions (indels) was performed by the SureCall software (Agilent). Indicated is the mutation found in the 5′UTR of the AR. A frequency of 1 corresponds to 100% of the reads. The depth indicates the number of reads covering the indicated genomic position. C) Sanger sequencing of a male control and the index patients 1 and 2. The sequences are visualized as reverse complement strand using the Chromas Lite software and show the c.-547C>T mutation in both index patients but not in the male control. D) Sanger sequencing of blood derived DNA from both index patients as well as from the mothers of the index patients. The sequences are visualized as reverse complement strand using the Chromas Lite software and show the c.-547C>T mutation in both index patients and in the patients′ mothers in a heterozygous constellation.
Context analysis of the mutation showed that the c.-547C>T mutation creates a translational initiation codon ATG imbedded in a vertebrate Kozak sequence (S1 Fig) . 183 nucleotides downstream of the ATG are two consecutive Stop-codons thereby delineating a short uORF within the AR-5′UTR schematically depicted in Fig 1A.
Functional analysis of the AR-5′UTR mutation in vitro
In order to functionally characterize the impact of the identified mutation, we cloned the entire 5′UTR of either the wild type (wt) or the mutated (mut) AR directly upstream of a green fluorescent protein (GFP) reporter gene and transfected the fusion constructs in HEK293 cells. Importantly, the newly generated uORF is out of frame with the primary ORF (pORF) of the AR as well as of the GFP in the AR5′UTR-GFP vector and therefore cannot create an AR-uORF-GFP fusion protein. Transcript analysis of the GFP reporter showed no difference in mRNA accumulation levels between the wild type and mutant AR5′UTR-GFP fusion construct (Fig 2A). In order to exclude a possible GFP-amplification from plasmid DNA during the RNA quantification, we included a control without reverse transcription which showed no amplification product underlining the specificity of the GFP mRNA measurement (S2 Fig). In contrast to comparable GFP mRNA levels, GFP detection by Western blot showed a reduced GFP protein expression of the mutant AR5′UTR-GFP construct as compared to the wild type construct (Fig 2B). We also performed fluorescence activated cell sorting (FACS) analysis in order to compare the fluorescent intensity between the wild type and the mutant construct on a single cell basis. This revealed a lower GFP signal intensity in cells transfected with the mutant AR5′UTR-GFP vector as compared to wild type transfected cells (Fig 2C). Therefore, we show in two independent approaches that the mutation in the AR-5′UTR identified in both patients is causative for the reduced GFP protein level.
HEK293 cells were transfected with either empty vector, AR5′UTRwt-GFP or AR5′UTRmut-GFP. After 72h of transfection RNA and protein was isolated. A) Q-RT-PCR analysis of GFP mRNA. There is no significant difference between GFP mRNA levels of the AR5′UTRwt-GFP and the AR5′UTRmut-GFP construct (p = 0.57). GFP mRNA levels were normalized to the neomycin resistance (neo) expression of the vector. Experiments were performed in triplicate. B) GFP protein analysis. Cells transfected with the AR5′UTRmut-GFP construct show a reduced expression of the GFP protein compared to the wt-construct. GAPDH measurement served as loading control. The experiment was done in triplicate. One representative blot is displayed. C) FACS analysis of AR5′UTR-GFP transfected cells. FACS analysis was performed equally 72h after transfection. Cells transfected with AR5′UTRmut-GFP show less fluorescent intensity. D) Transcriptional activity of AR5′UTRwt-AR and AR5′UTRmut-AR. HEK293 cells were transfected with either empty vector, AR5′UTRwt-AR or AR5′UTRmut-AR. After 48h of transfection luciferase activity was measured. AR induced luciferase expression is significantly lower in AR5′UTRmut-AR transfected cells in respect to AR5′UTRwt-AR transfected cells (p***<0.001).
We then wanted to investigate, if the mutation in the 5′UTR also reduces AR-protein function in vitro.
To this aim we cloned both wild type and mutant 5`UTR in front of the AR-CDS and transfected the corresponding plasmids into AR-negative HEK293 cells together with a plasmid containing an androgen response element (ARE) in front of a luciferase reporter gene. Firefly luciferase measurements were normalized for transfection efficiency by co-measuring Renilla luciferase expression. Cells transfected with the AR5′UTRwt-AR construct showed a mean 5.7 fold induction of AR mediated luciferase expression while this induction was strongly reduced in cells transfected with the AR5′UTRmut-AR plasmid (Fig 2D). This proves that the mutation in the AR5′UTR is responsible for a reduced transcriptional activity of the AR protein.
Expression analysis of wildtype and mutated AR in vivo and in vitro
The results described suggest that in patients carrying the mutation in the 5′UTR, AR function might be reduced due to reduced post-transcriptional AR protein expression. We therefore sought to analyze AR expression levels in GFs from the patients compared to a male control. Although we obtained GF of documented labial origin from the first patient, the second patient′s GF were taken from the inguinal region outside the genitalia which is not a classical androgen responsive tissue. We thus used only GF from patient 1. When analyzing AR mRNA expression no significant difference was seen between control GF and the patient′s GF (Fig 3A). At the protein level instead, the patient′s GF showed a strong reduction of the full-length AR protein. At the same time, a lower migrating band of approximately 75kD was increased in the patient′s GF compared to control GF (Fig 3B). Dihydrotestosterone (DHT) is known to activate and stabilize full length AR once it is released from the chaperones by N/C-terminal protein interaction . Consistently, we found an increase of the full-length AR protein upon DHT treatment in GF from the male control and the index patient, but not of the shorter AR-fragment (Fig 3B). GF-derived protein extracts from a CAIS patient with a known AR-frameshift mutation  and a complete loss of AR expression served as negative control (Fig 3B).
A) AR mRNA accumulation in GF. Total RNA was extracted from GF of a male control, a CAIS patient with a documented frameshift mutation in the AR gene and index patient 1. Means and standard deviations of three independent experiments are shown. B) DHT-dependent AR protein expression in GF. Whole cell protein lysates were extracted from GF of the male control, the index patient and the CAIS patient with a documented frameshift mutation and treated with 10nM DHT or ethanol, respectively. From all samples, 35 μg of protein were loaded, additionally 10 μg were loaded for the control sample to avoid overexposition. Immunoblot analysis shows a markedly reduced amount of the 112 kD full-length AR protein in the index patient and an increased detection of a shorter 75kD fragment. DHT treatment stabilizes full-length 112kD AR protein while the shorter 75kD fragment remains of similar intensity as compared to untreated GF. The corresponding bands are indicated by an arrow. An unspecific band is denoted by a star. GF from the CAIS patient served as negative control. Actin measurement served as loading control. C) DHT-dependent ectopic AR protein expression in PC3 cells. Cells were either treated with 10nM DHT or ethanol. 5′UTRmut-AR transfected cells show reduced AR protein expression compared to 5′UTRwt-AR transfected cells. AR-121 transfected cells express an AR-fragment of lower molecular weight. Actin measurement served as loading control. D) DHT-dependent activation of the AR target gene APOD is compromised in the index patient. AR activity was measured through the activation of the endogenous AR target gene APOD. This revealed a mean 3.4 fold activation of APOD in response to DHT stimulation in three male foreskin control derived cell lines. GF of the index patient, like GF of the CAIS patient with known mutation show a highly significant loss of APOD induction as compared to the male controls (p***<0.001). Means and standard deviations of three independent experiments are shown. p-values are calculated by a t-test. E) DHT-dependent activation of the AR target gene PPAP2B is compromised in the index patient. AR activity was measured through the activation of the endogenous AR target gene PPAP2B. This revealed a mean 1.54 fold activation of PPAP2B in response to DHT stimulation in three male foreskin control derived cell lines. GF of the index patient, like GF of the CAIS patient with known mutation show a significant loss of PPAP2B induction as compared to the male controls (p**<0.01). Means and standard deviations of at least three independent experiments are shown. p-values are calculated by a t-test.
In order to recapitulate the AR protein expression pattern in vitro, we transfected above described plasmids containing wild type and mutant 5`UTR in front of the AR-CDS into AR-depleted prostate cancer PC3 cells. We chose PC3 cells because prostate is an androgen responsive tissue and response to DHT treatment can be monitored. Indeed, AR5′UTRwt-AR transfected PC3 cells showed an enhanced AR-protein signal after DHT treatment as compared to untreated cells, indicating that the construct readily responds to androgens (Fig 3C). When looking at the AR5′UTRmut-AR construct, AR-expression levels were strongly reduced but still responsive to DHT treatment reflecting the situation in GF from patient 1 (Fig 3C). A lower molecular weight band also appears in the mutant, although at a different ratio in respect to the full length AR protein observed in vivo. We speculated that the lower band could be another translational start as initiation of translation at the next downstream ATG within the AR-CDS at amino acid 191 has been proposed in the literature and an AR-plasmid starting at this second ATG (AR-121) has been designed previously . The calculated molecular weight of this shorter AR fragment would be 78kD. When analyzing cell extracts of PC3 cells transfected with AR-121 next to those cells transfected with the AR5′UTRmut-AR plasmid, we found that AR-121 ran at the same height as the lower molecular weight band, indicating that this band is likely to be an AR-product initiating at the second translational start (Fig 3C).
AR-5′UTR mutation reduces the transcriptional activity of the AR-protein in vivo
Although we saw a markedly reduced AR protein expression in the index patient′s GF, it is unclear whether this reduction is sufficient to abolish the transcriptional activity of the AR hence leading to the CAIS phenotype observed in the patients. We therefore decided to test the efficiency of the altered AR protein in the index patient′s GF in inducing endogenous target gene transcription. While numerous endogenous AR target genes have been found in prostate cancer derived cell lines, only three have been described in healthy male genital skin fibroblasts . Out of these, Apolipoprotein D (APOD) showed a reproducible DHT dependent transcriptional induction. More recently we identified another AR-target gene in GF, the Phosphatidic Acid Phosphatase Type 2B (PPAP2B), and included it in this study. GF derived from male foreskin tissue served as control. We measured mRNA levels of APOD and PPAP2B in the presence and absence of DHT and calculated the ratio of induction. While control GF showed a three to four fold up-regulation of APOD mRNA, GF of the index patient failed to respond to DHT as measured by APOD-induction and behaved similarly to GF obtained from a CAIS patient with a documented AR frameshift mutation  (Fig 3D). Although transcriptional induction was generally lower for PPAP2B compared to APOD, it also showed a significant induction in response to DHT in male control GF (p<0.01) but a failure of induction in GF derived from the index patient and the CAIS patient harboring the AR frameshift mutation (Fig 3E). We concluded that a post-transcriptional impairment of AR protein expression is responsible for the inability of DHT to induce a transcriptional active form of the AR in vivo and therefore confirmed the clinical diagnosis of complete androgen insensitivity at the molecular level.
The AR-5′UTR uORF is translated into a polypeptide
Finally we set out to elucidate the mechanism by which the mutation alters AR protein levels. Two not mutually exclusive scenarios can be postulated to explain the observed results. In the first scenario, the ribosomes are stalled at the de novo ATG and therefore reach only insufficiently the AR-primary ORF (pORF). In the second option, a polypeptide is built from this ATG. The majority of ribosomes would fall off after completion of the polypeptide and only few would reach the AR-pORF due to leaky scanning . In order to test the second option we cloned a 6xHistidine (6xHIS)-tag in front of the putative uORF stop-codon within the AR-5′UTR and replaced the tagged uORF version (uORF-HIS) with the untagged one in the above described AR5′UTR-GFP vector. This resulted in an AR5′UTRwt-HIS-GFP and an AR5′UTRmut-HIS-GFP vector with ATG as potential uORF start codon (S3 Fig). The new uORF of 204 nt would produce a peptide of 68 amino acids including the 6xHIS-tag and an expected molecular weight of 8.4kDa. After transfection of both constructs in HEK293 cells, we readily detected a polypeptide in the mutant tagged AR5′UTRmut-HIS-GFP vector of the expected molecular weight (Fig 4A) indicating usage of the newly introduced uORF by the translational machinery. GFP expression in both constructs showed the opposite pattern in respect to the uORF-HIS expression, evidencing that the context of the newly generated uORF start codon leads to the translation of a HIS-tagged polypeptide and suppresses the translation of the consecutive GFP protein (Fig 4A). Analyzing GFP mRNA levels revealed again no difference between the AR5′UTRwt-HIS-GFP and the AR5′UTRmut-HIS-GFP (Fig 4B). These results show that strong translation of the uORF encoded polypeptide within the 5′UTR of the AR is responsible for the reduced GFP and thus AR expression.
Expression analysis of the HIS-tagged uORF. HEK293 cells were transfected with either AR5′UTRwtHIS-GFP or AR5′UTRmutHIS-GFP. After 48h of transfection, RNA and protein was isolated. A) GFP protein analysis. Cells transfected with the AR5′UTRmutHIS-GFP construct show a strong peptide expression and highly reduced GFP protein levels. GAPDH measurement served as loading control. The experiment was done in triplicate. One representative blot is displayed. B) Q-RT-PCR analysis of GFP mRNA. There is no significant difference in the GFP mRNA levels between the AR5′UTRwtHIS-GFP and the AR5′UTRmutHIS-GFP construct (p = 0.64). GFP mRNA levels were normalized to the neomycin resistance expression of the vector. Experiments were performed in triplicate. C) Model for impaired AR-translation. In the wild type (wt) situation the 40S ribosomal subunit scans the 5′UTR for the translational start codon of the AR coding sequence (CDS). The majority of ribosomal complex will form at the canonical AR-start codon and protein translation starts. In the mutant (mut) 5′UTR a full ribosomal complex is loaded at the start of the uORF generated though the c.-5467C>T mutation creating a polypeptide of 63 amino acids. Ribosome stalling and/or dissociation of the 60S subunit at the end of the uORF would cause a poor re-initiation at the AR pORF. Internal ribosomal binding may lead to re-initiation at the next available ATG within a Kozak sequence to produce a shorter (75 kD) AR protein.
According to the McGill Androgen Receptor Mutation Database over 500 AR gene mutations have been identified in AIS as well as prostate cancer patients . Almost all of the described mutations are located in coding exons including splice site regions. Only three mutations within the 5`-UTR have been reported so far in breast and prostate cancer, but none in AIS [19, 20]. However, a functional impact of these mutations has not been analyzed.
Using NGS covering the entire AR gene locus we discovered a hitherto unrecognized mutation in the 5′UTR of the AR to be recurrent in AIS. We show that this mutation creates a uORF encoded peptide, and that translation of the uORF suppresses protein expression of the pORF without changing mRNA expression. We therefore confirm the hypothesis of Mikozami et al.  who postulated in 1994 that mutations in the AR-5′UTR might lead to AIS.
According to common models, uORF translation into small peptides can either cause complete dissociation of the ribosomal subunits at the end of the uORF or the remaining of the 40S ribosomal subunit on the transcript and its scanning for the next available ATG . The here presented case fits the second option because we see strong reduction but no complete inhibition of translation at the pORF (Figs 3B and 3C, 4A and 4C). One of the features affecting the efficiency of uORF-mediated translational repression is the intercistronic distance between the uORF and the pORF. In the case of the AR-5′UTR mutation, the distance might be too small for proper reassembly at the pORF but might enable reassembly at a further downstream ORF. Further experiments are planned to test if different spacing between the cistrons influence translational start site usage. In line with this, a lower molecular weight AR fragment of approximately 75 kD appears in the index patient′s GF, possibly generated by re-initiation of translation at an ATG downstream of the canonical AR–pORF (Figs 3B and 3C and 4C). uORF mediated switches in translational start sites have been described in the literature  and alternative start site usage has been postulated for the AR occurring at the first internal methionine 191 (S1 Fig) in patients carrying an AR-stop mutation before amino acid 191 [22–24]. The resulting N-terminal deleted AR protein lacks the transactivating domain [17, 25]. An intact N-terminus is crucial for an activating interaction between the N-and C-terminus of the AR .
After DHT induction, AR full length protein in the index patient is still detectable at a low level. Nevertheless, both patients have a complete lack of external genital virilisation. Accordingly, there is no AR transcriptional activity in the patient′s GF confirming the diagnosis on a molecular level. A possible explanation could be that the low amount of AR protein is not sufficient for the development of the male phenotype during embryogenesis. Alternatively, the shorter AR-fragment detected in the GF could act as a competitive inhibitor on AR transcriptional activity because it still has a DNA-binding but no transactivation domain. In line with this, it has been shown that the N-terminal truncated shorter AR fragment starting at methionine 191 significantly reduces activity of full length AR upon DHT treatment in AR transfected GF . The fact that the lower molecular weight band is also visible in the control sample, although much weaker compared to the full length AR, could be a consequence of leaky ribosome scanning at the pORF. As full length AR protein levels exceed largely those of the shorter fragment it is unlikely to have an effect on AR-function.
In conclusion, we provide a novel explanation for the molecular pathogenesis of AIS, one of the most common causes of DSD. On a more general level, our results suggest that mutations altering translational control might account for a considerable number of patients displaying characteristic monogenic phenotypes of endocrine diseases and beyond but lacking protein coding mutations in the associated genes to date.
Materials and Methods
Sample collection, fibroblast strains and cell culture conditions
The study was performed with approval of the Ethical Committee of the Christian-Albrechts-University, Kiel, Germany (AZ: D415/11) (S3 File). We obtained written consent from the parents on behalf of the children/minors enrolled in this study. Control foreskin fibroblasts were obtained from biopsies during circumcision of males with phenotypically normal male external genitalia. Fibroblasts from patient 1 originated from labia minora tissue which is homologous to the male foreskin. Fibroblasts from a 46,XY CAIS individual with proven frameshift mutation (Pro219fsX) also originated from labia minora tissue . GF were grown in phenol red free Dulbecco′s modified Eagle′s medium (Life Technologies) supplemented with 10% fetal bovine serum (FBS; Biochrome), 100 units/ml penicillin/streptomycin, 2 mM L-glutamine (Biochrome) and 20 mM HEPES buffer (Life Technologies) at 37°C with 5% CO2. For hormone induction experiments, 1.2x105 cells were plated in three 6cm dishes each and incubated at 37°C with 5% CO2. After 24h, time point zero was taken by collecting the cells in RNA-extraction buffer (RLT; Qiagen). The remaining two dishes were washed three times in the above medium without FBS and further grown in the same medium supplemented with 0.1% charcoal treated FBS (serum starved conditions). To the first dish DHT (Sigma-Aldrich, Germany) dissolved in 100% ethanol was added to a final concentration of 10 nM. An equal volume of ethanol was added to the second dish. Cells were left for 72h under these conditions at 37°C with 5% CO2 after which they were lysed in RNA-extraction buffer (RLT; Qiagen). For protein extractions, cells were grown in complete tissue culture medium containing 10% charcoal treated FBS. For hormone treatment, DHT was added to a final concentration of 10nM. Protein lysates were taken 72h after hormone treatment.
Next generation sequencing library preparation and sequencing
The sequencing library was prepared from genomic DNA of both index patients as well as controls according to the Haloplex protocol (Agilent) with oligos spanning the chromosomal region ChrX:66,754,874–66,955,461 (hg19). Sequencing was performed on a MiSeq benchtop sequencer (Illumina). Alignment to the hg19 reference genome and single nucleotide polymorphism (SNP) was performed by the MiSeq-Reporter software (Illumina). Small insertion deletion (indel) calling was performed using the SureCall software 188.8.131.52 from Agilent.
Sanger sequencing was performed on ABI 310 and ABI 3130 Genetic Analyzers (Applied Biosystems) according to standard protocols.
Cloning of the 5′UTR of the AR
In order to investigate the effect of the AR-5’UTR on the expression of GFP we created a modified version of the pcDNA3.1 plasmid by removing all transcribed sequences between the putative transcription start site (TSS) and the polyadenylation signal . The vector was created by PCR using primers that were able to amplify the pcDNA3.1+ backbone and concomitantly inserted an EcoRI and a BamHI site downstream of the TSS. Two PCR fragments corresponding to the 5′UTR of the AR (generated from genomic DNA of a male control and of patient 1) and to the GFP gene were created with primers inserting an EcoRI and NcoI site to the wt and mutant AR-5’UTR, and a NcoI and BamHI site to GFP. The two fragments were ligated into the modified pcDNA3.1 vector and sequenced to verify the correct sequence. The uORF is out of frame related to the GFP gene, reflecting the in vivo situation of the AR. For cloning of the AR-5’UTR in front of the AR-CDS, we performed a PCR covering the AR-5’UTR inserting a NsiI and BamhHI site at the very 5′ end until the SexAI restriction site at the beginning of the AR-CDS. This PCR product was cut with NsiI and SexAI and cloned into a NsiI/ SexAI cut psvARO vector . A BamHI fragment covering the entire AR-5′UTR and -CDS was then cloned into the above pcDNA3.1 derived vector and sequenced for verification. For the HIS tagged uORF an overlap extension PCR was performed in order to insert six histidines in front of the uORF stop codon in both the wt and mut AR-5′UTR. The HIS-tagged uORF containing AR-5′UTR was then cloned in the modified pcDNA3.1 vector and sequenced as described above. A schematic representation of the in vitro constructs is shown in S3 Fig. Primers used for cloning are listed in S1 Table.
HEK293T cells were grown in DMEM supplemented with 10% FBS. Typically 400,000 cells were plated in each well of a 6 well plate and transfected the day after using 1 μg of plasmid DNA/well and lipofectamine 2000 (Life Technologies). After 72h of transfection, cells were collected for concomitant protein and RNA extraction using the PARIS kit (Life Technologies) and for FACS analysis by resuspending the cells in phosphate buffered saline (PBS) supplemented with 2% FBS. For luciferase experiments 200,000 HEK293T cells were transfected with 10 ng of either AR5′UTRwt-AR vector, AR5′UTRmut-AR vector or empty vector together with 80 ng Renilla-luciferase vector as well as 400 ng of the ARE-luciferase reporter vector. The next day, DHT was added to the cells at a final concentration of 10 nM in order to activate the AR-protein. After further 24h, Firefly luciferase and Renilla luciferase expressions were measured according to the Dual-Luciferase Reporter Assay protocol (Promega) using a Veritas microplate luminometer (Turner BioSystems). PC3 cells were grown in RPMI supplemented with 10% FBS. Typically 200,000 cells were plated in each well of a 6 well plate and transfected the day after using 2 μg of plasmid DNA/well and lipofectamine 2000 (Life Technologies). The next day, cells were washed twice in RPMI without FBS and then incubated in RPMI supplemented with charcoal treated FBS containing either 10nM DHT or 100% ethanol for 24 hours. Cells were lysed in RIPA buffer for protein extraction.
Roughly 1x105 cells were resuspended in FACS buffer (PBS, 2% FBS). Flow cytometry data were acquired on a BD Accuri C6 (BD Biosciences) and analyzed with the BDCSampler software (BD Biosciences).
RNA isolation, amplification and detection
Total RNA was isolated from fibroblasts using the RNeasy kit (Qiagen). 500ng of total RNA was reverse transcribed using the QuantiTect Reverse Transcription Kit (Qiagen). Quantitative PCR was performed with the QuantiTect SYBR Green master mix (Qiagen) using primers against APOD, AR and SDHA. All primers were purchased from Qiagen and used following the manufacturer′s instructions. For HEK293T transfections, 500ng RNA extracted using the PARIS kit (Life Technologies) was treated twice with TURBO DNase (Life Technologies) to eliminate residual plasmid DNA and reverse transcribed using the QuantiTect Reverse Transcription Kit (Qiagen). The reverse transcription reaction was also performed in the absence of Reverse Transcriptase. Quantitative PCR was performed with the QuantiTect SYBR Green master mix (Qiagen) using primers against the AR-5′UTR and GFP (S1 Table) and the neomycin resistance gene (neo) . Primer were designed using the Primer3 software. p-values were calculated using a two-sided t-test.
Protein isolation and detection
Whole cell protein lysates were obtained by resuspending the cells in ice-cold RIPA buffer, supplemented with a cocktail of protease inhibitors (CompleteTM, Roche). Protein extracts were separated by SDS-PAGE on a NuPAGE® Novex® 4–12% Bis-Tris Protein gel (Life Technologies) and transferred to a nitrocellulose membrane (Whatman, GE Healtehcare Life Sciences). The membrane was blocked for 1-2h at room temperature with TBS containing 0.1% Tween and 5% non-fat dry milk. For AR and HIS detection F39.4.1 antibody (BioGenex) (1:100 dilution) and Anti-6xHIS tag antibody (Anti-HIS HRP, MACS) (1:10,000 dilution) was used respectively over night at 4°C, for actin detection Anti-Actin (A 2066) antibody (Sigma-Aldrich) was used for 1-2h at room temperature (1:10,000 dilution). Anti-GFP-HRP (MACS) was used for the duration of 1h at room temperature for GFP detection (1:5,000 dilution) and GAPDH antibody (GeneTex) was used for 45min at room temperature (1:10,000 dilution) for GAPDH detection. Secondary anti-rabbit and anti-mouse antibodies (Immuno Reagents Inc.) were incubated for 1-2h at room temperature (1:4,000 dilution). All antibodies were resuspended in TBS containing 0.1% Tween and 5% non-fat dry milk. Signals were detected on an Amersham Hyperfilm ECL (GE Healthcare) using the Luminata Forte Western HRP Substrate (Millipore). Expected molecular weights of polypeptides were calculated using EMBOSS GUI v.1.14: pepstats.
S1 Fig. Kozak sequences around the start codon of the uORF, pORF and the downstream ORF starting at amino acid 191 of the AR.
The depicted Kozak consensus sequence is a sequence occurring on eukaryotic mRNA. Big letters correspond to high evolutionary conservation.
S2 Fig. PCR analysis of GFP and neomycin resistance mRNA after DNAse treatment.
HEK293 cells were transfected with either empty vector, AR5′-UTRwt-GFP, AR5′-UTRmut-GFP or not transfected at all. After 72h of transfection, RNA was isolated and treated with DNAse in order to completely digest vector DNA. The expected size for neo is 95bp, that of GFP is 201bp. The minus reverse transcriptase control (-RT) shows no PCR-product, demonstrating that plasmid-DNA was completely digested from the samples.
S3 Fig. Schematic representation of the constructs used for the in vitro studies.
S3 File. Study approval by the Ethical Committee of the Christian-Albrechts-University, Kiel, Germany.
We like to thank Brigitte Karwelies, Gila Hohmann and Tanja Stampe for their laboratory support and Dr. Ingram Iaccarino for providing the vector for 5′UTR cloning of the AR. We are grateful to Gaby Wehner and Dr. Anne Katrin Eckstein for providing GF. The study has been funded by the Medical Faculty of the Christian-Albrechts-University, CAU, Kiel, Germany (Forschungsförderung 2015 –Anschub to NH) and the German Research Council (Deutsche Forschungsgemeinschaft, DFG) (Ho2073/7-1/7-3 to PMH and AM 343/2-1/2-3 to OA). We thank the KinderKrebsInitiative Buchholz/Holm-Seppensen for providing infrastructure.
Conceived and designed the experiments: NCH PMH. Performed the experiments: NCH MU. Analyzed the data: NCH CB FD PMH RW. Contributed reagents/materials/analysis tools: CB FD MC MW HUS AEK RW OH LA RS OA PMH. Wrote the paper: NCH RW OA RS PMH.
- 1. de Moor CH, Meijer H, Lissenden S. Mechanisms of translational control by the 3' UTR in development and differentiation. Seminars in cell & developmental biology. 2005;16(1):49–58. Epub 2005/01/22. pmid:15659339.
- 2. Araujo PR, Yoon K, Ko D, Smith AD, Qiao M, Suresh U, et al. Before It Gets Started: Regulating Translation at the 5' UTR. Comparative and functional genomics. 2012;2012:475731. Epub 2012/06/14. pmid:22693426; PubMed Central PMCID: PMC3368165.
- 3. Barbosa C, Peixeiro I, Romao L. Gene expression regulation by upstream open reading frames and human disease. PLoS genetics. 2013;9(8):e1003529. Epub 2013/08/21. pmid:23950723; PubMed Central PMCID: PMC3738444.
- 4. Andrews SJ, Rothnagel JA. Emerging evidence for functional peptides encoded by short open reading frames. Nature reviews Genetics. 2014;15(3):193–204. Epub 2014/02/12. pmid:24514441.
- 5. Jousse C, Bruhat A, Carraro V, Urano F, Ferrara M, Ron D, et al. Inhibition of CHOP translation by a peptide encoded by an open reading frame localized in the chop 5'UTR. Nucleic acids research. 2001;29(21):4341–51. Epub 2001/11/03. pmid:11691921; PubMed Central PMCID: PMC60176.
- 6. Mueller PP, Hinnebusch AG. Multiple upstream AUG codons mediate translational control of GCN4. Cell. 1986;45(2):201–7. Epub 1986/04/25. pmid:3516411.
- 7. Lonergan PE, Tindall DJ. Androgen receptor signaling in prostate cancer development and progression. Journal of carcinogenesis. 2011;10:20. Epub 2011/09/03. pmid:21886458; PubMed Central PMCID: PMC3162670.
- 8. Mongan NP, Tadokoro-Cuccaro R, Bunch T, Hughes IA. Androgen insensitivity syndrome. Best practice & research Clinical endocrinology & metabolism. 2015;29(4):569–80. Epub 2015/08/26. pmid:26303084.
- 9. Gottlieb B, Beitel LK, Nadarajah A, Paliouras M, Trifiro M. The androgen receptor gene mutations database: 2012 update. Human mutation. 2012;33(5):887–94. Epub 2012/02/16. pmid:22334387.
- 10. Hiort O, Birnbaum W, Marshall L, Wunsch L, Werner R, Schroder T, et al. Management of disorders of sex development. Nature reviews Endocrinology. 2014;10(9):520–9. Epub 2014/07/16. pmid:25022812.
- 11. Deeb A, Mason C, Lee YS, Hughes IA. Correlation between genotype, phenotype and sex of rearing in 111 patients with partial androgen insensitivity syndrome. Clinical endocrinology. 2005;63(1):56–62. Epub 2005/06/21. pmid:15963062.
- 12. Jaaskelainen J. Molecular biology of androgen insensitivity. Molecular and cellular endocrinology. 2012;352(1–2):4–12. Epub 2011/08/30. 10.1016/j.mce.2011.08.006. pmid:21871529.
- 13. Mizokami A, Chang C. Induction of translation by the 5'-untranslated region of human androgen receptor mRNA. The Journal of biological chemistry. 1994;269(41):25655–9. Epub 1994/10/14. pmid:7929269.
- 14. Nakagawa S, Niimura Y, Gojobori T, Tanaka H, Miura K. Diversity of preferred nucleotide sequences around the translation initiation codon in eukaryote genomes. Nucleic acids research. 2008;36(3):861–71. Epub 2007/12/19. pmid:18086709; PubMed Central PMCID: PMC2241899.
- 15. Werner R, Holterhus PM. Androgen action. Endocrine development. 2014;27:28–40. Epub 2014/09/24. pmid:25247642.
- 16. Audi L, Fernandez-Cancio M, Carrascosa A, Andaluz P, Toran N, Piro C, et al. Novel (60%) and recurrent (40%) androgen receptor gene mutations in a series of 59 patients with a 46,XY disorder of sex development. The Journal of clinical endocrinology and metabolism. 2010;95(4):1876–88. Epub 2010/02/13. pmid:20150575.
- 17. Jenster G, van der Korput HA, Trapman J, Brinkmann AO. Identification of two transcription activation units in the N-terminal domain of the human androgen receptor. The Journal of biological chemistry. 1995;270(13):7341–6. Epub 1995/03/31. pmid:7706276.
- 18. Appari M, Werner R, Wunsch L, Cario G, Demeter J, Hiort O, et al. Apolipoprotein D (APOD) is a putative biomarker of androgen receptor function in androgen insensitivity syndrome. J Mol Med (Berl). 2009;87(6):623–32. Epub 2009/03/31. pmid:19330472.
- 19. Crocitto LE, Henderson BE, Coetzee GA. Identification of two germline point mutations in the 5'UTR of the androgen receptor gene in men with prostate cancer. The Journal of urology. 1997;158(4):1599–601. Epub 1997/09/25. pmid:9302181.
- 20. Peters KM, Edwards SL, Nair SS, French JD, Bailey PJ, Salkield K, et al. Androgen receptor expression predicts breast cancer survival: the role of genetic and epigenetic events. BMC cancer. 2012;12:132. Epub 2012/04/05. pmid:22471922; PubMed Central PMCID: PMC3349557.
- 21. Wethmar K, Smink JJ, Leutz A. Upstream open reading frames: molecular switches in (patho)physiology. BioEssays: news and reviews in molecular, cellular and developmental biology. 2010;32(10):885–93. Epub 2010/08/21. PubMed Central PMCID: PMC3045505.
- 22. Holterhus PM, Bruggenwirth HT, Hiort O, Kleinkauf-Houcken A, Kruse K, Sinnecker GH, et al. Mosaicism due to a somatic mutation of the androgen receptor gene determines phenotype in androgen insensitivity syndrome. The Journal of clinical endocrinology and metabolism. 1997;82(11):3584–9. Epub 1997/11/14. pmid:9360511.
- 23. Zoppi S, Wilson CM, Harbison MD, Griffin JE, Wilson JD, McPhaul MJ, et al. Complete testicular feminization caused by an amino-terminal truncation of the androgen receptor with downstream initiation. The Journal of clinical investigation. 1993;91(3):1105–12. Epub 1993/03/01. pmid:8450040; PubMed Central PMCID: PMC288066.
- 24. Wilson CM, McPhaul MJ. A and B forms of the androgen receptor are present in human genital skin fibroblasts. Proceedings of the National Academy of Sciences of the United States of America. 1994;91(4):1234–8. Epub 1994/02/15. pmid:8108393; PubMed Central PMCID: PMC43131.
- 25. Simental JA, Sar M, Lane MV, French FS, Wilson EM. Transcriptional activation and nuclear targeting signals of the human androgen receptor. The Journal of biological chemistry. 1991;266(1):510–8. Epub 1991/01/05. pmid:1985913.
- 26. van Royen ME, van Cappellen WA, de Vos C, Houtsmuller AB, Trapman J. Stepwise androgen receptor dimerization. Journal of cell science. 2012;125(Pt 8):1970–9. Epub 2012/02/14. pmid:22328501.
- 27. Liegibel UM, Sommer U, Boercsoek I, Hilscher U, Bierhaus A, Schweikert HU, et al. Androgen receptor isoforms AR-A and AR-B display functional differences in cultured human bone cells and genital skin fibroblasts. Steroids. 2003;68(14):1179–87. Epub 2003/12/04. pmid:14643880.
- 28. Wang G, Guo X, Floros J. Differences in the translation efficiency and mRNA stability mediated by 5'-UTR splice variants of human SP-A1 and SP-A2 genes. American journal of physiology Lung cellular and molecular physiology. 2005;289(3):L497–508. Epub 2005/05/17. pmid:15894557.
- 29. Ramalingam P, Palanichamy JK, Singh A, Das P, Bhagat M, Kassab MA, et al. Biogenesis of intronic miRNAs located in clusters by independent transcription and alternative splicing. RNA. 2014;20(1):76–87. Epub 2013/11/15. pmid:24226766; PubMed Central PMCID: PMC3866646.