Mycobacterium avium complex (MAC) infection causes disseminated disease in immunocompromised hosts, such as human immunodeficiency virus (HIV)-positive patients, and pulmonary disease in persons without systemic immunosuppression, which has been increasing in many countries. In Japan, the incidence of pulmonary MAC disease caused by M. avium is about 7 times higher than that caused by M. intracellulare. To explore the bacterial factors that affect the pathological state of MAC disease caused by M. avium, we determined the complete genome sequence of the previously unreported M. avium subsp. hominissuis strain TH135 isolated from a HIV-negative patient with pulmonary MAC disease and compared it with the known genomic sequence of M. avium strain 104 derived from an acquired immunodeficiency syndrome patient with MAC disease. The genome of strain TH135 consists of a 4,951,217-bp circular chromosome with 4,636 coding sequences. Comparative analysis revealed that 4,012 genes are shared between the two strains, and strains TH135 and 104 have 624 and 1,108 unique genes, respectively. Many strain-specific regions including virulence-associated genes were found in genomes of both strains, and except for some regions, the G+C content in the specific regions was low compared with the mean G+C content of the corresponding chromosome. Screening of clinical isolates for genes located in the strain-specific regions revealed that the detection rates of strain TH135-specific genes were relatively high in specimens isolated from pulmonary MAC disease patients, while, those of strain 104-specific genes were relatively high in those from HIV-positive patients. Collectively, M. avium strains that cause pulmonary and disseminated disease possess genetically distinct features, and it suggests that the acquisition of specific genes during strain evolution has played an important role in the pathological manifestations of MAC disease.
Citation: Uchiya K-i, Takahashi H, Yagi T, Moriyama M, Inagaki T, Ichikawa K, et al. (2013) Comparative Genome Analysis of Mycobacterium avium Revealed Genetic Diversity in Strains that Cause Pulmonary and Disseminated Disease. PLoS ONE 8(8): e71831. https://doi.org/10.1371/journal.pone.0071831
Editor: Olivier Neyrolles, Institut de Pharmacologie et de Biologie Structurale, France
Received: March 25, 2013; Accepted: July 2, 2013; Published: August 21, 2013
Copyright: © 2013 Uchiya et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by JSPS KAKENHI Grant Number 24590164 and Grant-in-Aid for Specially Promoted Research of Meijo University Research Institute (to K.U.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Many species of nontuberculous mycobacteria (NTM) are found in a variety of habitats, including natural water, water distribution systems, bathrooms, soil, and household dust –. Unlike tuberculosis, direct transmission via human-to-human contact is rare in NTM infection. Instead, NTM infection is thought to occur via exposure to aerosols containing mycobacteria in the environment, although the exact source has not been specified. In Japan, approximately 90% of NTM infections are caused by Mycobacterium avium complex (MAC, 70–80%) and M. kansasii (10–20%) .
MAC infection can be attributed to two closely related organisms, M. avium and M. intracellulare. M. avium comprises four subspecies that infect specific hosts: M. avium subsp. avium and M. avium subsp. silvaticum are avian pathogens; M. avium subsp. hominissuis is found in the environment, humans, and pigs; and M. avium subsp. paratuberculosis causes disease in livestock and wildlife. It is generally considered that M. avium subsp. hominissuis is responsible for MAC disease . The presence (or absence) of specific insertion sequences (IS1245, IS900, and IS901) can be used to distinguish subspecies , but today, sequencing of the hsp65 gene, one of the house keeping genes of M. avium, provides more accurate information for subspecies identification .
M. avium is an opportunistic pathogen that causes generalized disseminated disease in immunocompromised patients, such as human immunodeficiency virus (HIV)-positive patients. M. avium infection spread with acquired immunodeficiency syndrome (AIDS) in the 1980’s. In contrast to HIV-associated disseminated MAC disease, pulmonary MAC disease in immunocompetent persons is caused by M. intracellulare and M. avium, and their prevalence varies by country. In Japan, the incidence of pulmonary MAC disease caused by M. avium is about 7 times higher than that caused by M. intracellulare . In recent years, pulmonary MAC disease producing lesions in the lingular segments and middle lobe is increasing in middle-aged to elderly females with no underlying disease in many countries . The prevalence of NTM lung disease in Japan increased dramatically from 0.82 per 100,000 population in 1971 to 5.9 per 100,000 population in 2001 , and the current rate is estimated as 8.0–10.0 per 100,000 population. This rate is substantially higher than the rates seen in the United State and Europe –.
It is likely that bacterial factors, as well as host-related risk factors, are associated with the establishment of pulmonary MAC disease. Although results are inconclusive, several studies have investigated the following possible host-related risk factors: decreases in the levels of estrogen , a major female sex hormone, and the presence of polymorphisms in NRAMP1 (encoding natural resistance-associated macrophage protein 1)  and MICA (encoding major histocompatibility complex class I chain–related A) . On the other hand, little is known about bacterial factors.
The mechanisms of the pathogenicity of mycobacteria involve the following: prevention of maturation of pathogen-containing phagosomes in host macrophages ; production of enzymes, such as catalase , that remove reactive oxygen species; and synthesis of mycobactin that serves as a siderophore to effectively acquire iron necessary for bacterial growth . In addition to these mechanisms, mycobacteria are also equipped with mechanisms for invading host cells .
In this study, we carried out whole-genome sequencing on the previously unreported M. avium subsp. hominissuis strain TH135 isolated from a HIV-negative patient with pulmonary MAC disease, and we performed comparative analysis between genomes of strain TH135 and strain 104 isolated from an AIDS patient with MAC disease to examine the bacterial factors that affect the establishment of pulmonary disease caused by M. avium.
Materials and Methods
Bacterial Strains, Growth Condition, and Genomic DNA Isolation
The clinical isolates used in this study comprised 35 M. avium strains including the genome analysis strain TH135 recovered from the sputa of HIV-negative patients with pulmonary MAC disease at the National Hospital Organization, Higashinagoya National Hospital in Japan from 2004 to 2008. In addition, 28 M. avium clinical isolates derived from blood of HIV-positive patients with disseminated MAC disease were provided by the National Center for Global Health and Medicine, formerly called the International Medical Center of Japan. The subspecies of M. avium clinical isolates was identified as M. avium subsp. hominissuis by sequence analysis of the 3′ fragment of the hsp65 gene . The organism was grown in Middlebrook 7H9 liquid medium supplemented with 10% oleic acid/albumin/dextrose/catalase enrichment (Difco Laboratories, Detroit, MI) at 37°C. Genomic DNA was extracted with a Qiagen kit (Qiagen Inc., Valencia, CA) according to the manufacturer’s instructions.
Genome Sequencing and Annotation
The genome sequence of M. avium subsp. hominissuis strain TH135 was determined by combining the technology of two genome sequencers: 454 GS FLX (Roche, Mannheim, Germany); and Hiseq 2000 (Illumina, CA). The genomic DNA was first sequenced using the Hiseq with 101-bp paired-end library (80,119,704 reads, 1,600-fold genome coverage), and sequence reads were assembled using Velvet (version 1.2.07). Gaps between the contigs were closed by mapping with FLX 8-kb paired-end reads (295,431 reads, 14-fold genome coverage) obtained using GS De Novo Assembler (version 2.7). The sequence obtained by the Hiseq was further compared with data obtained by the FLX, and unmatched sequences and gap sequences in the scaffolds were filled by PCR amplification followed by Sanger sequencing. The genome sequence was automatically annotated using the Microbial Genome Annotation Pipeline  and corrected manually using in silico Molecular Cloning Genomics Edition (IMCGE) software .
Protein function was assigned based on a BLASTP similarity search against the NCBI ‘nr’ (non-redundant protein) database. Transfer RNA (tRNA) and ribosomal RNA (rRNA) were predicted using a tRNAscan-SE 1.23  and RNAmmer 1.2 , respectively. Insertion sequence (IS) elements were identified using the IS-Finder , and the microbial genome database (MBGD) was used to detect conserved gene clusters .
Comparative Genomic Analysis
Comparative genomic analysis was performed with M. avium subsp. hominissuis strain 104 (GenBank accession no. NC_008595), which was derived from an AIDS patient with MAC disease . IMCGE software was used for data management and for visualization of genomic features. Multiple whole-genome alignments were performed using Mauve software, which was designed for the identification and alignment of conserved genomic DNA in the presence of rearrangements . Orthologues in M. avium strains TH135 and 104 were defined by bidirectional best-hit analysis between the genomes of both strains with a threshold of >90% amino acid identity and >60% aligned length coverage of a query sequence. The remaining coding sequences (CDS) without the characteristics of orthologues were defined as M. avium strain-specific CDSs.
PCR Analysis and Sequence Analysis
Clinical isolates were cultured in 5 mL 7H9 liquid medium supplemented with 10% oleic acid/albumin/dextrose/catalase enrichment at 37°C for 1–2 weeks and then transferred to 25 mL of the same medium for further culture. The culture was centrifuged, and DNA was extracted using an InstaGene Matrix (Bio-Rad Laboratories, Hercules, CA) according to the manufacturer’s instructions. The PCR mixture (50 µL) was composed of DNA solution (5–50 ng), 1 U of AmpliTaqGold DNA polymerase (Applied Biosystems, Foster City, CA), 5 µL of 2 mM deoxynucleoside triphosphate mixture, 5 µL of 10×PCR buffer, 1 µL of each primer set at 25 µM, and dimethyl sulfoxide (Sigma-Aldrich) to a final concentration of 4%. The PCR primers used in this study are shown in Table S1. The PCR conditions were as follows: 1 cycle at 95°C for 10 min; 35 cycles at 94°C for 1 min, 55°C for 1 min, and 72°C for 1 min; and 1 cycle at 72°C for 7 min. The PCR products were electrophoresed with the TrackIt 50 bp DNA ladder (Invitrogen, San Diego, CA) in a 2% agarose gel (E-gel; Invitrogen). The resulting PCR products were purified using a GenElute PCR DNA purification kit (Sigma-Aldrich), and direct sequencing analysis was performed using the same primers as those used for PCR. The resulting nucleotide sequences were compared with the genomic sequence data for M. avium strain TH135 and strain 104. Accordingly, the presence of strain TH135- and 104-specific genes in clinical isolates was determined by the use of specific primers in PCR. The suitability of the present DNA samples for screening of clinical isolates with PCR was determined by amplification of the hsp65 gene – the gene used to identify subspecies of M. avium in clinical isolates.
Data for the detection rate of specific genes in M. avium clinical isolates were analyzed statistically using Fisher’s exact test. P values of <0.05 were considered significant.
Results and Discussion
General Genomic Features
To explore the bacterial factors that affect the establishment of pulmonary disease caused by M. avium subsp. hominissuis, we determined the whole genome sequence of the previously unreported M. avium strain TH135 isolated from a HIV-negative patient with pulmonary disease and compared it with the complete genome of M. avium strain 104 derived from an AIDS patient with MAC disease . The general features of strain TH135 compared with those of strain 104 are presented in Table 1. The replication origin of the strain TH135 chromosome was deduced on the basis of the transition point in GC skew analysis and the presence of the dnaA gene accompanied by several DnaA boxes (Fig. 1). The genome was composed of a single circular chromosome of 4,951,217 bp with an average G+C content of 69.32%, 4,636 predicted CDS, 46 tRNA genes, and a single rRNA operon with the typical order of 16S, 23S, and 5S rRNA genes. The chromosome size of strain TH135 is 524,274 bp shorter than that of strain 104 (5,475,491 bp). Although both strains belong to the same subspecies, IS content is very different between the strains, and the strain 104 genome carries more IS elements than the strain TH135 genome. On the other hand, it is noteworthy that strain TH135 harbors five ISMav6 genes (MAH_0649, MAH_1321, MAH_2272, MAH_2945, and MAH_3485) that have 60 point mutations compared with a subspecies differentiation marker IS901, which is on the genomes of different subspecies––M. avium subsp. avium and M. avium subsp. silvaticum . IS elements are thought to be one of the major players in prokaryote genome plasticity . A greater number of IS elements indicates that the genome has undergone further structural variation during strain evolution.
The scale is shown in base pairs (Mb), with zero representing the location of the dnaA gene. From the outside to the inside, the outer two circles show forward- and reverse-strand coding sequences (CDS), respectively. The third and fourth circles show rRNA operons and tRNA genes, respectively. The fifth circle shows the percentage of G+C in relation to the mean G+C of the chromosome. The sixth circle shows the GC skew (G – C)/(G+C). The color of each CDS was assigned according to the cluster of orthologous groups (COG) functional classification system . The color of each COG family is shown in the figure.
Whole-genome alignment of both strains was carried out using Mauve software (Fig. 2). Although high conservation in both the sequence and gene order of strain TH135 and 104 was observed, there were gene insertions and two large inversions (green and blue blocks in Fig. 2). Gaps or white spaces within blocks indicate the presence of strain-specific sequences. On specific regions of over 10,000 bp in length, strain TH135 has 10 loci (specific region (SR)-1 to SR-10) and strain 104 has 11 loci (SR-11 to SR-21). Compared with strain TH135, the strain 104 genome possesses many specific regions of large size. Of these specific regions, many CDSs in SR-3, SR-10, and SR-19 were highly homologous to the CDSs in M. paratuberculosis, M. parascrofulaceum, and M. intracellulare, respectively (Table S2). Interestingly, many of these regions have low G+C content compared with the mean G+C content of the corresponding chromosome, which is an added sign of foreign origin (Fig. 2; Table S2 and Table S3). Furthermore, such specific regions are flanked by genes which encode integrases of phage origin and/or transposases derived from transposons (Table S2 and Table S3). In particular, region SR-14 of strain 104 was identified as a prophage insertion region flanked by phage insertion sites attL and attR. Also interestingly, a comparison of the genome of M. avium subsp. paratuberculosis (GenBank accession no. NC_002944), which causes disease in livestock and wildlife, and the genome of the present strains (TH135 and 104) revealed that regions SR-1–SR-10, but not SR-3, of strain TH135 and regions SR-11–SR-21 of strain 104 are present as specific regions in the strain TH135 and strain 104 genomes (data not shown). Taken together, these regions are likely to be inserted into chromosomes via horizontal gene transfer during strain evolution. Although strain TH135 and strain 104 show high genomic DNA sequence similarity (94.4% for strain TH135 and 86.6% for strain 104), both harbor many strain-specific regions, suggesting that both strains evolved independently from a common ancestor.
Representation of five local collinear blocks (LCBs) between chromosomal sequences of each strain generated by Mauve alignment software. The connecting lines between blocks indicate the location of each block in each genome. Each colored block represents a homologous region. LCBs drawn horizontally represent homology in the reverse strand of the chromosome. Gaps or white spaces within LCBs indicate the presence of strain-specific sequences. Specific region (SR)-1 to SR-10 in strain TH135 and SR-11 to SR-21 in strain 104 show strain-specific regions of over 10,000 bp in length.
The Venn diagram in Fig. 3A shows the distribution of shared orthologues and strain-specific genes between strains TH135 and 104. As shown in Fig. 3A, 4,012 genes were shared between the two strains, whereas the number of strain TH135- and 104-specific genes was 624 (13.5%) and 1,108 (21.6%), respectively. Furthermore, strain-specific ORFs are classified according to cluster of orthologous groups (COG) category (Fig. 3B) . The relative contribution of COGs are generally similar between strain TH135-specific genes and strain 104-specific genes, and category S (function unknown) and category L (recombination and repair) genes were dominant in both strains. However, the relative contribution of category I (lipid metabolism) was 3.5-fold higher in strain 104-specific genes (14%) than in strain TH135-specific genes (4%), while that of category M (cell envelope and outer membrane biogenesis) was 4-fold higher in strain TH135-specific genes (4%) than in strain 104-specific genes (1%).
(A) Venn diagram showing the distribution of shared orthologues and strain-specific genes between M. avium strains TH135 and 104. The Venn diagram was created with IMCGE software. Comparative analysis revealed that 4,012 genes are shared between the two strains, and 624 genes and 1,108 genes are unique to strains TH135 and 104, respectively. (B) Classification of specific genes of each strain based on COG functional categories. Each colored segment indicates the relative contribution of a functional category as a percentage of total COGs. The color of each COG family is shown in Figure 1.
Comparison of Virulence-associated Factors
We considered that differences in virulence-associated factors between strain TH135 and strain 104 might explain the different pathological manifestations of MAC disease caused by M. avium. Thus, we searched for such factors and compared them between the strains.
Mammalian Cell Entry
Four mammalian cell entry (mce) operons (mce1 to mce4), found in the genome sequence of M. tuberculosis H37Rv, are associated with the virulence of M. tuberculosis . Each mce locus comprises two yrbE and six mce genes (mceA to mceF), which are homologous to the permeases and substrate-binding proteins of ABC transporters, respectively . Although the precise mechanisms involving Mce proteins remain unclear, it was demonstrated that secreted Mce1A protein moves to the bacterial surface, thereby facilitating entry of mycobacteria into macrophages and HeLa cells, followed by subsequent survival , . Like Mce1A protein, Mce3A and Mce3E are also involved in cellular uptake of mycobacteria , while Mce2A appears to have a distinct role . The mce4 operon is implicated in the uptake of cholesterol, which is an essential energy source for mycobacteria for its long-term survival in host cells . Klepp et al. showed the possible involvement of mce operons of M. smegmatis in the maintenance of cell surface properties . Furthermore, bioinformatics evidence suggests that some Mce proteins of the bacterial membrane contribute to the formation of beta barrel proteins serving as channels –.
We found that the number of mce operons varies between strain TH135 (n = 7) and strain 104 (n = 9). The genomes of strain TH135 and 104 contained CDSs with a 52.0–84.3% sequence homology to Mce proteins encoded by operons mce1–4 in the M. tuberculosis H37Rv genome (GenBank accession no. NC_000962) (Table S4). Comparison of mce family genes in strain TH135 and 104 genome revealed that strain TH135-specific mce genes, MAH_0587 and MAH_0800, are absent in the strain 104 genome due to frameshifting (Table 2). Furthermore, five genes (MAH_0796 to MAH_0799, and MAH_0801) encoding Mce family proteins in the strain TH135 genome show low homology to corresponding genes (MAV_0948 to MAV_0951, and MAV_0953) in the strain 104 genome (Table S4). Interestingly, these five mce genes in the strain TH135 genome were located in specific region SR-2 (Table 2).
There are two sets of mce operons that show homology to genes belonging to mce3 operons in M. tuberculosis H37Rv  in the strain 104 genome, and genes belonging to one set showed high sequence similarity to mce genes (MAH_1680 to MAH_1685) in the strain TH135 genome (Table S4), while genes belonging to the other set (MAV_2532 to MAV_2537) were located in strain 104-specific region SR-19 (Table 3). The significance of the presence of these two sets of mce operons is an intriguing question. Furthermore, MAV_1807, a mce homologue, is located in region SR-16, and an operon consisting of six genes (MAV_5047 to MAV_5052), which is not present in TH135, is in region SR-21. Thus, strain 104 harbors more genes encoding strain-specific Mce proteins than strain TH135. Elucidating the roles of mce related genes is necessary to understand their relationships with pathological manifestations of MAC disease.
Mycobacteria use type VII secretion systems (ESX-1 to ESX-5) to secrete the 6-kDa early secreted antigenic target (ESAT-6), its protein partner the 10-kDa culture filtrate protein (CFP-10), and other effector proteins such as those with conserved N-terminal domains containing proline-glutamic acid (PE) or proline-proline glutamic acid (PPE) motifs. These effectors play an important role in long-term survival of bacteria in host cells , . Comparative genome analysis revealed that the genomes of strain TH135 and 104 contained CDSs having a 43.9% to 93.6% homology to Esx-related proteins encoded by the esx-2 to esx-5 loci, but not the esx-1 loci, in the M. tuberculosis H37Rv genome (Table S4). There are a few differences in esx-2 and esx-3 loci between the strains: the strain 104 genome harbors point shift mutations in the regions corresponding to MAH_0168, MAH_4605, and MAH_4283 of strain TH135, and these regions show a respective homology of 70.1%, 88.7%, and 85.6% to PPE69, EccC2, and EccC3 in M. tuberculosis H37Rv (Table S4).
Mycobacteria carry many genes encoding PPE and PE proteins with unknown function. We found three PPE family protein genes (MAH_1006, MAH_1657, and MAH_1946) in the strain TH135 genome, but not in the strain 104 genome (Table 2). On the other hand, strain 104 harbors three strain-specific PE and PPE family genes (MAV_0117 encoding a PE protein, and MAV_0790 and MAV_0117 encoding PPE proteins).
MmpL and MmpS Proteins
The high content of lipids, such as mycolic acids, in the cell wall is a distinctive characteristic of mycobacteria , and waxy cell walls play a pivotal role in host survival. MmpL and MmpS have been reported to mediate the transport of lipid metabolites to biosynthesize cell wall lipids –, albeit by an undefined mechanism. The mmpL genes are homologous to the genes encoding proteins that belong to a family of multidrug resistance pumps termed resistance nodulation cell division (RND) , . MmpS proteins possess one N-terminal transmembrane domain with an extracytoplasmic C-terminus . The strain TH135 genome harbors all mmpL and mmpS genes found in the strain 104 genome as well as strain TH135-specific mmpL and mmpS genes, mmpL5 (MAH_0778), mmpL5_5 (MAH_4506), mmpL6 (MAH_3375), mmpL family gene (MAH_0016), and mmpS4_1 (MAH_4505) (Table 2). Gene mmpS4_1 is located adjacent to mmpL5_5 within the strain TH135-specific region SR-10, suggesting that these genes work in a collaborative manner. In addition, MAH_0016 and MAH_0778 are in region SR-1 and region SR-2, respectively. The fact that the strain TH135 genome has more mmpL and mmpS genes than the strain 104 genome suggests differences in cell wall lipid composition between the strains.
Mycobacteria produce catalase to remove reactive oxygen species produced by host cells to ensure their survival after invading. M. tuberculosis and M. bovis carry katG-gene encoding catalase, which is an important determinant of their pathogenicity in mice and guinea pigs , . The katG gene also exists in the genomes of strains TH135 and 104. Interestingly, there is an additional catalase gene (MAH_4495) exclusive to the strain TH135 genome, and this gene is located in TH135-specific regions SR-10 (Table 2). Product of MAH_4495 may be involved in intracellular replication of strain TH135, although further studies are needed to determine it precise function.
Prediction of the Bacterial Factors that Mediate Pulmonary and Disseminated MAC Disease
M. avium strains that cause pulmonary disease are thought to be acquired via the respiratory route and invade through the respiratory mucosal membrane. Such strains are incorporated by phagocytosis and survive in alveolar macrophages where they proceed to cause pulmonary disease . In particular, the ability to replicate in professional phagocytes such as alveolar macrophages is crucial for the long-term survival of M. avium in lung tissue, and this appears to influence the establishment of chronic pulmonary disease. Our findings that strain TH135 specifically carries genes associated with bacterial survival in host cells, namely genes encoding MmpL and MmpS, or catalase, may be of particular importance for the establishment of pulmonary disease.
On the other hand, strains that cause disseminated disease are most likely acquired via the gastrointestinal route . More precisely, bacteria are acquired through the consumption of contaminated water or food, and they experience a number of environments during the course of infection; they endure the acidic pH of the stomach, reach the intestinal lumen, and then invade lining cells of the small intestine, especially the enterocytes of the terminal ileum. Bacteria that invade the lamina propria survive after phagocytosis by phagocytic cells and spread to the blood through lymphatic vessels before being taken up by the spleen and the liver. In these processes, bacterial invasion of lining cells of the small intestine is a crucial step in the establishment of disseminated disease in immunocompromised hosts, and strain 104-specific mce genes associated with cell invasion and bacterial survival in cells may play an important role in the invasion of intestinal epithelial cells. McGarvey and Bermudez reported that M. avium, which causes disseminated disease, exhibit higher cell invasion capability than M. intracellulare, which does not cause disseminated disease . Once functions of strain-specific genes are revealed, their roles in bacterial resistance against host defense will be elucidated in the future. This will lead to identification of bacterial factors associated with the pathological manifestations of MAC disease, which is of great significance in clinical applications.
Screening of Clinical Isolates for Genes Located in the Strain-specific Regions
To investigate the importance of genes in strain-specific regions, we screened 35 clinical isolates (including strain TH135) from HIV-negative patients with pulmonary MAC disease and 29 clinical isolates (including strain 104) from HIV-positive patients with disseminated MAC disease for these genes (Table 4 and Table S5). As shown in Table 4, MAH_2592 in the strain TH135-specific region SR-5 was found in 28.6% of isolates from pulmonary MAC disease patients but was absent in those from HIV-positive patients. The detection rate of MAH_1001 and MAH_4506 in region SR-3 and region SR-10, respectively, was significantly higher in clinical isolates from pulmonary MAC disease patients than in those from HIV-positive patients. MAH_0016 in region SR-1 was found exclusively in strain TH135. For genes in the strain 104-specific regions, MAV_1807 in region SR-16 was found in 20.7% of isolates from HIV-positive patients but was absent in those from pulmonary MAC disease patients. Although not statistically significant, the detection rate of MAV_2532 in region SR-19 was higher in isolates from HIV-positive patients than in those from pulmonary MAC disease patients. MAV_0264 and MAV_0482 in region SR-11 and region SR-12, respectively, were found exclusively in strain 104.
Thus, screening of clinical isolates for genes located in the strain-specific regions revealed that the detection rates of strain TH135-specific genes were generally high in clinical isolates from pulmonary MAC disease patients. On the other hand, the detection rates of strain 104-specific genes were generally high in clinical isolates from HIV-positive patients. These results suggest that the genes located in the strain-specific regions have a strong influence on the pathological manifestations of MAC disease. Further study is needed to investigate the relationship between MAC disease and other specific genes, in addition to the virulence-associated genes.
In conclusion, comparative genome analysis and screening of clinical isolates for specific genes showed that the M. avium subsp. hominissuis strains which cause pulmonary and disseminated disease possess genetically distinct features. Furthermore, it is thought that the acquisition of specific genes during strain evolution has played an important role in the pathogenesis of M. avium. In particular, strain TH135-specific virulence-associated genes may be involved in not only the establishment of pulmonary disease, but also the pathogenicity of M. avium. Therefore, comparing the presence of these specific genes between M. avium strains isolated in Japan and those isolated abroad is important to elucidate the prevalent genetic features of M. avium in each country. This may also lead to the discovery of factors promoting the spread of pulmonary disease caused by M. avium.
Primers used for detection of specific genes in M. avium clinical isolates.
List of strain TH135-specific regions and CDS found in each region.
List of strain 104-specific regions and CDS found in each region.
Comparison of mce- and esx-related genes in M. tuberculosis H37Rv, M. avium TH135, and M. avium 104 genomes.
We are grateful to Dr. Takami Shibayama and everyone from the clinical laboratory at the National Hospital Organization, Higashinagoya National Hospital. We also thank Masaki Niimi, Kazuhiro Kurokawa, Eimi Tanaka, and Hiroya Fuku for technical assistance.
Conceived and designed the experiments: KU HT KO TY TI KI T. Nakagawa T. Nikai. Performed the experiments: HT KU. Analyzed the data: KU HT MM. Contributed reagents/materials/analysis tools: HT KU KO. Wrote the paper: KU.
- 1. Ichiyama S, Shimokata K, Tsukamura M (1988) The isolation of Mycobacterium avium complex from soil, water, and dusts. Microbiol Immunol 32: 733–739.
- 2. von Reyn CF, Maslow JN, Barber TW, Falkinham JO, Arbeit RD (1994) Persistent colonization of notable water as a source of Mycobacterium avium infection in AIDS. Lancet 343: 1137–1141.
- 3. Falkinham JO, Norton CD, LeChevallier MW (2001) Factors influencing numbers of Mycobacterium avium, Mycobacterium intracellulare, and other mycobacteria in drinking water distribution systems. Appl Environ Microb 67: 1225–1231.
- 4. Nishiuchi Y, Maekura R, Kitada S, Tamaru A, Taguri T, et al. (2007) The recovery of Mycobacterium avium-intracellulare complex (MAC) from the residential bathrooms of patients with pulmonary MAC. Clin Infect Dis 45: 347–351.
- 5. Sakatani M (2005) The non-tuberculous mycobacteriosis. Kekkaku 80: 25–30.
- 6. Mijs W, de Haas P, Rossau R, Van der Laan T, Rigouts L, et al. (2002) Molecular evidence to support a proposal to reserve the designation Mycobacterium avium subsp avium for bird-type isolates and ‘M. avium subsp hominissuis’ for the human/porcine type of M. avium. Int J Syst Evol Micr 52: 1505–1518.
- 7. Turenne CY, Wallace R, Behr MA (2007) Mycobacterium avium in the postgenomic era. Clin Microbiol Rev 20: 205–229.
- 8. Turenne CY, Semret M, Cousins DV, Collins DM, Behr MA (2006) Sequencing of hsp65 distinguishes among subsets of the Mycobacterium avium complex. J Clin Microbiol 44: 433–440.
- 9. Tanaka E, Amitani R, Niimi A, Suzuki K, Murayama T, et al. (1997) Yield of computed tomography and bronchoscopy for the diagnosis of Mycobacterium avium complex pulmonary disease. Am J Resp Crit Care 155: 2041–2046.
- 10. Kajiki A (2011) Non-tuberculous mycobacteriosis. What has been coming out. Kekkaku 86: 113–125.
- 11. Dailloux M, Abalain ML, Laurain C, Lebrun L, Loos-Ayav C, et al. (2006) Respiratory infections associated with nontuberculous mycobacteria in non-HIV patients. Eur Respir J 28: 1211–1215.
- 12. Bodle EE, Cunningham JA, Della-Latta P, Schluger NW, Saiman L (2008) Epidemiology of nontuberculous mycobacteria in patients without HIV infection, New York City. Emerg Infect Dis 14: 390–396.
- 13. Moore JE, Kruijshaar ME, Ormerod LP, Drobniewski F, Abubakar I (2010) Increasing reports of non-tuberculous mycobacteria in England, Wales and Northern Ireland, 1995–2006. BMC Public Health 10: 612.
- 14. Tsuyuguchi K, Suzuki K, Matsumoto H, Tanaka E, Amitani R, et al. (2001) Effect of oestrogen on Mycobacterium avium complex pulmonary infection in mice. Clin Exp Immunol 123: 428–434.
- 15. Tanaka G, Shojima J, Matsushita I, Nagai H, Kurashima A, et al. (2007) Pulmonary Mycobacterium avium complex infection: association with NRAMP1 polymorphisms. Eur Respir J 30: 90–96.
- 16. Shojima J, Tanaka G, Keicho N, Tamiya G, Ando S, et al. (2009) Identification of MICA as a susceptibility gene for pulmonary Mycobacterium avium complex infection. J Infect Dis 199: 1707–1715.
- 17. Fratti RA, Chua J, Vergne I, Deretic V (2003) Mycobacterium tuberculosis glycosylated phosphatidylinositol causes phagosome maturation arrest. Proc Natl Acad Sci USA 100: 5437–5442.
- 18. Li ZM, Kelley C, Collins F, Rouse D, Morris S (1998) Expression of katG in Mycobacterium tuberculosis is associated with its growth and persistence in mice and guinea pigs. J Infect Dis 177: 1030–1035.
- 19. Gobin J, Moore CH, Reeve JR, Wong DK, Gibson BW, et al. (1995) Iron acquisition by Mycobacterium tuberculosis: isolation and characterization of a family of iron-binding exochelins. Proc Natl Acad Sci USA 92: 5189–5193.
- 20. Arruda S, Bomfim G, Knights R, Huimabyron T, Riley LW (1993) Cloning of an Mycobacterium tuberculosis DNA fragment associated with entry and survival inside cells. Science 261: 1454–1457.
- 21. Sugawara H, Ohyama A, Mori H, Kurokawa K (2009) Microbial genome annotation pipeline (MiGAP) for diverse users [abstract S-001]. In: Program and abstracts of the 20th Int Conference Genome Informatics. Kanagawa, Japan.
- 22. Ohyama A, Kurokawa K, Enai K, Saitoh H, Kanaya S, et al. (2006) Bioinformatics tool for genomic era: a step towards the in silico experiments-focused on molecular cloning. J Comp Aid Chem 7: 102–115.
- 23. Lowe TM, Eddy SR (1997) tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25: 955–964.
- 24. Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, et al. (2007) RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35: 3100–3108.
- 25. Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M (2006) ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res 34: D32–D36.
- 26. Uchiyama I (2003) MBGD: microbial genome database for comparative analysis. Nucleic Acids Res 31: 58–62.
- 27. Horan KL, Freeman R, Weigel K, Semret M, Pfaller S, et al. (2006) Isolation of the genome sequence strain Mycobacterium avium 104 from multiple patients over a 17-year period. J Clin Microbiol 44: 783–789.
- 28. Darling ACE, Mau B, Blattner FR, Perna NT (2004) Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14: 1394–1403.
- 29. Ichikawa K, Yagi T, Moriyama M, Inagaki T, Nakagawa T, et al. (2009) Characterization of Mycobacterium avium clinical isolates in Japan using subspecies-specific insertion sequences, and identification of a new insertion sequence, ISMav6. J Med Microbiol 58: 945–950.
- 30. Mahillon J, Leonard C, Chandler M (1999) IS elements as constituents of bacterial genomes. Res Microbiol 150: 675–687.
- 31. Tatusov RL, Natale DA, Garkavtsev IV, Tatusova TA, Shankavaram UT, et al. (2001) The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res 29: 22–28.
- 32. Casali N, Riley LW (2007) A phylogenomic analysis of the Actinomycetales mce operons. BMC Genomics 8: 60.
- 33. Chitale S, Ehrt S, Kawamura I, Fujimura T, Shimono N, et al. (2001) Recombinant Mycobacterium tuberculosis protein associated with mammalian cell entry. Cell Microbiol 3: 247–254.
- 34. El-Shazly S, Ahmad S, Mustafa AS, Al-Attiyah R, Krajci D (2007) Internalization by HeLa cells of latex beads coated with mammalian cell entry (Mce) proteins encoded by the mce3 operon of Mycobacterium tuberculosis. J Med Microbiol 56: 1145–1151.
- 35. Pandey AK, Sassetti CM (2008) Mycobacterial persistence requires the utilization of host cholesterol. Proc Natl Acad Sci USA 105: 4376–4380.
- 36. Klepp LI, Forrellad MA, Osella AV, Blanco FC, Stella EJ, et al. (2012) Impact of the deletion of the six mce operons in Mycobacterium smegmatis. Microbes Infect 14: 590–599.
- 37. Mah N, Perez-Iratxeta C, Andrade-Navarro MA (2010) Outer membrane pore protein prediction in mycobacteria using genomic comparison. Microbiology 156: 2506–2515.
- 38. Pajon R, Yero D, Lage A, Llanes A, Borroto CJ (2006) Computational identification of beta-barrel outer-membrane proteins in Mycobacterium tuberculosis predicted proteomes as putative vaccine candidates. Tuberculosis 86: 290–302.
- 39. Song H, Sandie R, Wang Y, Andrade-Navarro MA, Niederweis M (2008) Identification of outer membrane proteins of Mycobacterium tuberculosis. Tuberculosis 88: 526–544.
- 40. Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, et al. (1998) Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 396: 190–198.
- 41. Abdallah AM, Gey van Pittius NC, Champion PA, Cox J, Luirink J, et al. (2007) Type VII secretion-mycobacteria show the way. Nat Rev Microbiol 5: 883–891.
- 42. Sampson SL (2011) Mycobacterial PE/PPE proteins at the host-pathogen interface. Clin Dev Immunol 26: 1–10.
- 43. Daffé M, Draper P (1998) The envelope layers of mycobacteria with reference to their pathogenicity. Adv Microb Physiol 39: 131–203.
- 44. Cox JS, Chen B, McNeil M, Jacobs WR (1999) Complex lipid determine tissue specific replication of Mycobacterium tuberculosis in mice. Nature 402: 79–83.
- 45. Converse SE, Mougous JD, Leavell MD, Leary JA, Bertozzi CR, et al. (2003) MmpL8 is required for sulfolipid-1 biosynthesis and Mycobacterium tuberculosis virulence. Proc Natl Acad Sci USA 100: 6121–6126.
- 46. Deshayes C, Bach H, Euphrasie D, Attarian R, Coureuil M, et al. (2010) MmpS4 promotes glycopeptidolipids biosynthesis and export in Mycobacterium smegmatis. Mol Microbiol 78: 989–1003.
- 47. Tekaia F, Gordon SV, Garnier T, Brosch R, Barrell BG, et al. (1999) Analysis of the proteome of Mycobacterium tuberculosis in silico. Tuber Lung Dis 79: 329–342.
- 48. Domenech P, Reed MB, Barry CE (2005) Contribution of the Mycobacterium tuberculosis MmpL protein family to virulence and drug resistance. Infect Immun 73: 3492–3501.
- 49. Wilson TM, Delisle GW, Collins DM (1995) Effect of inhA and katG on isoniazid resistance and virulence of Mycobacterium bovis. Mol Microbiol 15: 1009–1015.
- 50. Inderlied CB, Kemper CA, Bermudez LEM (1993) The Mycobacterium avium complex. Clin Microbiol Rev 6: 266–310.
- 51. McGarvey JA, Bermudez LE (2001) Phenotypic and genomic analyses of the Mycobacterium avium complex reveal differences in gastrointestinal invasion and genomic composition. Infect Immun 69: 7242–7249.