Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identification of benthic diatoms isolated from the eastern tidal flats of the Yellow Sea: Comparison between morphological and molecular approaches

  • Sung Min An,

    Affiliation Marine Ecosystem and Biological Research Center, Korea Institute of Ocean Science & Technology, Ansan, Republic of Korea


  • Dong Han Choi,

    Affiliations Marine Ecosystem and Biological Research Center, Korea Institute of Ocean Science & Technology, Ansan, Republic of Korea, Department of Marine Biology, University of Science and Technology, Daejeon, Republic of Korea

  • Jung Ho Lee,

    Affiliation Department of Biology Education, Daegu University, Gyeongsan, Republic of Korea

  • Howon Lee,

    Affiliation Marine Ecosystem and Biological Research Center, Korea Institute of Ocean Science & Technology, Ansan, Republic of Korea

  • Jae Hoon Noh

    Affiliations Marine Ecosystem and Biological Research Center, Korea Institute of Ocean Science & Technology, Ansan, Republic of Korea, Department of Marine Biology, University of Science and Technology, Daejeon, Republic of Korea

Identification of benthic diatoms isolated from the eastern tidal flats of the Yellow Sea: Comparison between morphological and molecular approaches

  • Sung Min An, 
  • Dong Han Choi, 
  • Jung Ho Lee, 
  • Howon Lee, 
  • Jae Hoon Noh


Benthic diatoms isolated from tidal flats in the west coast of Korea were identified through both traditional morphological method and molecular phylogenetic method for methodological comparison. For the molecular phylogenetic analyses, we sequenced the 18S rRNA and the ribulose bisphosphate carboxylase large subunit coding gene, rbcL. Further, the comparative analysis allowed for the assessment of the suitability as a genetic marker for identification of closely related benthic diatom species and as potential barcode gene. Based on the traditional morphological identification system, the 61 isolated strains were classified into 52 previously known taxa from 13 genera. However, 17 strains could not be classified as known species by morphological analyses, suggesting a hidden diversity of benthic diatoms. The Blast search on NCBI’s Genebank indicated that the reference sequences for most of the species were absent for the benthic diatoms. Of the two genetic markers, the rbcL genes were more divergent than the 18S rRNA genes. Furthermore, a long branch attraction artefact was found in the 18S rRNA phylogeny. These results suggest that the rbcL gene is a more appropriate genetic marker for identification and classification of benthic diatoms. Considering their high diversity and simple shapes, and thus the difficulty associated with morphological classification of benthic diatoms, a molecular approach could provide a relatively easy and reliable classification system. However, this study suggests that more effort should be made to construct a reliable database containing polyphasic taxonomic data for diatom classification.


Diatoms are the most dominant taxa among the various microalgae and are known to account for ca. 40% of the total primary production in the ocean [1, 2]. Diatoms also play an important role in the biogeochemical cycles of carbon and silica [3]. In tidal flats, especially, benthic diatoms are the most dominant and diverse group and are key organisms that contribute to the preservation of the ecological functions of tidal flats such as primary production, nutrient cycling, and sediment stabilization [47]. Thus, the ecology and diversity of diatoms in tidal flats has received attention for a long time [811]. Although the study of diatom diversity has a relatively long history, overcoming the limitations of morphological classifications remains to be problematic. The small size and simple forms of benthic diatoms have made it difficult to study their diversity [1214]. Furthermore, since the classification system is based on morphological characteristics of the type specimen, it is difficult to determine whether species having a similar form that appear in a variety of environments are the same species or different ones.

Since molecular techniques were applied to diatom research for the first time in the 1980s [15], molecular phylogenetic studies have been widely performed to identify and classify diatoms to overcome morphological limitations [1619]. DNA barcoding is a method for α-taxonomy using molecular analyses based on differences in DNA sequences according to species. Therefore, unique DNA sequences can be referred to as tags or barcodes for each taxon [20]. Using DNA barcoding techniques, even morphologically similar strains can be identified at the species level. These molecular phylogenetic analyses have also enabled the rapid, convenient, and accurate classification of diatoms and have thus contributed considerably to studies on the diversity of diatoms.

Specific marker genes are used for molecular phylogenetic analyses. Different DNA regions within the nuclear rRNA gene, as well as mitochondrial and chloroplast genes, have been used for the phylogenetic analysis of diatoms [21]. Among them, nuclear 18S rRNA has been the most widely used [20, 22, 23]. The ribulose-1,5-bisphosphate carboxylase large subunit (rbcL) gene in chloroplasts has also been used for the phylogenetic study of diatoms [16, 2426]. In addition, the cytochrome c oxidase subunit I (coxI), internal transcribed spacer (ITS), and ribulose-1,5-bisphosphate carboxylase small subunit (rbcS) have been used for the phylogenetic study of diatoms [16, 21, 27, 28]. However, these genetic makers have fewer records in public databases compared with the 18S rRNA gene.

In this study, morphological and molecular taxonomic characteristics of benthic diatoms isolated from tidal flats were investigated to evaluate the applicability of molecular phylogenetic approaches using 18S rRNA and rbcL genes. In addition, we present morphological as well as genetic information on the benthic diatoms. Although this research does not reveal the complete diversity of diatoms in tidal flats, it will be helpful in further studies on the diversity of benthic diatoms in various environments throughout the world.

Materials and methods

Collection, isolation and development of new strains

Benthic diatoms were collected mainly from tidal flats of Geunso Bay in Taean (36° 44' 12.06" N 126° 10' 47.52" E), Eulwang-ri (37° 26' 43.67" N 126° 22' 18.07" E), and saline Sihwa (37° 18' 46.73" N 126° 36' 32.64" E) along the west coast of Korea (Fig 1). The numbers of strains obtained in each region were 53 in Geunso Bay and four in Sihwa, and four in Eulwang-ri. Most samples were obtained in the Geunso Bay where regular monthly surveys had been conducted from 2009. Geunso Bay is a semi-enclosed bay with an area of 87 km2, and the water depth at high tide is 2–4 m depending on the area. There is no inflow river, and facies are predominantly sandy silt. The Oi tidal flat, where Sihwa station is located, has an area of 0.025 km2, and the facies are predominantly silty sand. Eulwang-ri is a sandy facies and there is a beach near the sampling station.

Fig 1. Map of the study sampling locations.

(TA: Taean, SH: Sihwa, EW: Eulwang-ri).

To obtain sediment samples containing diatoms, the surface of the tidal flat was scratched to a depth of ca. 2 mm and the sediment collected in a conical tube. Samples were transported to the laboratory under refrigerated conditions and then incubated at ± 2°C of the in situ temperature. Diatom strains were isolated within 1 day of sampling. A single-diatom cell was isolated under an inverted microscope (Eclipse Ti-U; Nikon, Tokyo, Japan) using a glass Pasteur pipette and placed into a 24-well plate containing f/2 medium with silicate (Sigma-Aldrich, St. Louis, MO, USA). After confirmation of monoclonal growth, each culture was transferred to a new tissue culture flask (Falcon, Cockeysville, MD, USA) containing 35 ml of fresh medium for one week. Several cultures suspected to be a mixture were further isolated by a dilution method [29]. All strains were incubated at 15°C under a 12:12 h light-dark cycle. Illumination was provided by a fluorescent lamp with an irradiance of ca. 100 μmol photons m-2s-1. The strains were transferred to fresh medium every 2 or 3 weeks. Research activities at the sampling areas of this study did not require specific permission because the areas are not restricted or ecosystem protected. Endangered and protected species do not live in the study area and thus were not included in the survey.

Morphological observations

Monoclonal cultures of benthic diatom strains were identified to the genus or species level by morphological features based on observations under light and scanning electron microscopy. For the light microscopy examination, diatom cultures were treated with acid to prepare cleaned frustules [30], and then permanent slides were made using Mountmedia (Wako Pure Chemical Industries, Osaka, Japan). The slides were examined using light microscopy under a ×100 oil immersion objective lens (Eclipse 80i; Nikon). For scanning electron microscopy examination, diatom cells fixed with Lugol’s solution were filtered onto a polycarbonate filter (diameter of 25 mm; pore size of 1 or 2 μm) and then washed with distilled water. The filter papers were dehydrated in a graded ethanol series (10%, 25%, 50%, 75%, 90%, and 100%) and dried using tetramethylsilane (Sigma-Aldrich, St. Louis, MO, USA). Finally, the samples were mounted onto stubs and sputter-coated with platinum. Observations were performed with a Hitachi S–4300 scanning electron microscope (Hitachi, Tokyo, Japan). The previous studies were referred to for instructions on morphological comparisons [3141]. Strains that did not match those in the published literature were treated as unidentified species.

DNA extraction, PCR and sequencing

For DNA extraction, the cultured strain (100 μl) was harvested by centrifugation at 14,000 × g for 1 min and the cell pellet was resuspended in 1 ml of sterilized STE (sodium chloride-Tris-EDTA, pH 7.8) buffer solution. Two cycles of freezing (–80°C) and thawing (95°C) were followed by vigorous vortexing with sterilized silica/zirconium beads to break the cells. To remove cell debris, the lysate was centrifuged at 8,000 × g for 1 min. The supernatant was dispensed into a clean tube and used as template DNA for PCR.

PCR amplification was performed using two primer sets: Diatom9F (5′–TGTGGGAGAGGGGAAATCAAG–3′) [42] and EukB-R (5′–TGATCCTTCTGCAGGTTCACCAC–3′) [15] for 18S rDNA, and DPrbcL1 (5′–AAGGAGAAATHAATGTCT–3′) and DPrbcL7 (5′–AARCAACCTTGTGTAAGTCTC–3′) for the rbcL gene [43]. These primers produced PCR products of approximately 1,600 bp and 1,550 bp, respectively. PCR was performed in a total volume of 30 μl, containing 1.0 μl of template DNA, 3 μl of 10 × Ex Taq buffer, 2.4 μl of dNTPs (10 mM), 0.5 μl of each primer (10 μM), and 0.2 μl of TaKaRa Ex Taq polymerase (5 U μl−1; Takara, Otsu, Japan). PCR was conducted using the following conditions: PCR of 18S rRNA was conducted with initial denaturation at 94°C for 5 min, 34 cycles of main amplification (94°C for 45 sec, 55°C for 55 sec, 72°C for 2 min), and final extension at 72°C for 10 min. PCR of rbcL was conducted with initial denaturation at 94°C for 3 min, 35 cycles of main amplification (94°C for 1 min, 55°C for 1 min, 72°C for 1.5 min), and final extension at 72°C for 10 min. PCR products were purified using the Accuprep PCR Purification Kit (Bioneer, Daejeon, South Korea) and sent for commercial sequencing at Macrogen (Seoul, South Korea). The electrophenogram outputs for each product were edited and assembled using the ChromasPro v.1.45 program ( and Vector NTI Advance 11 (Invitrogen Corp., Carlsbad, CA, USA). The sequences obtained in this study were deposited in GenBank and the accession numbers of the sequences are shown in Table 1.

Table 1. Strains of the benthic diatoms isolated in this study, information on their collection, and accession numbers of 18S rRNA and rbcL gene sequences.

Species names were determined by morphological analyses.

Sequence alignment and phylogenetic analyses

For phylogenetic analysis, 18S rRNA and rbcL sequences from diatoms were retrieved in GenBank ( After excluding uncultured and environmental clone sequences, 1,853 sequences of the 18S rRNA gene and 1,473 sequences of the rbcL gene were aligned with the sequences obtained in present study using the ARB program [44] and corrected manually. Two Ochrophyta species (Nannochloropsis salina D.J. Hibberd and Ochromonas danica E.G. Pringsheim) were used as an outgroup. Neighbor–joining (NJ) and maximum–parsimony (MP) trees were constructed using MEGA 5.2 [45]. Maximum–likelihood (ML) trees were constructed using Randomized Axelerated Maximum Likelihood (RAxML) v.8.2.1 [46]. We used the “–f a” option for rapid bootstrap analysis and the best likelihood tree search using “–# 100” with default settings, namely, “–m GTRGAMMA” for the substitution model with rate heterogeneity, “–i” for the automatically optimized SPR rearrangement for heuristic search, and “–c” for 25 distinct rate categories. The robustness of each clade was assessed by further bootstrap analyses (1,000 replications) under the NJ and MP criteria using MEGA v.5.2 [45].


Morphological observations

The 61 diatom isolates were identified by morphometric characteristics using light and scanning electron microscopy and their detailed information is shown in Table 2. All strains were raphid diatoms and classified into 3 orders, 6 families, 13 genera, and 52 taxa (36 known and 16 unknown taxa; Fig 2). Forty-two strains could be morphologically identified to the species level (Table 2). Most isolates belonged to Bacillariaceae (25 isolates under 4 genera, 22 taxa) or Naviculaceae (23 isolates under 3 genera, 20 taxa), and the rest belonged to 4 classes, namely, Berkeleyaceae (3 isolates under 2 genera, 3 taxa), Entomoneidaceae (6 isolates under Entomoneis, 4 taxa), Pleurosigmataceae (3 isolates under 2 genera, 2 taxa), and Surirellaceae (1 isolate under 1 taxon). Navicula (17 taxa) and Nitzschia (16 taxa) were abundant in new isolates, followed by Entomoneis (4 taxa), Cylindrotheca (3 taxa), Bacillaria (2 taxa), Berkeleya (2 taxa), and Halsea (2 taxa). Based on the morphological observations, 42 strains (69%) were identified as 35 known taxa; however, 19 strains (31%) remained as 16 unidentified taxa, namely, 6 Navicula, 3 Nitzschia, 3 Entomoneis, and 1 each for Bacillaria, Cylindrotheca, Pleurosigma and Seminavis. The recognized identities and observed morphometric characteristics of the strains are summarized in Table 2; light micrographs of diatoms of the various taxa are shown in Figs 36.

Fig 2. Pie chart showing morphological affiliations of the strains isolated in this study.

Fig 3. Light micrograph of diatoms isolated in this study belonging to Cylindrotheca, Nitzschia, and Tryblionella.

(a) Nitzschia paleacea TA406. (b) N. paleaeformis TA394. (c) N. dubiiformis SH366. (d) N. dissipata TA44. (e) N. dubia TA37. (f) N. pellucida EW229. (g) Bacillaria sp.1 SH349. (h) N. sigmaformis TA311. (i) Nitzschia sigma TA341 (400x). (j) Nitzschia sp.1 Dillu16. (k) Nitzschia sp.2 TA61. (l) N. liebetruthii TA353. (m) Nitzschia sp.4 TA409. (n) Tryblionella apiculate TA-85. (o) Cylindrotheca closterium TA256. (p) C. gracilis TA46 (400x). (q) Cylindrotheca sp.1 TA198. (r) Bacillaria paxillifer EW234. (s) Nitzschia bergii TA139. (t) N. ligowskii TA426. (u) N. pusilla TA420. (v) N. aequorea Dillu38. Scale bar = 10 μm. Note that scale bars of 9 and 16 are inside of the picture.

Fig 4. Light micrograph of diatoms isolated in this study belonging to Berkeleya, Gyrosigma, Haslea, Parlibellus, and Pleurosigma.

(a) Gyrosigma limosum TA152. (b) Pleurosigma sp.1 TA34. (c) Haslea pseudostrearia TA280. (d) H. nipkowii SH381. (e) Parlibellus delognei TA387. (f) Berkeleya rutilans TA440. (g) B. fennica TA424. Scale bar = 10 μm.

Fig 5. Light micrograph of diatoms isolated in this study belonging to Navicula and Seminavis.

(a) Navicula gregaria TA289. (b) N. agatkae TA291. (c) Navicula incertata TA414. (d) Navicula sp.1 TA298. (e) Navicula sp.5 TU3. (f) Navicula sp.3 EW220. (g) N. ramosissima TA316. (h) N. flagellifera TA105. (i) Navicula sp.2 TA64. (j) N. salinicola TA204. (k) N. perminuta TA441. (l) N. trivialis TA83. (m) N. salinarum TA402. (n) N. cf. salinarum TA407. (o) N. salinarum var. minima TA416. (p) Navicula sp.4 TA323. (q) Navicula sp.6 TA446. (r) Seminavis sp.1 TA305. Scale bar = 10 μm.

Fig 6. Light micrograph of diatoms isolated in this study belonging to Entomoneis and Petodictyon.

(a, b) Entomoneis paludosa TA208. (c) Entomoneis sp.2 SH373. (d, e) Entomoneis sp.3 EW239. (f, g) Entomoneis sp.1 TA410. (h) Petrodictyon gemma TA201. Scale bar = 10 μm.

Table 2. Morphometric data and classification based on the morphology of diatom strains isolated in this study.

Species name and sequence identity of the closest relative found in GenBank using BLASTn.

Molecular-based identification

Both 18S rRNA and rbcL genes from 61 culture strains were sequenced successfully. The BLASTn results of each 18S rRNA and rbcL sequence are given in Table 2 according to the best matched species and sequence identity. For many strains, the closest relative based on the BLAST search differed from identification based on morphology. The morphological and genetic classification results were consistent for only nine strains with >98.7% identity to their closest relatives based on their 18S rRNA gene sequences (Table 2). Similarly, morphological and genetic identification using the rbcL sequences were consistent only in six strains with relatively high sequence identities, ranging from 94.3% to 99.5% (Table 2).

From the phylogenetic trees, phylogenetic relationships among the isolates can be determined (Figs 79). In total, 110 sequences of the 18S rRNA gene and 93 sequences of the rbcL gene were used for the phylogenetic analysis. In the phylogenetic trees of the rbcL gene, most of strains were separated in accordance with their taxonomic positions. In contrast, some strains were not consistent with the morphological classification in the 18S rDNA phylogenies. Petrodictyon gemma TA201, belonging to Surirellaceae, clustered with Entomoneis ornata strain 14A, belonging to Entomoneidaceae, with a long branch in the ML tree of 18S rDNA (Fig 7). Additionally, two Entomoneis paludosa strains, TA208 and TA263, showed another long branch (Fig 7). Unlike the ML tree, however, P. gemma and the two E. paludosa strains clustered together with a long branch in the NJ and MP phylogenies. Thus, in the 18S rDNA tree, the phylogenetic positions of these species were unstable. In the Naviculales, despite the fact that the morphological features were similar to those of naviculoids, the tube-dwelling diatoms Berkeleya and Parlibellus did not cluster in the naviculoid group, but rather in asymmetrical biraphid diatoms with a low bootstrap value in the 18S rDNA phylogenies (Fig 7). In addition, several different species were not clearly differentiated in the 18S rDNA phylogenies, such as Berkeleya rutilans TA440 and Berkeleya fennica TA424, which had a very low sequence distance (Fig 7, Table 2). A similar low resolution was also found among Navicula salinarum TA402, Navicula trivialis TA83, and N. cf. trivialis TA407 (Fig 8).

Fig 7.

Phylogenetic trees obtained using 18S rRNA (a) and rbcL (b) gene sequences of 61 culture strains. Bootstrap values obtained by neighbor–joining, maximum–likelihood, and maximum–parsimony methods are shown on the nodes. Expanded tree of Navicula sensu stricto and Nitzschia sensu stricto are shown in Figs 8 and 9, respectively.

Fig 8.

Phylogenetic tree of Navicula sensu stricto obtained using 18S rRNA (a) and rbcL (b) gene sequences. Bootstrap value obtained by neighbor–joining, maximum–likelihood, and maximum–parsimony methods are shown on the nodes.

Fig 9.

Phylogenetic tree of Nitzschia sensu stricto obtained using 18S rRNA (a) and rbcL (b) gene sequences. Bootstrap value obtained by neighbor–joining, maximum–likelihood, and maximum–parsimony methods are shown on the nodes.

Using the sequences obtained in this study, we analyzed divergence levels of the 18S rRNA and rbcL genes (Table 3). Although the divergence levels of 18S rRNA genes were higher than those of rbcL genes in the genus Entomoneis due to long branches, the genetic distance of the rbcL gene within the genus was, on average, double that of the 18S rRNA gene. Furthermore, the genetic distance of rbcL was three times higher than that of 18S rRNA in two dominant benthic genera, Navicula and Nitzschia.

Table 3. Nucleotide sequence distances of the 18s rRNA and rbcL genes within a genus according to Jukes and Cantor [47] model.


In this study, we attempted to identify and classify benthic diatoms by the polyphasic approach using both morphological characteristics and molecular markers and suggested that molecular approach using rbcL gene could become a better alternative to traditional morphological classification approach. Despite a long history of taxonomic studies on benthic diatoms, overcoming the difficulties associated with identification and classification of diatoms is a major challenge because of their small size and morphological similarities. In the process of identifying the strains obtained in this study, many strains were not morphologically identified at the species level due to these difficulties. Although more strains might be identifiable by a thorough literature review and some may be confirmed to be a new species, it is evident that morphometric classification is a laborious and time-consuming procedure. Some previous studies avoided identification at the species level or dealt only with the community dynamics of benthic diatoms [12, 13]. Therefore, the community structure of diatoms and their distribution in tidal flats have not been clearly elucidated [48]. To reveal easily and quickly the hidden diversity of benthic diatoms, largely attributed to their very small and similar morphologies, the development of molecular barcoding techniques is urgently needed. To enable this, it is necessary to construct a reliable genetic database.

The quality of a database has a direct and absolute influence on the applicability and efficiency of DNA barcoding techniques [49]. Currently, genetic information on most species could not be found in GenBank, indicating that the database is still insufficient, and that molecular taxonomic studies on benthic diatoms are limited. At the time of writing, the numbers of 18S rDNA and rbcL gene sequences deposited in GenBank are 4,775 and 3,099, and the number of species are reduced to 811 and 709, respectively. Despite the fact that extant diatoms are estimated to include 30,000–100,000 species [50], there is no genetic information on the majority of such species. Owing to the limited data available in GenBank, the closest relatives of most 18S rDNA sequences did not match the classifications by morphological identification (Table 2). These inconsistencies were more apparent in the case of the rbcL gene.

In this study, six groups of diatoms, namely, Bacillariaceae, Naviculaceae, Pleurosigmataceae, Berkeleyaceae, Entomoneidaceae, and Surirellaceae, were clearly distinguished and formed monophyletic groups in the phylogenetic trees of rbcL gene. In the 18S rDNA analyses, despite a morphological difference, some diatom sequences showed high similarity (more than 99%) to those of other species. These relatively high sequence similarities might have been due to either misidentification of records deposited in GenBank or low resolution of the 18S rDNA gene [18, 19]. However, a relatively low sequence distance within a genus shows that 18S rDNA is not an appropriate genetic marker to differentiate diatom species clearly, as is seen in the case of lower resolution among species and polyphyletic characteristics of several species (Table 2). For example, Navicula salinarum TA402, N. cf. salinarum TA407, and N. trivialis TA83 are similar but morphologically different species. N. trivialis TA83 has subrostrate apices and a central area that is bound by mostly shortened striae, whereas N. salinarum TA402 has rostrate apices and a central area that is formed by alternating long and short striae [31, 33]. However, the 18S rDNA sequences of these species are almost identical, and therefore cannot be clearly distinguished from each other (Fig 8). Similarly, Berkeleya fennica, which can be distinguished by its smaller and denser striae (over 30/10 μm) from B. rutilans [40], were not clearly differentiated from B. rutilans in the 18S rDNA phylogenetic tree. In addition, the Surirelloid diatom Petrodictyon gemma was clustered with Entomoneis by a long branch in the 18S rDNA phylogeny. This long branch attraction artefact was also found in the 18S rDNA phylogenies of Haslea nipkowii and Neidium affine [51], indicating that unusually rapid evolutionary events have occurred in the 18S rRNA genes of some benthic diatoms [52]. In this respect, it is apparent that the 18S rRNA gene of some benthic diatoms has undergone unusually rapid evolutionary changes. Thus, although 18S rRNA has been widely used in phylogenetic studies on diatoms and has the largest database compared with other genetic markers [20, 22, 23], it is unsuitable as a marker for the study of diatom biodiversity because of its low resolution [20].

Conversely, the rbcL gene varies markedly compared with 18S rDNA [16]. Consistently in this study, the rbcL gene showed higher divergence levels than those of the 18S rRNA gene, with a few exceptions in Entomoneis and Haslea, which were supposed to have undergone rapid evolutionary changes in 18S rDNA (Figs 7 and 8). Furthermore, long branch artefacts were not found among the rbcL phylogeny. In addition, the rbcL gene, a plastid–encoded gene, is advantageous in its use as a genetic marker because of its high PCR success rate (i.e., ease of amplification), simplicity of alignment, and low susceptibility to interference by heterotrophic contaminants [53]. However, the deficiencies in databases must still be addressed. Hamsher et al. [54] reported that the range of divergence in the rbcL gene sequence among species in the genus Sellaphora was 0.14–0.73%. Also, Kermarrec et al. [55] suggested 99% and 98% rbcL gene sequence identities as the thresholds for species- and genus-level classifications, respectively. However, most strains obtained in this study shared a sequence identity of 97% or less with sequences in the GenBank database. These results indicate that much of the necessary information remains unknown. However, it is still clear that the rbcL gene would be more appropriate than 18S rDNA for the molecular taxonomy and phylogenetic analyses of benthic diatoms.

Despite the ecological importance of benthic diatom community, their identification and classification systems still need to be improved. In this study, we showed that a large proportion of diatoms could not be identified by morphological characteristics and that genetic information should be expanded for molecular phylogenetic analyses. Furthermore, rbcL gene is suggested as a superior genetic marker to 18S rRNA gene to identify and phylogenetically classify benthic diatoms. The huge number of diatom species estimated in various environments suggests a need for more efforts to construct a reliable database containing polyphasic taxonomic data.


We thank anonymous reviewers for providing constructive comments and Hwa Young Lee and Seong Jun Chun for help with sampling and algal culturing. We also thank Dr. Eun Chan Yang for his helpful comments on a previous version of this manuscript.

Author Contributions

  1. Conceptualization: DHC JHN SMA JHL.
  2. Data curation: SMA.
  3. Formal analysis: SMA DHC JHN.
  4. Funding acquisition: JHN.
  5. Investigation: SMA HWL.
  6. Methodology: SMA.
  7. Project administration: JHN.
  8. Resources: SMA.
  9. Supervision: JHN.
  10. Validation: DHC JHN SMA.
  11. Visualization: SMA.
  12. Writing – original draft: SMA.
  13. Writing – review & editing: DHC JHN.


  1. 1. Berger WH, Wefer G. Productivity of the glacial ocean: discussion of the iron hypothesis. Limnol Oceanogr. 1991;36: 1899–1918.
  2. 2. Mann DG. The species concept in diatoms. Phycologia. 1999;38: 437–495.
  3. 3. Dugdale RC, Wilkerson FP, Minas HJ. The role of silicate pump in driving new production. Deep Sea Res I. 1995;42: 697–719.
  4. 4. Gowda G, Gupta T, Rajesh K, Gowda H, Lingadhal C, Ramesh A. Seasonal distribution of phytoplankton in Nethravathi estuary, Mangalore. J Mar Biol Ass India. 2001;43: 31–40
  5. 5. Admiraal W. The ecology of estuarine sediment-inhabiting diatoms. Prog Phycol Res. 1984;3: 269–322.
  6. 6. Underwood GJC, Kromkamp J. Primary production by phytoplankton and microphytobenthos in estuaries. Adv Ecol Res. 1999;29: 93–153.
  7. 7. Haubois AG, Sylvestre F, Guarini JM, Richard P, Blanchard GF. Spatio-temporal structure of the epipelic diatom assemblage from an intertidal mudflat in Marennes-Oléron Bay, France. Estuar Coast Shelf Sci. 2005;64: 385–394.
  8. 8. Hustedt F. Marine littoral diatoms of Beaufort, North Carolina. Duke Univ Mar Stat Bull. 1955;6: 1–67.
  9. 9. Smyth JC. A study of the benthic diatoms of Loch Sween (Argyll). J Ecol. 1955;43: 149–171.
  10. 10. Round FE. Studies on Bottom-Living Algae in Some Lakes of the English Lake District: Part II. The Distribution of Bacillariophyceae on the Sediments. J Ecol. 1957;45: 343–360.
  11. 11. Round FE. Studies on Bottom-Living Algae in Some Lakes of the English Lake District: IV. The Seasonal Cycles of the Bacillariophyceae. J Ecol. 1960;48: 529–547.
  12. 12. Sullivan M, Currin C. Community Structure and Functional Dynamics of Benthic Microalgae in Salt Marshes. In: Concepts and Controversies in Tidal Marsh Ecology (Ed. by Weinstein M. & Kreeger D.). Netherlands: Springer; 2000. pp. 81–106.
  13. 13. Brotas V, Plante-cuny MR. The use of HPLC pigment analysis to study microphytobenthos communities. Acta Oecol. 2003;24: S109–S115.
  14. 14. Underwood GJC, Barnett M. What determines species composition in microphytobenthic biofilms? In: Functioning of microphytobenthos in estuaries (Ed. by Kromkamp J.). Amsterdam: Royal Netherlands Academy of Arts and Sciences; 2006. pp. 121–138.
  15. 15. Medlin L, Elwood HJ, Stickel S, Sogin ML. The characterization of enzymatically amplified eukaryotic 16S-like rRNA-coding regions. Gene. 1988;71: 491–499. pmid:3224833
  16. 16. Evans KM, Wortley AH, Mann DG. An assessment of potential diatom "barcode" genes (cox1, rbcL, 18S and ITS rDNA) and their effectiveness in determining relationships in Sellaphora (Bacillariophyta). Protist. 2007;158: 349–364. pmid:17581782
  17. 17. Jahn R, Zetzsche H, Reinhardt R, Gemeinholzer B. Diatoms and DNA barcoding: A pilot study on an environmental sample. In: Proceedings of the 1st Central European diatom meeting. Berlin: Freie Universität; 2007. pp. 63–68.
  18. 18. Mann DG, Sato S, Trobajo R, Vanormelingen P, Souffreau C. DNA barcoding for species identification and discovery in diatoms. Cryptogamie Algol. 2010;31: 557–577.
  19. 19. Moniz MBJ, Kaczmarska I. Barcoding of Diatoms: Nuclear Encoded ITS Revisited. Protist. 2010;161: 7–34. pmid:19674931
  20. 20. Beszteri B, Ács É, Makk J, Kovács G, Márialigeti K, Kiss KT. Phylogeny of six naviculoid diatoms based on 18S rDNA sequences. Int J Syst Evol Microbiol. 2001;51: 1581–1586. pmid:11491361
  21. 21. Moniz MBJ, Kaczmarska I. Barcoding diatoms: Is there a good marker?. Mol Ecol Resour. 2009;9: 65–74. pmid:21564966
  22. 22. Jones HM, Simpson GE, Stickle AJ, Mann DG. Life history and systematics of Petroneis (Bacillariophyta), with special reference to British waters. Eur J Phycol. 2005;40: 61–87.
  23. 23. Sato S, Kooistra WH, Watanabe T, Matsumoto S, Medlin LK. A new araphid diatom genus Psammoneis gen. nov.(Plagiogrammaceae, Bacillariophyta) with three new species based on SSU and LSU rDNA sequence data and morphology. Phycologia. 2008; 47: 510–528.
  24. 24. Pniewski F, Friedl T, Latała A. Identification of diatom isolates from the Gulf of Gdańsk: testing of species identifications using morphology, 18S rDNA sequencing and DNA barcodes of strains from the Culture Collection of Baltic Algae (CCBA). Oceanological and Hydrobiological Studies, 2010;39: 3–20.
  25. 25. Amato A, Kooistra WHCF, Ghiron JHL, Mann DG, Proschold T, Montresor M. Reproductive isolation among sympatric cryptic species in marine diatoms. Protist. 2007;158: 193–207. pmid:17145201
  26. 26. Trobajo R, Mann DG, Clavero E, Evans KM, Vanormelingen P, Mcgregor RC. The use of partial cox1, rbcL and LSU rDNA sequences for phylogenetics and species identification within the Nitzschia palea species complex (Bacillariophyceae). Eur J Phycol. 2010;45: 413–425.
  27. 27. Ehara M, Inagaki Y, Watanabe KI, Ohama T. Phylogenetic analysis of diatom coxI genes and implications of a fluctuating GC content on mitochondrial genetic code evolution. Curr Genet. 2000;37: 29–33. pmid:10672441
  28. 28. Delaney JA, Ulrich RM, Paul JH. Detection of the toxic marine diatom Pseudo-nitzschia multiseries using the RuBisCO small subunit (rbcS) gene in two real-time RNA amplification formats. Harmful Algae. 2011;11: 54–64.
  29. 29. Choi DH, Noh JH. Phylogenetic diversity of Synechococcus strains isolated from the East China Sea and the East Sea. Fems Microbiol Ecol. 2009;69: 439–448. pmid:19624741
  30. 30. Hendey N. The permanganate method for cleaning freshly gathered diatoms. Microscopy. 1974;32: 423–426.
  31. 31. Patrick R, Reimer CW. The diatoms of the United States: exclusive of Alaska and Hawaii vol. 2 Part 1: Entomoneidaceae, Cymbellaceae, Gomphonemaceae, Epithemiaceae. Pennsylvania: Academy of Natural Sciences of Philadelphia; 1975.
  32. 32. Lobban CS. Marine tube-dwelling diatoms of eastern Canada: descriptions, checklist, and illustrated key. Can J Bot. 1984;62: 778–794.
  33. 33. Krammer K, Lange-Bertalot H. Bacillariophyceae 1. Teil: Naviculaceae. In: Süßwasserflora von Mitteleuropa Band 2/1. Heidelberg: Spektrum Akademischer Verlag; 1986.
  34. 34. Krammer K, Lange-Bertalot H. Bacillariophyceae 2. Teil: Bacillariaceae, Epithemiaceae, Surirellaceae. In: Süßwasserflora von Mitteleuropa Band 2/2. Heidelberg: Spektrum Akademischer Verlag; 1988.
  35. 35. Sterrenburg FAS, Underwood GJC. Studies on the Genera Gyrosigma and Pleurosigma (Bacillariophyceae). The Marine "Gyrosigma spenceri" Records: Gyrosigma limosum Sterrenburg et Underwood nov. sp. Proc Acad Nat Sci Philadelphia. 1997;148: 165–169.
  36. 36. Krammer K, Lange-Bertalot H. Bacillariophyceae English and French translation of the keys. In: Süßwasserflora von Mitteleuropa Band 2/5. Heidelberg: Spektrum Akademischer Verlag; 2000.
  37. 37. Witkowski A, Lange-Bertalot H, Metzeltin D. Diatom flora of marine coasts I. Iconographia Diatomologica 7. Königstein: Koeltz Scientific Books; 2000.
  38. 38. Witkowski A, Lange-Bertalot H, Kociolek JP, Ruppel M, Wawrzyniak-Wydrowska B, Bak M, et al. Four new species of Nitzschia sect. Tryblionella (Bacillariophyceae) resembling N. parvula. Phycologia. 2004;43: 579–595.
  39. 39. Massé G, Rincé Y, Cox E, Allard G, Belt ST, Rowland SJ. Haslea salstonica sp. nov. and Haslea pseudostrearia sp. nov. (Bacillariophyta), two new epibenthic diatoms from the Kingsbridge estuary, United Kingdom. C R Acad Sci. 2001;324: 617–626.
  40. 40. Antoniades D, Hamilton PB, Douglas MSV, Smol JP. Diatoms of North America: the freshwater floras of Prince Patrick, Ellef Ringnes and northern Ellesmere Islands from the Canadian Arctic Archipelago. Iconographia Diatomologica vol. 17. Koenigstein: Koeltz Scientific Books; 2008.
  41. 41. Poulin M, Massé G, Belt ST, Delavault P, Rousseau F, Robert JM, et al. Morphological, biochemical and molecular evidence for the transfer of Gyrosigma nipkowii Meister to the genus Haslea (Bacillariophyta). Eur J Phycol. 2004;39: 181–195.
  42. 42. Lynch ED, Lee MK, Morrow JE, Welcsh PL, León PE, King MC. Nonsyndromic Deafness DFNA1 associated with mutation of a human homolog of the Drosophila gene diaphanous. Science. 1997;278: 1315–1318. pmid:9360932
  43. 43. Daugbjerg N, Andersen RA. A molecular phylogeny of the heterokont algae based on analyses of chloroplast-encoded rbcL sequence data. J Phycol. 1997;33: 1031–1041.
  44. 44. Ludwig W, Strunk O, Westram R, Richter L, Meier H, Buchner A, et al. ARB: a software environment for sequence data. Nucleic Acids Res. 2004;32: 1363–1371. pmid:14985472
  45. 45. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10): 2731–2739. pmid:21546353
  46. 46. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30: 1312–1313. pmid:24451623
  47. 47. Jukes TH, Cantor CR. Evolution of protein molecules. In: Mammalian protein metabolism (Ed. by Munro HN.). New York: Academic Press; 1969. pp. 21–132.
  48. 48. Ribeiro LLCS. Intertidal benthic diatoms of the Tagus estuary: taxonomic composition and spatial-temporal variation. Thesis, Universidade de Lisboa. 2010. Available:
  49. 49. Lang I, Kaczmarska I. A protocol for a single-cell PCR of diatoms from fixed samples: method validation using Ditylum brightwellii (T. West) Grunow. Diatom Res. 2011;26: 43–49.
  50. 50. Mann DG, Vanormelingen P. An Inordinate Fondness? The Number, Distributions, and Origins of Diatom Species. J Eukaryot Microbiol. 2013;60: 414–420. pmid:23710621
  51. 51. Bruder K, Medlin LK. Morphological and molecular investigations of naviculoid diatoms. II. Selected genera and families. Diatom Res. 2008;23: 283–329.
  52. 52. Felsenstein J. Cases in which parsimony or compatibility methods will be positively misleading. Syst Biol. 1978;27: 401–410.
  53. 53. MacGillivary ML, Kaczmarska I. Survey of the efficacy of a short fragment of the rbcL gene as a supplemental DNA barcode for diatoms. J Eukaryot Microbiol. 2011;58: 529–536. pmid:22092527
  54. 54. Hamsher SE, Evans KM, Mann DG, Poulíčková A, Saunders GW. Barcoding diatoms: exploring alternatives to COI-5P. Protist. 2011;162: 405–422. pmid:21239228
  55. 55. Kermarrec L, Franc A, Rimet F, Chaumeil P, Frigerio JM, Humbert JF, et al. A next-generation sequencing approach to river biomonitoring using benthic diatoms. Freshw Sci. 2014;33: 349–363.