Figures
Abstract
The woody Sonchus alliance, a spectacular example of adaptive radiation with six genera and approximately 31 species, is found exclusively on three Macaronesian Islands (Madeira, Canaries, and Cape Verdes) in the Atlantic Ocean. Four of the Sonchus taxa are restricted to Madeira, including shrubs and small trees at higher elevations (S. fruticosus and S. pinnatus), and caudex perennials in the lower coastal areas (S. ustulatus subsp. maderensis and S. ustulatus subsp. ustulatus). The Madeiran Sonchus stemmed from a single colonization event that originated from the Canaries < 3 million years ago. However, the plastome evolution and species relationships remains insufficiently explored. We therefore assembled and characterized the plastomes of four Sonchus taxa from Madeira and conducted a phylogenomic analysis. We found highly conserved plastome sequences among the taxa, further supporting a single and recent origin. We also found highly conserved plastomes among the cosmopolitan weedy Sonchus, Macaronesian Sonchus in the Atlantic, and Juan Fernández Islands Dendroseris in the Pacific. Furthermore, we identified four mutation hotspot regions (trnK-rps16, petN-psbM, ndhF-Ψycf1, and ycf1) and simple sequence repeat motifs. This study strongly supports the monophyly of Madeiran Sonchus. However, its relationship with the remaining woody Sonchus alliance from the Canary Islands requires further investigation.
Citation: Kim H-B, Lee D-G, Kim S-C (2023) Plastomes of Sonchus (Asteraceae) endemic to the Atlantic Madeira archipelago: Genome structure, comparative analysis, and phylogenetic relationships. PLoS ONE 18(6): e0287523. https://doi.org/10.1371/journal.pone.0287523
Editor: Branislav T. Šiler, Institute for Biological Research, University of Belgrade, SERBIA
Received: January 8, 2023; Accepted: June 7, 2023; Published: June 22, 2023
Copyright: © 2023 Kim et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The plastome datasets sequenced and analyzed in the current study are available from GenBank (https://www.ncbi.nlm.nih.gov) under the accession numbers of MN537814, MN537815, MN537816, and MN542414.
Funding: This research was funded by the Basic Science Research Program through the National Research Foundation of Korea (NRF: https://www.nrf.re.kr/eng/main/) (Grant Number 2019R1A2C2009841) to SCK. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Oceanic islands serve as natural laboratories that provide excellent opportunities to investigate the patterns and processes of organismal evolution [1]. The inherent geographical isolation from the continent allows ancestral island populations to become distinct from continental progenitor populations, providing an opportunity to undergo explosive lineage diversification into a wide range of available habitats within the oceanic insular environment—this phenomenon is commonly known as adaptive radiation [2], an example of cladogenetic speciation. Adaptive radiation is often used to explain the diversification of numerous endemic biotas on oceanic islands, providing wonderful opportunities for investigation into speciation [3]. Such cases of evolution include, but are not limited to, well-known examples of Hawaiian Drosophila (e.g., D. adiastola Hardy) [4], Galapagos finches (e.g., Certhidea olivacea Gould) [5], and Hawaiian honeycreepers (e.g., Pinicola enucleator Linnaeus) [6]. In addition, numerous plant examples, including Dendroseris D.Don and Robinsonia DC. (e.g., R. gayana Decne.) on the Juan Fernández Islands, the silversword alliance (e.g., Dubautia plantaginea Gaudich.) and Tetramolopium Nees (e.g., T. conyzoides Hillebr.) in Hawaii, and various lineages (e.g., Echium L. (e.g., E. candicans L.f.), Sonchus L. (e.g., S. fruticosus L.f.), and Argyranthemum Webb (e.g., A. callichrysum (Svent.) Humphries) from Macaronesia, reveal the patterns and processes of adaptive radiation on oceanic islands [7–13].
The Sonchinae (currently recognized as Hyoseridinae) is a subtribe from the tribe Cichorieae Lam. & DC. (Asteraceae) distributed globally but with discontinuous habitats in terms of phytogeography, and currently includes 14 genera and approximately 146 species [12]. Several genera of this subtribe, which are mainly found on oceanic islands, exhibit peculiar characteristics compared to their continental species and have been studied as examples of adaptive radiation. Notably, these include Dendroseris D.Don in the Juan Fernández Islands, Thamnoseris F.Phil. in the San Ambrosio Islands, S. megalocarpa (Hook.f.) J.M.Black [formerly, Actites megalocarpa (Hook.f.) Lander] in Australia, and Sonchus novae-zelandiae (Hook.f.) Garn.-Jones [formerly, Kirkianella novae-zelandiae (Hook.f.) Allan] and S. grandifolius Kirk [formerly, Embergeria grandifolia (Kirk) Boulos] in New Zealand, as well as the woody Sonchus alliance in the Macaronesian Islands. The subgenus Dendrosonchus, also known as the woody Sonchus alliance, consists of approximately 31 species and is found in Macaronesia, mainly in the Canary Islands, Madeira archipelagos, and Cape Verde. However, S. pinnatifidus is found on both the Canary Islands and Morocco. They are endemic to the Canary Islands with the exception of five species, among which S. daltonii is endemic to the Cape Verde, and the remaining four species are endemic to the Madeira archipelagos [14].
The Madeira archipelago, a part of the floristic region of Macaronesia, is positioned approximately 520 km west of the African country, Morocco, and 400 km north of the Canary Islands. It comprises the main island of Madeira and two smaller eastern islands, Porto Santo and Desertas (Fig 1). Of the 1,226 vascular plants reported in Madeira, 123 (10%) are endemic [15,16]. Four Sonchus L. taxa occur exclusively in the Madeira archipelago: S. fruticosus L.f., S. pinnatus Aiton, S. ustulatus subsp. maderensis Aldridge, and S. ustulatus subsp. ustulatus Lowe (Fig 1). The Madeira Sonchus species form a part of the woody Sonchus alliance (ca. 31 species) and originated from the Canary archipelago. They are thought to be monophyletic, as determined by nuclear ribosomal DNA sequences but not yet by chloroplast DNA and represent lineage diversification within the archipelago after inter-archipelago dispersal events [12,14]. These species are categorized into two groups based on distinct life forms: shrub/subshrub types (S. fruticosus and S. pinnatus) and caudex perennials (two subspecies of S. ustulatus) (Fig 1) [15]. S. fruticosus is a perennial shrub with short, thick branches that reach up to 4 m in height. It has pinnatifid, sinuate, and sessile leaves with triangular lobes and dentate margins. Sonchus pinnatus is a perennial shrub that grows up to 2 m in height and has pinnatisect or pinnatipartite and petiolate leaves with lanceolate lobes [15]. The two S. ustulatus subspecies are herbaceous perennials that reach up to 30 cm height with short sub-woody stems. Sonchus ustulatus subsp. ustulatus has pinnatipartite to pinnatisect and subsessile leaves, with glaucous on the lower surface and a dentate margin. In contrast, S. ustulatus subsp. maderensis has an entire margin and overlapped leaves with lanceolate terminal lobes. These species occur in different habitats and altitudinal ranges in the Madeira archipelago (Fig 1). Sonchus fruticosus is distributed throughout Madeira and Porto Santo, whereas S. pinnatus and S. ustulatus subsp. ustulatus exclusively occur in Madeira. Sonchus ustulatus subsp. maderensis is more widespread, occurring on all three islands (Madeira, Porto Santo, and Desertas) [15]. Sonchus ustulatus subsp. ustulatus and S. ustulatus subsp. ustulatus are commonly distributed in the northern and southern coastal areas of Madeira, respectively. Sonchus fruticosus is distributed in Laurisilva and the moist ravines in the interior of Madeira, at altitudes of 800–1200 m, while S. pinnatus commonly is distributed on rocky slopes at 1000–1400 m and lower altitudes [15].
(a) Map of Madeira archipelago and distribution of the endemic Sonchus species to the archipelago. Occurrence data of S. fruticosus (purple) and S. pinnatus (orange) were obtained from the IUCN Red List database (https://www.iucnredlist.org/species/103588658/103588672 and https://www.iucnredlist.org/species/103588709/103588713, respectively). Green areas depict the species overlap areas. The two subspecies of S. ustulatus, namely subsp. maderensis (blue) and subsp. ustulatus (red) were based on previous reports [14]. The boundary, which is similar but not identical to the original image and is thus for illustrative purpose only, was obtained from the United States Geological Survey (USGS) National Map Viewer (http://viewer.nationalmap.gov/viewer/). Accessed on May 10, 2023. Sonchus species on Madeira (all photos were taken by Seung-Chul Kim); (b) S. fruticosus, (c) S. pinnatus, (d) S. ustulatus subsp. maderensis, and (e) S. ustulatus subsp. ustulatus.
The advent of next generation sequencing (NGS) allowed for the rapid assembly and characterization of whole plastid genome sequences in numerous land plant lineages. These plastome resources have proven useful in resolving difficult and obscure phylogenetic relationships and developing efficient plastid markers for DNA barcoding and phylogeographic and phylogenetic studies [17]. In an ongoing effort to establish robust and highly resolved plastome-based phylogenetic relationships between Sonchus and related genera in the subtribe Hyoseridinae, we characterized several herbaceous and woody Sonchus species [18–22]. In particular, the plastomes of the woody Sonchus alliance have been characterized with special emphasis on species from the Canaries [18,21]. However, information on the plastome organization and variations among the four Madeiran taxa and their relationships with the species in the Canaries remains limited. Definitive evidence for the monophyly of Madeira Sonchus based on cpDNA is also lacking. Therefore, this study aimed to: (1) determine the complete plastomes of four Madeira-endemic Sonchus taxa; (2) compare their gene content, order, and any changes in organization; (3) conduct comparative genomic analyses to identify highly variable hotspot regions and chloroplast simple sequence repeat (cpSSR) markers; and (4) elucidate the phylogenetic relationships among the four Madeira Sonchus taxa and assess their relationships with the Canary Islands congeneric taxa.
Methods and methods
Material preparation, DNA extraction, genome sequencing, and annotation
One the 2002 expedition to Madeira, four endemic Sonchus taxa were collected freshly from the island (see voucher and collection information in [14]). The total genomic DNA of three taxa, S. pinnatus, S. ustulatus subsp. ustulatus, and S. ustulatus subsp. maderensis was isolated using the Exgene™ Plant SV mini kit (GeneAll, Seoul, Korea). Sonchus fruticosus’s DNA was extracted using the CTAB method [23]. The concentration and quality of the extracted DNA were verified using 1% agarose gel electrophoresis. Next-generation sequencing (NGS) was performed at Macrogen Corporation (Seoul, Korea), and approximately six gigabytes of raw NGS sequence data was generated for each taxon. As shown in Table 1, the depth of coverage of 500X or higher indicates that sufficient sequencing was conducted for chloroplast genome assembly.
An Illumina paired-end (PE) genomic library was constructed and sequenced using the Illumina HiSeq platform, according to the standard Illumina PE protocol in the TruSeq Nano DNA Sample Preparation Guide. The sequence reads were assembled using the de novo genomic assembler Velvet 1.2.10 [24]. In the case of S. pinnatus, we performed PCR confirmation for three ambiguous regions (two in the large single copy, LSC, and one in the small single copy, SSC) to assemble the circular plastid genome. PCR gap filling was performed using the Inclone™ Taq DNA polymerase kit (IncloneBiotech Co., Yongin, Korea), with a final volume of 50 μL. The PCR product was purified using the Inclone™ Gel & PCR Purification Kit (Inclonebiotech Co., Yongin, Korea) and sequenced at Macrogen Corporation (Seoul, Korea).
Protein-coding regions (CDS) were annotated using CPGAVAS2 [25], and tRNAs were predicted using ARAGORN v1.2.38 (http://mbio-serv2.mbioekol.lu.se/ARAGORN/; [26]) and tRNAscan-SE (http://lowelab.ucsc.edu/tRNAscan-SE/; [27]). RNAmmer 1.2 Server [28] was used to annotate rRNAs. These were annotated and corrected based on four congeneric taxa from the Canary Islands, namely S. canariensis (NC042381), S. acaulis (NC042382), S. webbii (NC042383), and S. leptocephalus (MN334533) using Geneious Prime 2020.0.3 (Biomatters Ltd., Auckland, New Zealand). Each circular genome map was visualized using OGDRAW v. 1.3.1 (http://ogdraw.mpimp-golm.mpg.de/; [29]) and MPI-MP CHLOROBOX online. Complete cp genome sequences were deposited in GenBank (MN537814, MN537815, MN537816, and MN542414).
Identification of highly divergent regions
We estimated the nucleotide diversity of woody Sonchus species and analyzed the DNA polymorphisms using DnaSP v. 6 [30]. This software computed the sliding window for nucleotide diversity (Pi) with a 100 bp window length and a 25 bp step size. We executed the LAGAN alignment mode using the mVISTA program (http://genome.lbl.gov/vista/mvista/; [31,32]) to align the four Madeira Sonchus taxa. Sonchus fruticosus was used as the reference. Four plastid genomes were aligned using MAFFT v. 7 (https://mafft.cbrc.jp/alignment/software/; [33]), and gene orientation was checked using Geneious software. The window size and resolution were 100 bp and 60 bp, respectively.
Repeat sequence analysis
MIcroSAtellite identification software (MISA; http://pgrc.ipk-gatersleben.de/misa/) was used to detect simple sequence repeat (SSR) motifs with repeat units ranging from one to six nucleotides [34]. The minimum number of nucleotide repeats was set to 15 for mononucleotide repeats (1–15), five for dinucleotide (5–3) repeats, and three for trinucleotide (3–3), tetranucleotide (4–3), pentanucleotide (5–3), and hexanucleotide (6–3) repeats.
Codon usage bias, RNA editing site, and genes under positive selection
The codon usage bias was calculated using MEGA-X [35]. We compared the codon distribution and relative synonymous codon usage (RSCU) values of the study group. Moreover, we conducted RNA-editing on Madeiran Sonchus species using PREP-Cp (http://prep.unl.edu/; [36]), with default settings (cutoff value of 0.8). Each Madeira Sonchus had 87 protein-coding sequences (CDSs). Thirty-five of the total CDSs were analyzed as 52 plastid genes were not supported by the PREP-Cp. We calculated the Ka/Ks ratio of each pair in Madeira Sonchus using DnaSP v. 6 [30] to evaluate selective pressure.
Phylogenetic analysis
To determine the phylogenetic position and relationships of four Madeiran Sonchus species, we selected a total of 18 complete chloroplast genomes of the subtribe Hyoseridinae (formerly known as Sonchinae). Reichardia ligulata (MN893255), which diverged early in Hyoseridinae, was used as the outgroup. We selected herbaceous Sonchus spp., including S. boulosii (NC042244), S. arvensis (NC054161), S. oleraceus (MK371006), S. asper (MK371015), and S. brachyotus (MT850047). Moreover, we included five previously sequenced the woody Sonchus alliance taxa from the Canary Islands, namely S. bupleuroides (MK371017), S. webbii (NC042383), S. leptocephalus (MN334533), S. acaulis (MK033507), and S. canariensis (MK033506). Finally, we included seven Dendroseris species, which are endemic to the Juan Fernández Islands in the Pacific Ocean and belong to the Hyoseridinae, namely D. berteroana (NC051923), D. pinnata (NC051926), D. marginata (NC051925), D. litoralis (NC051922), D. macrantha (NC051924), D. micrantha (NC051921), and D. pruinata (NC051920). We aligned the chloroplast genomes using MAFFT v. 7 with default settings and edited them manually using Geneious software. A maximum-likelihood (ML) tree and a Bayesian tree were inferred using IQ-TREE v. 1. 6.12 [37], and MrBayes v.3.2.7 [38], respectively, with partitioning the alignment by the gene annotations. The best-fit substitution model for each partition was chosen according to the Bayesian information criterion (BIC) score and weight, using ModelFinder [39] implemented in IQ-TREE. The branch bootstrap support (BS) of ML tree was calculated with 1000 bootstrap replicates, and for Bayesian inference, the Markov chain Monte Carlo (MCMC) was run for four chains for 2,000,000 generations with sampling a tree each 100 generations and burning out the first 25% of the sampled trees. We also conducted a maximum parsimony (MP) analysis using PAUP version 4.0a [40]. The default heuristic search options included starting trees via stepwise simple sequence addition with one tree held at each step, the tree-bisection-reconnection (TBR) branch-swapping algorithm, steepest descent, and MulTrees option in effect, zero branch length collapsed, and topological constraints not enforced. BS was calculated from 1000 replicates using the same heuristic search options to evaluate the robustness of the groups.
Results
Comparative genome analysis of four Sonchus taxa according to gene content, order, and organization
Each Madeira chloroplast genome was composed of 131 CDSs, 87 genes, 37 tRNAs, and six rRNAs (Table 1). The total cp DNA size among the four taxa ranged from 152,410 bp (S. fruticosus and S. pinnatus), 152,411 bp (S. ustulatus subsp. ustulatus) to 152,426 bp (S. ustulatus subsp. maderensis) (Fig 2), which are assembled with high depths of coverage, ranged from 537 (S. pinnatus) to 2770 (S. fruticosus). The sizes of the LSC, SSC, and IR were nearly identical among the four plastomes (Table 2). In particular, two morphologically distinct shrubby species, S. fruticosus and S. pinnatus, presented identical chloroplast genome sizes with nearly identical sequences, except for three point-mutations on each clpP intron, and the trnK-rps16 and psaA-ycf3 intergenic spacers. The four Madeira taxa had identical gene compositions (Table 1). Seven genes (petB, atpF, ndhA, rpl2, rpl16, rps16, and rpoC1) contained a single intron and three genes (clpP, rps12, and ycf3) contained two introns (Table 2). We found little variation for the border positions of the LSC, SSC, and IR regions among the chloroplast genomes of Sonchus species in Madeira and the Canary Islands (Fig 3). The Madeira Sonchus showed a difference in the ycf1 region between SSC and IRb in S. ustulatus subsp. maderensis, compared with the remaining taxa. For the Canary Islands Sonchus plastomes, S. webbii contained a 6 bp gap between ψycf1 and ndhF, while others contained a 15 bp gap at the SSC/IRa. The ψycf1 gene of S. webbii (480 bp) was slightly longer than the others (471 bp).
The genes inside and outside the circle are transcribed in clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are shown in different colors. Thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the genomes into large single copy (LSC) and small single copy (SSC) regions.
Ψ indicates pseudogenes. Gene names are indicated in boxes and their lengths in the corresponding regions are displayed above the boxes.
Simple sequence repeats, codon usage bias, and RNA editing sites
We investigated SSRs among the four Madeira Sonchus plastomes and found a similar total number of SSRs (Fig 4). A total of 71 SSRs were found in S. fruticosus and S. ustulatus subsp. ustulatus, and 70 in S. pinnatus and S. ustulatus subsp. maderensis (Fig 4A). The most SSRs were detected in the coding regions (53%), and the most frequent SSR type was trinucleotides (81%) (Fig 4A). The LSC region contained the most SSRs (46–47 SSRs; 58–59%), compared to the SSC (15 SSRs; 21%) and IR (9 SSRs; 12.8%) regions (Fig 4B). All four plastomes had an equal number of mononucleotide, dinucleotide, and tetranucleotide repeats, but different numbers of trinucleotide and hexanucleotide repeats (Fig 4A). In terms of trinucleotide motifs, S. pinnatus contained 64 repeats, whereas the remaining taxa contained 65 repeats. Additionally, S. ustulatus subsp. maderensis did not contain a hexanucleotide repeat, whereas the remaining species had one hexanucleotide repeat. Interestingly, we found identical frequencies of SSR distribution and types between the two morphologically divergent S. fruticosus and S. ustulatus subsp. ustulatus.
(a) Frequency of SSR type and (b) Frequency of SSR distribution in the chloroplast genome.
Analysis of relative synonymous codon usage (RSCU) based on the protein-coding genes revealed an average codon usage ranging from 22,774 (S. ustulatus subsp. maderensis) to 22,776 (S. fruticosus, S. pinnatus, and S. ustulatus subsp. ustulatus) among the four Madeira Sonchus taxa, with consistent patterns of frequently used codons among them (S1 Table). The highest RSCU value was observed with the UUA codon used for Leucine (1.9), followed by that of AGA for Arginine (1.41–1.86) and GCU for Alanine (1.78) (Fig 5). The lowest RSCU value was observed with the AGC codon used for serine (0.33), and CUG and CUC for leucine (0.37).
RSCU = relative synonymous codon usage (Left bar: S. fruticosus, S. pinnatus and S. ustulatus subsp. ustulatus; Right bar: S. ustulatus subsp. maderensis).
We also predicted RNA editing sites in the Madeira Sonchus species. All species had 48 RNA editing sites with the same cutoff value of 0.8 (S2 Table). These editing sites were present in 20 of the 35 protein-coding genes. Nine RNA editing sites were found in ndhB, five in ndhD, four in accD and rpoC1, three in matK and ndhA, and two in atpA, ccsA, ndhG, petB, rpoC2, and rps14. One editing site was identified for each atpI, ndhF, psbF, psbL, rpl20, rpoA, rpoB, and rps2. All nucleotide changes were cytosine (C) to thymine (T) transitions. The most frequent transitions were conversions from proline (P) to leucine (L).
Sequence divergence and mutation hotspot regions
We calculated the nucleotide diversity of eight woody Sonchus alliances on the Macaronesian Islands (i.e., Madeira and the Canary Islands) using DnaSP (Fig 6A and 6B). The average nucleotide diversity (Pi) was 0.0005 and ranged from 0 to 0.015. The SSC region had the highest nucleotide diversity (Pi = 0.00117), whereas each of the two IR regions had the lowest (P = 0.0001). The LSC region presented nucleotide diversity of 0.00058. For the eight Macaronesia Sonchus plastomes, we found four mutation hotspots with Pi values of ≥ 0.01, including two intergenic regions (trnK-rps16 and petN-psbM) in the LSC, and one protein-coding (ycf1) and one intergenic region (ndhF-ψycf1) in the SSC (Fig 6B). We also found four highly variable regions (i.e., trnK-rps16 and psbI-trnS, clpP intron, and ycf1) when only Madeiran species plastomes were considered (Fig 6A).
(a) Nucleotide diversity (Pi) of the four Madeira Sonchus plastomes and (b) eight Macaronesia the woody Sonchus alliance plastomes. The four most divergent regions are suggested as divergence hotspots based on Pi ≥ 0.005 for Madeira Sonchus, and Pi ≥ 0.01 for Macaronesia Sonchus species.
The mVISTA plots, with S. fruticosus as a reference, showed a high degree of synteny and gene order conservation among three of the Madeira Sonchus plastomes (Fig 7). We found 56 polymorphic sites. Not surprisingly, the intergenic spacer region (IGS) contained 55.4% of the total mutations, while the exon and intron regions contained 23.2% and 21.4%, respectively (Table 3). Approximately 55% of the polymorphisms were base substitutions, 29% were transitions (A/G or C/T), and 71% were transversions (A/C, A/T, C/G, and G/T). Of the 25 polymorphic indels (insertions and deletions), 10 (40%) were non-SSR indels, and the remaining 15 (60%) were poly A/T/G SSR indels. The longest non-SSR indels at 17 bp were located in the rps16-trnK and rpl16 introns. The most divergent regions included the rps16 intron, clpP intron, psaI-accD, atpF-atpH, rps16-trnK, rpl32-trnL, and ycf1 (Fig 7). The most variable region between the four Madeira Sonchus plastomes was the LSC (75.0%) region, followed by the SSC (23.2%) and IR (1.7%) regions.
VISTA-based identity plots showed the sequence identity of three chloroplast genomes with S. fruticosus as reference. The vertical scale indicates the percent identity from 50% to 100%. The horizontal scale indicates the genome base positions. Gray arrows above the alignment indicate the position and direction of each gene.
Monophyly and species relationships in Madeira archipelago
The topologies of the ML tree and the Bayesian inference (BI) tree were identical, and both strongly suggested the monophyly of the woody Sonchus alliance in the Macaronesian Islands (100% BS and 1.0 posterior probability (PP)) (Fig 8). The MP tree (not shown) also strongly supported the monophyly of Sonchus and its relationships with other representative genera of the subtribe Hyoseridinae. It is highly likely that the woody Sonchus alliance from the Macaronesian Islands was derived from a common ancestor. Also, the Madeira Sonchus formed a monophyletic clade based on the complete plastome sequences (100% BS). The two subspecies of S. ustulatus, namely subsp. ustulatus and subsp. maderensis, are not in a monophyletic clade. Rather, subsp. ustulatus shares a more recent common ancestor with S. pinnatus and S. fruticosus than with conspecific subsp. maderensis (Fig 8).
Reichardia ligulata was used as an outgroup based on previous studies [12,41]. The numbers above the nodes present the bootstrap values of 1000 replicates and posterior probability from the Bayesian inference.
Discussion
Madeiran Sonchus chloroplast genomes
We found highly conserved plastomes based on their size, gene content, order, and organization between the four Madeira Sonchus. The Madeira Sonchus species (152,410 bp—152,426 bp) had slightly larger plastomes than the weedy Sonchus (151,849 bp -151,967 bp) and the Canary Islands Sonchus species (152,071 bp—152,406 bp). However, the gene content and organization were highly conserved among the groups [20,21]. The woody Sonchus alliance from Madeira and the Canaries had slightly larger plastomes than the continental herbaceous congeneric species. In addition, very similar plastome sizes (152,199 bp—152,619 bp for Dendroseris and 152,071 bp—152,406 bp for the woody Sonchus alliance), gene content, and organization was observed between the Atlantic Ocean diploid Sonchus endemics and the Pacific Ocean Juan Fernández Islands tetraploid Dendroseris, which belongs to the same subtribe Hyoseridinae as Sonchus [42]. This indicates that, even though these insular endemics from the Pacific and Atlantic Oceans are morphologically and ecologically divergent, they have highly conserved plastomes that suggest their recent independent origins.
Using the same SSR search parameters (1–15, 2–5, 3–3, 4–3, 5–3, and 6–3) as the weedy Sonchus (S. asper, S. oleraceus; [20]) and the woody Sonchus alliance in the Canaries (S. canariensis, S. acaulis, S. webbii; [21]), slightly fewer SSRs were found among the Madeira Sonchus: 71 versus 79 for the weedy Sonchus and 80 for S. acaulis, 78 for S. canariensis, and 78 for S. webbii. Most of the trinucleotide motifs found in the Madeira Sonchus plastomes coincided with the weedy and Canary Islands Sonchus species [20,21]. We found a similar number of cpSSRs (71–74) and trinucleotide repeats with the highest frequency (87%) in the Juan Fernández Island endemic Dendroseris [42]. However, the LSC region contained the most SSRs (62%) in Dendroseris. The cpSSR markers identified in this study will be useful and important resources for population genetics and phylogeographic studies. The codon type distribution was consistent (Fig 5). The codon usage bias toward a high RSCU value of U and A at the third codon position was also noted, as previously reported [43–45]. The number of RNA editing sites of Madeira Sonchus was significantly lower than that of Sonchus asper (98 sites; [20]) and mostly woody species of Dendroseris (93–104 sites; [42]). In general, the most frequent RNA editing sites were found in ndhB and ndhD. The Madeira Sonchus presented a consistent pattern also found in previous studies [46–48]. Two regions, trnK-rps16 and ycf1, were consistently identified as highly variable regions in the cosmopolitan weedy Sonchus [20] and Macaronesia endemic Sonchus [21]. As maternally inherited chloroplast markers, these mutation hotspots will be useful for population or phylogenetic studies of the woody Sonchus alliance in Macaronesia and widely distributed Sonchus and related species. Substantially fewer polymorphic sites were found in the Madeira Sonchus than in the three Canary Islands Sonchus species (206 polymorphic sites; [21]) and the herbaceous weedy Sonchus species (528 polymorphic sites; [20]). This further supports the previous view that, despite their morphological and ecological differentiation, the Madeira Sonchus species are highly genetically similar, suggesting their recent origin from a common ancestor most likely from the Canary Islands [12,14,49].
Phylogenetic relationships
The phylogenetic tree shows the monophyletic clade of the woody Sonchus alliance in the Macaronesian Islands. Although the current study was based on a limited number of species, it is most likely that the woody Sonchus alliance from the Macaronesian Islands was derived from a common ancestor, as previously reported [12,14,41]. Unlike previous studies [14], the monophyly of the Madeira Sonchus was strongly supported in every ML, BI and MP analysis based on the complete plastome sequences. The monophyly of the Madeira clade and interspecific relationships were weakly supported based on the nrDNA ITS (65% and 71%, respectively) and few chloroplast regions (69% and 60%, respectively). One novel finding of this study is that the two conspecific subspecies of S. ustulatus are not closely related. Rather, S. ustulatus subsp. ustulatus shares a more recent common ancestor with S. pinnatus and S. fruticosus than with conspecific S. ustulatus subsp. maderensis. The previous ITS tree weakly suggested that two subspecies from coastal areas, without forming monophyly, represent basal lineages that first diverged within Madeira [14].
The plastome-based phylogenetic analysis of Madeira Sonchus suggests that two currently recognized subspecies of S. ustulatus may represent distinct taxonomic entities, warranting recognition at the species level. Sonchus ustulatus subsp. maderensis was proposed by Aldridge [50], based on its morphological differences and geographical distribution. Sonchus ustulatus subsp. maderensis occurs in moist rocky areas on the northern coast of Madeira (and rarely in Porto Santo and Desertas). In contrast, S. ustulatus subsp. ustulatus occurs in dry rocky and sunny areas on the southern coast of Madeira. These taxa are small, herbaceous perennials with acaulous or short, subwoody stems that can reach up to 30 cm in height. In contrast, S. pinnatus and S. fruticose are perennial shrubs that reach heights of up to 2 m and 4 m, respectively. Therefore, given the species distribution and chloroplast phylogenomic tree topology, it is highly likely that the common ancestor of the Madeiran Sonchus was somewhat similar to S. ustulatus. These were likely herbaceous perennials that first colonized the northern coastal areas of Madeira and diversified into laurel forests and moist ravines in the interior of Madeira, becoming tall shrubs. The species-level recognition of the two subspecies is further supported by a recent report of a new species, S. parathalassius J.G.Costa ex R.Jardim & M.Seq., from Porto Santo and the taxonomic recognition of S. latifolia (= S. ustulauts subsp. maderensis) [51]. Overall, new species-level recognition and interspecific relationships in the Madeira archipelago require further independent confirmation based on highly variable genome-wide nuclear markers.
The monophyly of the Madeira Sonchus was fully and strongly established in this study, but that of the closely related or progenitor Sonchus species from the Canary Islands remains unclear. Based on the limited Canary Islands Sonchus samples in this study, the plastome tree suggested that S. acaulis and S. canariensis are sisters to the Madeira clade, suggesting a possible origin from Tenerife and Gran Canaria. We gained insufficient insight into this using previous cpDNA phylogeny because of a lack of resolution within subg. Dendrosonchus and genus Taeckholmia [14]. The more resolved nuclear ITS phylogeny suggested the Madeira Sonchus as a sister to the clade of some Canary Islands Sonchus from various islands, including the older central islands (Gran Canaria, Tenerife, and La Gomera) and relatively younger western islands (La Palma and El Hierro). It makes the identification of the ancestral island and species of origin of Madeira Sonchus uncertain. Nevertheless, it was suggested that the origin of Madeira Sonchus happened relatively recently (i.e., estimated inter-archipelago dispersal event of ca. 2.7 million years ago), despite the geological age (ca. 5 million years ago) of the Madeira archipelago [49]. An extensive phylogenomic framework, including all the woody Sonchus alliances in the Macaronesian Islands, is required to further determine the geographical origin of the Madeira Sonchus species.
Conclusion
This is the first study to report the complete chloroplast genome of four Madeira Sonchus species, enabling the comparison of their genomes to those of the congeneric cosmopolitan Sonchus, the woody Canary Islands endemic Sonchus from the Atlantic Ocean, and the Juan Fernández Islands endemic Dendroseris in the Pacific Ocean. Although the four Madeira endemic Sonchus species are highly morphologically and ecologically differentiated, we found nearly identical and highly conserved chloroplast genomes, supporting a single and recent origin. We identified four mutation hotspots, namely trnK-rps16, petN-psbM, ndhF-Ψycf1, and ycf, and 70–71 cpSSRs for population genetics, phylogeography, and phylogenetic investigation. We also characterized the codon usage bias and RNA editing sites among the four Madeira taxa and compared them with those of the Canary Islands Sonchus and the Juan Fernández Islands Dendroseris. The plastome-based phylogenetic tree strongly supported the monophyly of the Madeira lineage for the first time, but its relationship with the Canary Islands congenerics requires further study based on genome-wide nuclear and chloroplast genomes. The phylogenomic tree also suggests that the herbaceous perennial, S. ustulatus subsp. maderensis, which occurs on the northern coast of Madeira, was the first to diverge within the Madeira archipelago. Lastly, S. ustulatus subsp. ustulatus, which are herbaceous perennials occurring on the southern coast of Madeira, are more closely related to the clade of perennial shrubs, S. pinnatus and S. fruticose, than to conspecific S. ustulatus subsp. maderensis. This relationship corroborates the species-level recognition of S. ustulatus subsp. maderensis as a distinct species, S. latifolia. Further genome-wide investigations may clarify the relationship among Sonchus species within Madeira.
Supporting information
S1 Table. Codon usage and the codon-anticodon recognition pattern for tRNA in four Madeira Sonchus plastomes.
https://doi.org/10.1371/journal.pone.0287523.s001
(XLSX)
S2 Table. Predicted RNA editing sites in the complete chloroplast genome of four Madeira Sonchus species.
https://doi.org/10.1371/journal.pone.0287523.s002
(XLSX)
Acknowledgments
The first author is greatly indebted to the people at the Plant Molecular Phylogenetics Laboratory at Sungkyunkwan University for their technical support during the study. The authors would like to thank Ji-Hyeon Jeon for performing bioinformatic analysis to accurately determine the phylogenetic relationships of the study group.
References
- 1. Stuessy TF, Takayama K, López-Sepúlveda P, Crawford DJ. Interpretation of patterns of genetic variation in endemic plant species of oceanic islands. Botanical Journal of the Linnean Society. 2014;174: 276–288. pmid:26074627
- 2. Emerson BC. Evolution on oceanic islands: molecular phylogenetic approaches to understanding pattern and process. Mol Ecol. 2002;11: 951–966. pmid:12030975
- 3. Takayama K, López‐Sepúlveda P, Greimler J, Crawford DJ, Peñailillo P, Baeza M, et al. Relationships and genetic consequences of contrasting modes of speciation among endemic species of Robinsonia (Asteraceae, Senecioneae) of the Juan Fernández Archipelago, Chile, based on AFLPs and SSRs. New Phytologist. 2015;205: 415–428. pmid:25209139
- 4. Carson HL, Kaneshiro KY. Drosophila of Hawaii: Systematics and Ecological Genetics. Annu Rev Ecol Syst. 1976;7: 311–345.
- 5.
Grant PR. Ecology and Evolution of Darwin’s Finches. Princeton, New Jersey, USA: Princeton University Press; 1986.
- 6. Lerner HRL, Meyer M, James HF, Hofreiter M, Fleischer RC. Multilocus resolution of Phylogeny and timescale in the extant adaptive radiation of Hawaiian honeycreepers. Current Biology. 2011;21: 1838–1844. pmid:22018543
- 7.
Carlquist S. Island Biology p. 660. New York, USA: Columbia University Press; 1974.
- 8. Sang T, Crawford DJ, Kim S-C, Stuessy TF. Radiation of the endemic genus Dendroseris (Asteraceae) on the Juan Fernandez Islands: Evidence from sequences of the ITS regions of nuclear ribosomal DNA. Am J Bot. 1994;81: 1494.
- 9. Barrier M, Baldwin BG, Robichaux RH, Purugganan MD. Interspecific hybrid ancestry of a plant adaptive radiation: allopolyploidy of the Hawaiian silversword alliance (Asteraceae) inferred from floral homeotic gene duplications. Mol Biol Evol. 1999;16: 1105–1113. pmid:10474905
- 10. Lowrey TK, Crawford DJ. Allozyme divergence and evolution in Tetramolopium (Compositae: Astereae) on the Hawaiian Islands. Syst Bot. 1985;10: 64.
- 11. Böhle UR, Hilger HH, Martin WF. Island colonization and evolution of the insular woody habit in Echium L. (Boraginaceae). Proc Natl Acad Sci U S A. 1996;93: 11740–5. pmid:8876207
- 12. Kim SC, Crawford DJ, Francisco-Ortega J, Santos-Guerra A. A common origin for woody Sonchus and five related genera in the Macaronesian Islands: molecular evidence for extensive radiation. Proc Natl Acad Sci U S A. 1996;93: 7743–8. pmid:8755546
- 13. Francisco‐Ortega J, Santos‐Guerra A, Hines A, Jansen RK. Molecular evidence for a Mediterranean origin of the Macaronesian endemic genus Argyranthemum (Asteraceae). Am J Bot. 1997;84: 1595–1613. pmid:21708563
- 14. Lee C, Kim S, Lundy K, Santos‐Guerra A. Chloroplast DNA phylogeny of the woody Sonchus alliance (Asteraceae: Sonchinae) in the Macaronesian Islands. Am J Bot. 2005;92: 2072–2085. pmid:21646124
- 15.
Jardim R, Francisco D. Endemic Flora of Madeira. Funchal: Muhia press; 2000.
- 16.
Press JR, Short MJ. Flora of Madeira. Exeter, UK: Pelagic Publishing; 2016.
- 17. Jansen RK, Cai Z, Raubeson LA, Daniell H, dePamphilis CW, Leebens-Mack J, et al. Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns. Proceedings of the National Academy of Sciences. 2007;104: 19369–19374. pmid:18048330
- 18. Kim H-B, Cho M-S, Kim S-H, Yang J, Kim S-C. Characterization of the complete chloroplast genome of the woody sow-thistle, Sonchus leptocephalus (Asteraceae) endemic to the Canary Islands. Mitochondrial DNA Part B. 2019;4: 3388–3389. pmid:33366006
- 19. Kim S-H, Mejías JA, Kim S-C. Next generation sequencing reveals the complete plastome sequence of newly discovered cliff-dwelling Sonchus boulosii (Asteraceae: Cichorieae) in Morocco. Mitochondrial DNA Part B. 2019;4: 164–165.
- 20. Cho Kim, Kim Mejías, Kim . Sow thistle chloroplast genomes: Insights into the plastome evolution and relationship of two weedy species, Sonchus asper and Sonchus oleraceus (Asteraceae). Genes (Basel). 2019;10: 881. pmid:31683955
- 21. Cho M-S, Yang JY, Yang T-J, Kim S-C. Evolutionary comparison of the chloroplast genome in the woody Sonchus alliance (Asteraceae) on the Canary Islands. Genes (Basel). 2019;10. pmid:30875850
- 22. Abdalla Masana S, Kim S-H, Kim S-C. Characterization of the complete plastome sequence of perennial sowthistle, Sonchus arvensis (Asteraceae). Mitochondrial DNA Part B. 2020;5: 2652–2654. pmid:33457894
- 23. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical bulletin. 1987;19: 11–15.
- 24. Zerbino DR, Birney E. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18: 821–829. pmid:18349386
- 25. Shi L, Chen H, Jiang M, Wang L, Wu X, Huang L, et al. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 2019;47: W65–W73. pmid:31066451
- 26. Laslett D. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 2004;32: 11–16. pmid:14704338
- 27. Lowe TM, Chan PP. tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016;44: W54–W57. pmid:27174935
- 28. Lagesen K, Hallin P, Rødland EA, Stærfeldt H-H, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35: 3100–3108. pmid:17452365
- 29. Greiner S, Lehwark P, Bock R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019;47: W59–W64. pmid:30949694
- 30. Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol Biol Evol. 2017;34: 3299–3302. pmid:29029172
- 31. Brudno M, Do CB, Cooper GM, Kim MF, Davydov E, Program NCS, et al. LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNA. Genome Res. 2003;13: 721–731. pmid:12654723
- 32. Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I. VISTA: computational tools for comparative genomics. Nucleic Acids Res. 2004;32: W273–W279. pmid:15215394
- 33. Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20: 1160–1166. pmid:28968734
- 34. Beier S, Thiel T, Münch T, Scholz U, Mascher M. MISA-web: a web server for microsatellite prediction. Bioinformatics. 2017;33: 2583–2585. pmid:28398459
- 35. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018;35: 1547–1549. pmid:29722887
- 36. Mower JP. The PREP suite: predictive RNA editors for plant mitochondrial genes, chloroplast genes and user-defined alignments. Nucleic Acids Res. 2009;37: W253–W259. pmid:19433507
- 37. Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: A Fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32: 268–274. pmid:25371430
- 38. Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Höhna S, et al. MrBayes 3.2: Efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61: 539–542. pmid:22357727
- 39. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14: 587–589. pmid:28481363
- 40.
Swofford DL. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. Sunderland, Massachusetts, USA: Sinauer Associates; 2002.
- 41. Kim S-C, Lee C, Mejías JA. Phylogenetic analysis of chloroplast DNA matK gene and ITS of nrDNA sequences reveals polyphyly of the genus Sonchus and new relationships among the subtribe Sonchinae (Asteraceae: Cichorieae). Mol Phylogenet Evol. 2007;44: 578–597. pmid:17531507
- 42. Cho M-S, Kim S-H, Yang J, Crawford DJ, Stuessy TF, López-Sepúlveda P, et al. Plastid phylogenomics of Dendroseris (Cichorieae; Asteraceae): Insights into structural organization and molecular evolution of an endemic lineage from the Juan Fernández Islands. Front Plant Sci. 2020;11. pmid:33224173
- 43. Yang J, Kang G-H, Pak J-H, Kim S-C. Characterization and comparison of two complete plastomes of Rosaceae species (Potentilla dickinsii var. glabrata and Spiraea insularis) endemic to Ulleung Island, Korea. Int J Mol Sci. 2020;21: 4933. pmid:32668601
- 44. Cho M-S, Kim JH, Yamada T, Maki M, Kim S-C. Plastome characterization and comparative analyses of wild crabapples (Malus baccata and M. toringo): insights into infraspecific plastome variation and phylogenetic relationships. Tree Genet Genomes. 2021;17: 41.
- 45. Yang J, Choi M-J, Kim S-H, Choi H-J, Kim S-C. Plastome characterization and phylogenomic analysis yield new insights into the evolutionary relationships among the species of the subgenus Bryocles (Hosta; Asparagaceae) in East Asia. Plants. 2021;10: 1980. pmid:34685791
- 46. Kim S-H, Yang J, Park J, Yamada T, Maki M, Kim S-C. Comparison of whole plastome sequences between thermogenic skunk cabbage Symplocarpus renifolius and nonthermogenic S. nipponicus (Orontioideae; Araceae) in East Asia. Int J Mol Sci. 2019;20: 4678. pmid:31547213
- 47. Rabah SO, Lee C, Hajrah NH, Makki RM, Alharby HF, Alhebshi AM, et al. Plastome sequencing of ten nonmodel crop species uncovers a large insertion of mitochondrial DNA in cashew. Plant Genome. 2017;10. pmid:29293812
- 48. Pinard D, Myburg AA, Mizrachi E. The plastid and mitochondrial genomes of Eucalyptus grandis. BMC Genomics. 2019;20: 132. pmid:30760198
- 49. Kim S-C, McGowen MR, Lubinsky P, Barber JC, Mort ME, Santos-Guerra A. Timing and tempo of early and successive adaptive radiations in Macaronesia. PLoS One. 2008;3: e2139. pmid:18478126
- 50. Aldridge AE. Macaronesian Sonchus subgenus Dendrosonchus sl.l. (Compositae-Lactuceae), including a reappraisal of the species concept and new combinations. Botanica Macaronesica. 1976;2: 81–93.
- 51. Jardim R, Menezes de Sequeira M. Notas do herbario florestal do INRB (LISFA): Fasc. XXXIII. Silva Lusitana. 2011;19: 233–245.