Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Morphological and molecular diversity in mid-late and late maturity genotypes of cauliflower

  • Neha Rana,

    Roles Data curation, Investigation, Methodology, Writing – original draft

    Affiliation Department of Vegetable Science & Floriculture, Chaudhary Sarwan Kumar Himachal Pradesh Agricultural University, Palampur, Himachal Pradesh, India

  • Akhilesh Sharma ,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – review & editing

    assharmaakhil1@gmail.com

    Affiliation Department of Vegetable Science & Floriculture, Chaudhary Sarwan Kumar Himachal Pradesh Agricultural University, Palampur, Himachal Pradesh, India

  • Ranbir Singh Rana,

    Roles Funding acquisition, Supervision, Writing – review & editing

    Affiliation Centre for Geo Informatics Research and Training, Chaudhary Sarwan Kumar Himachal Pradesh Krishi Vishvavidyalaya, Palampur, Himachal Pradesh, India

  • Hem Lata,

    Roles Data curation, Writing – review & editing

    Affiliation Department of Vegetable Science & Floriculture, Chaudhary Sarwan Kumar Himachal Pradesh Agricultural University, Palampur, Himachal Pradesh, India

  • Bansuli,

    Roles Formal analysis, Software, Validation, Writing – review & editing

    Affiliation Department of Vegetable Science & Floriculture, Chaudhary Sarwan Kumar Himachal Pradesh Agricultural University, Palampur, Himachal Pradesh, India

  • Alisha Thakur,

    Roles Writing – review & editing

    Affiliation Department of Vegetable Science & Floriculture, Chaudhary Sarwan Kumar Himachal Pradesh Agricultural University, Palampur, Himachal Pradesh, India

  • Vivek Singh,

    Roles Writing – review & editing

    Affiliation Department of Vegetable Science & Floriculture, Chaudhary Sarwan Kumar Himachal Pradesh Agricultural University, Palampur, Himachal Pradesh, India

  • Aditya Sood

    Roles Data curation, Formal analysis

    Affiliation Department of Vegetable Science & Floriculture, Chaudhary Sarwan Kumar Himachal Pradesh Agricultural University, Palampur, Himachal Pradesh, India

Abstract

Genetic diversity is the prerequisite for the success of crop improvement programmes. Keeping in view, the current investigation was undertaken to assess the agro-morphological and molecular diversity involving 36 diverse mid-late and late cauliflower genotypes following α-RBD design during winter season 2021–22. Six morphological descriptors predicted as polymorphic using Shannon diversity index with maximum for leaf margin (0.94). The genotypes grouped into nine clusters based on D2 analysis with four as monogenotypic and gross plant weight (32.38%) revealed maximum contribution towards the genetic diversity. Molecular diversity analysis revealed 2–7 alleles among 36 polymorphic simple sequence repeats (SSR) with average of 4.22. Primer BoESSR492 (0.77) showed maximum polymorphic information content (PIC) with mean of 0.58. SSR analysis revealed two clusters each with two subclusters with a composite pattern of genotype distribution. STRUCTURE analysis showed homogenous mixture with least amount of gene pool introgression within the genotypes. Thus, based on morphological and molecular studies, the diverse genotypes namely, DPCaCMS-1, DPCaf-W4, DPCaf-US, DPCaf-W131W, DPCaf-S121, DPCaf-18, DPCaf-13, DPCaf-29 and DPCaf-CMS5 can be utilized in hybridization to isolate potential transgressive segregants to broaden the genetic base of cauliflower or involve them to exploit heterosis.

Introduction

Amongst the cole crops belonging to family Brassicaceae, cauliflower (Brassica oleracea var. botrytis L., 2n = 2x = 18) finds most significant place having low calories and high in dietary fibre, with a widespread center of origin in the Mediterranean Basin [1] and is grown in several nations across the world for both its nutritional content and for its added-value foods in the processing industry. Cauliflower is cultivated to obtain snow-white, compact and tender curds that are high in flavonoids, isothiocyanates, carotenoids, indoles and tocopherols which are frequently used as a vegetable in soups, curries and pickles [2]. Additionally, cauliflower has a variety of therapeutic uses, such as anti-cancer [3], lowering the risk of cardiovascular disease, and treating diabetes [4]. With an area, production, and productivity of 467 thousand hectares, 8941 thousand metric tonnes and 19.14 metric tonnes per hectare, respectively, India is the second-largest producer of cauliflower in the world after China [5].

Breeders have focused a lot of attention on the crop for varietal/hybrid development since it is a cost-effective, nutrient-dense vegetable crop with a wide range of plant and curd traits [6] and the variability in the germplasm is prerequisite to meet these objectives [7]. Thus, understanding the variable characteristics of the germplasm and their genetic relationships is crucial in crop development efforts to create new varieties, and enhancement of productivity [8]. This will make it easier to tackle upcoming difficulties including climate change, the depletion of natural resources and numerous biotic and abiotic stresses [9].

The Indian cauliflower germplasm is greatly diversified by the involvement of exotic germplasm and locally generated inter-group progenies for its utilization in heterosis breeding and stress resistance [10]. The present day cultivated cauliflower varieties in India is the result of unique genomic composition of the Indian cauliflower and snowball types. The late cauliflower in India is the consequence of introduction from European countries and have played a significant role in the evolution of mid-late group that shares resemblance of both snowball and typical Indian types [11]. The introgression of genes (major/minor) during the isolation of new genotypes alter the performance of different attributes for their adaptability and consumer acceptance. The researchers reported wide diversity in various maturity groups of cauliflower viz., early [12], mid group [13], and snowball type [1416] using morphological descriptors. However, the traits across groups are sensitive to environmental conditions and thereby, these observations do not satisfactorily reproduce degree of variation in the genotypes for use in breeding. The understanding of degree of variation at molecular level and corroborating this knowledge with morphological descriptors would provide enormous significance to the breeders in planning their breeding program. The plant architecture of cauliflower germplasm exhibited enormous diversity for leaf and curd characteristics in terms of size, shape, colour and compactness [17] that would be available using different methodologies i.e. evaluations of morphological features, isozymes, total seed protein and molecular markers [1]. The assessment of genetic diversity and relationships of germplasm through morphological characterization is the first step in any breeding program [18], although they involve lots of labour and time besides sensitive to environmental factors [17]. Molecular markers are therefore viewed as the best tools to understand genetic relationships in crop species as they are independent of environmental variables, dominant/co-dominant in nature and suggest precise scoring techniques [18]. Microsatellite markers are the best in comparison to other markers on account of easy detection through PCR, ample number, highly polymorphic, co-dominantly inherited, multi-allelic and consistently dispersed in genomes and require small amount of DNA for analysis [19]. In order to estimate the genetic diversity in cauliflower germplasm, it would be of immense significance to understand the variation at the molecular level and their validation using morphological descriptors. Keeping in view the above aspects, the current study was planned with objectives to assess morphological variation and molecular diversity using SSR markers in genotypes of mid-late and late cauliflower.

Material and methods

Plant materials

Experimental layout.

Thirty-six mid-late and late cauliflower genotypes were used to study genetic diversity and relationship determination derived from diverse populations and are also collected from diverse ecological areas (Table 1). These genotypes were assessed during winter of 2021–2022 at Vegetable Research Farm, Department of Vegetable Science and Floriculture, CSK Himachal Pradesh Agricultural University, Palampur using α–lattice square design using three replications, nine blocks per replication, and four entries per block. The pH of the soil in the experimental field was 5.7, making it an acidic clay loam. The seedlings were transplanted at 45 cm each as inter- and intra-row spacing. The seedling of 36 genotypes were raised in nursery beds of size 3 m × 1 m × 0.15 m on September 3, 2021 and were ready for transplanting in about 40 days.

Phenotyping

The Protection of Plant Varieties and Farmers’ Rights Authority (PPV & FRA) in New Delhi, India proposed 11 agro-morphological DUS descriptors for cauliflower, along with 20 other morphological and quality traits. Keeping these morphological traits in consideration, the observations were made on randomly selected five plants of each genotype in each replication. The data was changed to the individual DUS test scores and further transformed to binary data. The data on other morphological and quality characteristics, such as days to curd initiation, days to first marketable curd harvest, stalk length (cm), leaf length (cm), leaf width (cm), number of leaves per plant, plant height (cm), plant frame (cm), curd polar diameter (cm), curd equatorial diameter (cm), curd size index (cm2), curd solidity (g/cm), gross plant weight (g), marketable curd weight (g), net curd weight (g), non-marketable curds (%), harvest duration (days), harvest index (%), total soluble solids (°Brix) and ascorbic acid (mg per 100g fresh weight basis) were recorded and means of all observations were calculated for further statistical analysis.

Genotyping using SSR markers

DNA extraction and purification.

Using the Cetyl trimethyl ammonium bromide (CTAB) procedure described by Clarke [20], DNA extraction for the genotypes was done. Using a pestle and mortar, fresh and tender leaf tissues were crushed, and the extract was then transferred to a 2 ml Eppendorf tube containing 2000 μl of 2X CTAB extraction buffer. For 40–50 minutes, tubes were incubated in a water bath at 65°C while being shaken every 10 minutes. After adding a 24:1 ratio of chloroform: isoamyl alcohol in tubes (800 μl), the mixture was shaken for 25 minutes. The samples were then centrifuged at 12000 rpm for 7 minutes. After being transferred, the supernatant (upper phase) into 1.5 ml tubes with an equivalent volume of cold isopropanol (600 μl) was chilled overnight at -20°C. The next day, centrifugation was carried out for 7 minutes at 12000 rpm. After pellet formation, the supernatant was removed and the pellets were washed in 200 μl of 70% alcohol, centrifuged at 7000 rpm for 3 minutes and then allowed to air dry. 50 μl of 1 X TAE buffer were added to the pellet to dissolve it and they were kept at -20°C. Additionally, a 1% agarose gel and the Nanodrop 2000c (Thermo Scientific) were used to verify the quantity and purity of the isolated DNA.

SSR primer selection.

Eighty SSR markers were selected from the list of previous researchers [19, 2123] from all nine chromosomes, reported to had steady amplification and distinct banding patterns. These 80 markers were screened on 36 genotypes. The primers which failed to produce frequent or clear band sizes i.e. monomorphic were excluded from the research. Therefore, a final set of 36 polymorphic SSR primers were selected for further investigation and scored (S1 Table).

PCR amplification and gel electrophoresis.

The PCR reaction was conducted in a total volume of 10 μl (1 μl of template DNA and 9 μl of reaction mixture containing 0.6 μl of each forward and reverse primer, 3.5 μl of master mix, and 4.3 μl of nuclease-free water) in PCR tubes. This reaction was performed in a DNA Thermal Cycler (BIO-RAD) as follows: an initial denaturation step for 4 min at 94˚C; annealing step for 30s at 94˚C and extension step for 1 min at 72˚C followed by 35 cycles. Final extension step followed by 7 min at 72˚C and then the reaction was stored at 4 ˚C for ∞.

Each sample’s PCR product were verified by running it on a 2.5% agarose gel for two hours at 80 V with 10 μl of ethidium bromide in 1X TAE buffer. Additionally, a 100-bp DNA ladder was used as a molecular indicator for determining the size of the SSR primer product. The Gel-Documentation Unit (BIO-RAD) was used to view the gels and took pictures of them.

Statistical analysis

DUS data analysis.

NTSYS-pc (version 2.02) was used to create the dendrogram using binary data of 11 DUS descriptors [24]. Each descriptor’s Shannon Diversity Index (H) was determined using the following formula: where, pi is the proportion (n/N)

Morphological diversity analysis.

Each genotype’s phenotypic data for two quality and 18 morphological traits was collected, subjected to the Mahalanobis D2 statistic [25] and then sorted into various clusters using the Tocher technique [26]. The computer programme WINDOSTAT 8.0 created by Indostat Services was utilised to carry out the D2 analysis. The following equation represents the Mahalanobis D2 analysis between two genotypes determined based on the ’p’ characters: where,

wij = variance-covariance matrix

wij = reciprocal of (wij), (i j = 1,2……, p)

Xi1 = sample mean for ith character for first sample

Xi2 = sample mean for ith character for second sample

Additionally, using the EIGEN algorithm in the programme XLSTAT with the correlation coefficient between two genotypes, principal component analysis (PCA) was used to predict the grouping pattern of the 36 cauliflower genotypes.

Molecular analysis.

The amplicons were scored as 1 (present) or 0 (absent) depending on the molecular size (base pair) of the amplified product and a dendrogram was created using Rohlf [23] NTSYS software (version 2.02) and the Jaccard’s similarity matrix coefficient. where,

‘Cij’ is the number of positive matches between two genotypes

ni and nj are the total number of bands in genotype i and j, respectively, in SIMQUAL program of NTSYS-pc package (version 2.02)

After that, PCoA plot was designed using the GenALEx 6.5 software [27]. Further, binary data was used to calculate a genetic dissimilarity matrix using the Jaccard dissimilarity index (dij) between pairs of accessions (units), where,

dij represents the dissimilarity between units i and j

The DARwin programme (version 6.0) was used to generate an UnWeighted Neighbour Joining tree from the dissimilarity matrix [28] and the polymorphism information content (PIC) of each gel was calculated. where,

k is the total number of alleles

pi is the frequency of the ith allele in the set of genotypes investigated [29].

Various genetic diversity parameters including average number of alleles (Na) count, effective number of alleles (Ne), Shannon’s diversity index (I), observed heterozygosity (Ho), and predicted heterozygosity (He), were examined using POPGENE software version 1.32. The 36 cauliflower genotypes’ genetic makeup was determined using the Bayesian model-based programme STRUCTURE (v.2.3.3) [30]. The analysis was done with K ranging from 1 to 10 using an admixture model with 10,000 burning periods and 10,000 replicates. The final peak of plotting LnPD values were identified using online web-based Structure Harvester program. In addition to this, genetic diversity within and among the populations (AMOVA) was determined using GenAlex software ver. 6.5. [27].

Results

Agro-morphological characterization

Qualitative characteristics are crucial for describing plants since they are heavily impacted by customer preferences, the socioeconomic environment and natural selection. In Table 2, the diversity index for cauliflower is shown together with the distribution (%) of several categories of minimum descriptors. The Shannon Diversity Index (H) ranged from 0.29 to 0.94 with a mean of 0.54. The similarity coefficient ranged from 0.81 to 1.00, indicating a wide range of genetic variation among the genotypes. There was significant diversity among 36 genotypes except for leaf glossiness, blanched/non-blanched curds, curd texture, curd compactness and riceyness which were determined to be monomorphic and were omitted from the research. Majority of genotypes possessed intermediate plant growth habit (72%), erect leaves (Type-3) position of leaves (78%), dark green leaf colour (75%), broad elliptic leaf shape (89%), crenate leaf margin (56%) and white curd colour (92%). The cluster analysis grouped all the genotypes into two major clusters with two sub-clusters (Fig 1 and S2 Table).

thumbnail
Fig 1. Groping of 36 genotypes based on morphological characters.

https://doi.org/10.1371/journal.pone.0290495.g001

thumbnail
Table 2. Distribution (%) of different categories of minimal descriptors in cauliflower.

https://doi.org/10.1371/journal.pone.0290495.t002

Mahalanobis D2 diversity analysis

To differentiate the genotypes based on similarity and differences, the genotypes were subjected to multivariate cluster analysis based on Euclidean distance among the phenotypic characters. The genotypes were grouped into nine distinct cluster following Tocher’s procedure [26]. Cluster I categorized as the largest with 11 genotypes followed by 9, 5, 4 and 3 genotypes in clusters III, V, IV and II, respectively while cluster VI, VII, VIII and IX were monogenotypic that revealed optimum genetic diversity in the germplasm (Fig 2 and S3 Table). The inter-cluster distance ranged from 12.65–39.71, with maximum divergence in clusters II and IX (39.71) followed by clusters VII and IX (39.29), and clusters VIII and IX (36.50) [Table 3]. Cluster means also showed differences for all the traits (S4 Table). Cluster VII recorded maximum mean values for leaf length (37.16 cm), number of leaves per plant (12.80), plant height (46.39 cm), curd polar diameter (8.00 cm), curd size index (98.11), curd solidity (52.98), marketable curd weight (651.27 g) and net curd weight (423.87 g). Similarly, cluster VIII showed maximum means for stalk length (4.31 cm), leaf width (18.40 cm), plant frame (55.72 cm), curd equatorial diameter (12.27 cm), gross plant weight (940.60 g) and harvest duration (17 days). These mean values point towards the diversity in genotypes viz., DPCaf-CMS5 ‘DPCaCMS-1’, ‘DPCaf-29’, ‘DPCaf-US’ ‘DPCaf-W4’, ‘DPCaf-W131W’and ‘DPCaY-7’ for various important curd and plant characters namely, marketable/net curd weight, curd size index/solidity, plant frame, harvest duration. In addition, the relative contribution of individual trait indicated the maximum contribution towards genetic divergence by gross plant weight (32.38%) followed by curd solidity (22.70%) and stalk length (20.95%) [Fig 3]. Thus, the captioned parents and traits with maximum diversity can be utilized in cauliflower improvement programme.

thumbnail
Fig 2. Average distance of intra and inter-cluster centroids based on various traits in cauliflower genotypes.

https://doi.org/10.1371/journal.pone.0290495.g002

thumbnail
Fig 3. Relative contribution (%) of individual trait to the genetic divergence among cauliflower genotypes.

https://doi.org/10.1371/journal.pone.0290495.g003

thumbnail
Table 3. Average intra and inter-cluster values of D2 and √D2 among clusters.

https://doi.org/10.1371/journal.pone.0290495.t003

Principal component analysis (PCA)

In the present study, 36 genotypes were characterized according to 20 phenotypic traits by PCA analysis (Fig 4) that divided total variation into 20 principal components. The first six PCs were selected with eigen values of more than 1 and they cumulatively explained 79.29% of total variation. PC1 explained 27.82% of the total variability that was mainly influenced by days to curd initiation, days to first marketable curd harvest, curd polar diameter, gross plant weight and harvest index. PC2 showed the association of harvest index and harvest duration with 18.91% contribution. In PC3 (11.30%), plant frame had a major impact. The fourth main component explained 7.75% of total variability which were due to days to curd initiation, leaf width, curd size index, curd solidity and non-marketable curd (S4 Table).

thumbnail
Fig 4. Principal component analysis (PCA) based on morphological traits for active variables (Red) and active observations (Blue).

https://doi.org/10.1371/journal.pone.0290495.g004

Diversity analysis using SSR markers

In this study, 36 SSR markers, uniformly distributed all over the nine chromosomes (S1 Table) with high polymorphism content and clear banding pattern recognition were applied for genomic DNA amplification of 36 cauliflower genotypes (Fig 5). Table 4 explains the information on different number of alleles per locus (Na), effective number of alleles per locus (Ne), Shannon’s Information index (I), polymorphism information content (PIC), observed heterozygosity and expected heterozygosity. A total of 152 alleles were observed with each marker correlating to 2–7 alleles with average of 4.22 alleles per marker. The PIC of markers differed significantly with maximum by primer BoESSR492 (0.77) and the minimum by BoESSR370 (0.11) with average value of 0.58. The UPGMA dendrogram divided 36 genotypes in two major clusters each containing 18 genotypes which further divided clusters A into two sub-clusters A1 and A2, and that of B into B1 and B2 with 8 and 10 genotypes in their respective sub-clusters (Fig 6; S5 Table). The genetic identity was further confirmed by neighbor-joining tree (Fig 7) using the DARwin software version 5.0.158 [23]. The molecular variance (AMOVA) depicts high proportion of variability within population (95%) and only 5% among population (Fig 8). Associations among the 36 genotypes following PCoA (Fig 9) revealed similar grouping of lines to that of the cluster analysis which determined 49.49% diversity mainly through three principal components (PC) (26.215% for PC1, 13.81% for PC2 and 9.53% for PC3).

thumbnail
Fig 5. PCR amplification profile of SSR markers for 36 cauliflower genotypes (PCR product was separated on 2.5% agarose gel, M = 100bp DNA ladder).

https://doi.org/10.1371/journal.pone.0290495.g005

thumbnail
Fig 6. Dendrogram generated for 36 cauliflower genotypes by UPGMA (NTSYS-pc software) using SSR data.

https://doi.org/10.1371/journal.pone.0290495.g006

thumbnail
Fig 7. Neighbor joining tree of 36 genotypes generated by DARwin software (using data from 36 SSR markers).

https://doi.org/10.1371/journal.pone.0290495.g007

thumbnail
Fig 8. Molecular variance among and within population of cauliflower based on AMOVA.

https://doi.org/10.1371/journal.pone.0290495.g008

thumbnail
Fig 9. Principal coordinate analysis using SSR markers for cauliflower genotypes.

https://doi.org/10.1371/journal.pone.0290495.g009

thumbnail
Table 4. Details of polymorphic SSR primers used for the molecular characterization of cauliflower genotypes.

https://doi.org/10.1371/journal.pone.0290495.t004

Further, structure analysis revealed that the LnP(D) (log-likelihood) increased with the model parameter K value (Fig 10) with a sharp peak at K = 2, indicating that the population studied was a mixed population consisting of two subpopulations with 16 genotypes (44.44%) from CSKHPKV, Palampur in P1 and 20 genotypes in P2 (55.55%) from Palampur, IARI, New Delhi and IARI-Regional Station, Katrain, Kullu, Himachal Pradesh. Additionally, both subpopulations contained genotypes from different groups of curd maturity.

thumbnail
Fig 10. Gene pool introgression based on the population structure analysis at K = 2.

Each genotype is represented by a vertical bar, which is partitioned into K colored segments that represent individual’s estimated membership coefficient (Q) to the K (= 2) clusters. Two subpopulations were P1-Red, P2-Green, Genotypes 15,16,17, and 18 showed introgression/mixed population.

https://doi.org/10.1371/journal.pone.0290495.g010

Discussion

The first and foremost step in any efficient breeding program is to realize the extent of genetic diversity in the population of a crop species. The range of genetic diversity and population structure determine a significant role in the maintenance and augmentation of productivity [31]. The choice of the suitable genotypes would be useful based on knowledge of the genetic relationships at the molecular and plant levels. The genetic divergence in genetic material pave way to select parents with uniqueness or distinctiveness for traits of interest for utilization in hybridization to isolate superior progenies in segregating generations [32]. The Shannon Diversity Index (H) varied from 0.29 to 0.94 based on the examination of DUS data, demonstrating a broad range of genetic variation across the genotypes. Breeders are better able to understand the genetic diversity and relationships of parents by using cluster analysis and principal coordinate analysis to create suitable hybridization combinations [33]. The cluster analysis based on Euclidean distance further substantiated the diversity in our genetic material by distinguishing them based on similarities and differences. Using Tocher’s approach (D2 analysis) divided the genotypes into 9 diverse groups with four monogenotypic pointing towards diversity among genotypes [34, 35]. The clustering pattern showed that genotypes with the same geographical distribution fall into different clusters, demonstrating that the genetic makeup of the genotype has an impact on the clustering pattern [18]. The utilization of genotypes in hybridization from the same cluster would presumably little diverge from one another and therefore, they may not be able to synthesize desirable segregants [36, 37]. Cluster V (16.51) showed significant heterogeneity with maximum intra-cluster distance followed by cluster IV (15.42). Clusters II and IX were most divergent as depicted from maximum inter-cluster distance (39.71), suggesting to involve genotypes from these diverse clusters in hybridization to obtain transgressive segregants [3841]. The clusters VII and VIII were found to be the most divergent based on cluster means for majority of the valuable traits (curd and plant parameters) that had direct impact on marketable curd weight [34] and can be successfully applied for the selection and identification of various parents for hybridization [38, 39, 42].

The phenotypic traits may be simplified into numerous components using principal component analysis (PCA), giving researchers the chance to choose PCs with traits that are more strongly associated with variance [35]. Only the first 6 PCs out of a total of 20 PCs were contributing more to the overall genetic variation i.e. 79.29% and cover majority of the primary traits that describe maturity, curd and plant characteristics. Similarly, Kumar et al. [43] and Singh et al. [44] also identified a comparable observation in cauliflower. Therefore, based on agro-morphological characterization, genotypes viz., DPCaCMS-1, DPCaf-W4 and DPCaf-US from cluster II, DPCaf-W131W, DPCaf-S121, DPCaf-S122 and DPCaf-18 from cluster III, DPCaY-9, DPCaf-13 and DPCaY-7 from cluster IV, DPCaf- 12–1 from cluster VI, DPCaf-29 from cluster VII, and DPCaf-CMS5 from cluster VIII offer greater potential as a breeding stock to be used in hybridization programmes for the isolation of transgressive segregants in cauliflower.

Environmental elements significantly affect morphological characterization since phenotype (P) is the consequence of relationship between genotype (G) and environment (E) i.e.P = G + E + GE and therefore, the worth of any diverse genetic material is not reflected at the field level. Unbiased phenotyping caused by the combination of genotype and environment can have an impact on both the grouping pattern of genotypes and the deviation of inter-/intra cluster distance [45]. These intricacies and distortions give false information about genetic divergence. The assessment of genetic diversity through DNA markers offers a rapid and accurate method for assessing genetic diversity among genotypes [32] as they are independent of environment factors [46]. The use of DNA markers supports quick screening for polymorphic loci among diverse group of genotypes. The assessment of genetic diversity in cauliflower has been undertaken by using various molecular markers including RAPD, RFLP, AFLP and SSR [1, 21]. Among these markers, SSR are locus specific, numerous, stable, widely dispersed throughout the genome, highly polymorphic due to variation in repeat units and highly informative due to their codominant nature and their usage can thus avoid the unpredictable results caused by morphological markers [46, 47]. Therefore, they are most ideal markers to study diversity in germplasm, varietal identification, marker-assisted breeding and genome mapping [46].

In present study, 36 polymorphic SSR markers generated 152 alleles with 4.22 alleles per marker along with 3.04 effective number of alleles per locus (Ne), 1.18 Shannon’s information index (I), 0.58 Polymorphic Information Content (PIC), 0.08 Observed heterozygosity (Ho) Ho and 0.64 expected heterozygosity (He) indicating good amount of diversity in 36 genotypes of cauliflower. Zhu et al. [6] observed relatively narrow genetic diversity with average PIC value (0.316) in their cauliflower inbred lines and pointed out that this may be the result of high frequency of artificial-oriented selection, inadequate polymorphism among SSR markers, and low genetic diversity in the genetic material used. However, our results with comparative high PIC value indicating optimum genetic diversity in the breeding lines which were developed through open pollination and collected from diverse sources.

The molecular markers help to differentiate breeding material by grouping them into different clusters and through principal coordinate analysis. This helps breeders to understand the genetic relationship among inbred lines and their genetic diversity which can effectively be utilized in hybridization program to obtain desirable transgressive segregants [6, 47, 48]. Accordingly, SSR markers divided genotypes into two main clusters (A and B) each of which further categorized into two sub-clusters with variable set of genotypes based on maturity group. In sub-cluster (A2), genotypes of mid-late group (e.g., DPCaY-7 and DPCAY-9) were placed together while certain sub- clusters had genotypes particular to late maturity group as depicted in sub-cluster B2 (Pusa Snowball K-1 and Pusa Snowball K-25). Vanlalneihi et al. [49] and Rakshita et al. [46] also observed agro-morphological and molecular diversity in diverse set of genotypes from different maturity groups of Indian cauliflower. Cauliflower genotypes have vast ecological distribution, strong environmental adaption, capacity to survive, and evolutionary consequences all contribute to its expanded genetic diversity [50] on account of its sensitivity to temperature for curding and thereby, distribution of genotypes to different maturity groups.

The polymorphic information content (PIC) value of the primer is an important metric for determining the degree of polymorphism to be employed in molecular research. The PIC value indicated low to high levels of diversity with 8 SSR primers with moderate range (between 0.25 and 0.50), 27 SSR markers highly polymorphic (value more than 0.50) [51]. El- Eswai et al. [52] also reported similar PIC utilizing 25 genotypes from a diversified pool and 12 SSR markers in an Ireland collection. Additionally, the neighbor-joining tree using the DARwin programme added more genetic assurance by creating three significant groups. The variation in NTSYS-pc v2.02 and DARwin v6.0.21 program may be ascribed to the differentiation in clustering method i.e. former calculate similarity coefficient while other one work on dissimilarity coefficient.

The genetic differentiation based on AMOVA might be useful in breeding programs that revealed high proportion of variation (95%) within cauliflower population and 5% among populations. Yousef et al. [53] observed 6 per cent variation among population and 94 per cent variability within population. In addition, principal coordinates analysis (PCoA) provide important information about major groups as compared to cluster analysis. PCoA plotted clear separation of the genotypes collected from different locations and suggested that 49.5% of total variation cumulatively accounted by first three axes (PC1, PC2 and PC3) with their individual contribution of 26.2, 13.8 and 9.5%, respectively. Yousef et al. [53] and Stansell et al. [54] also explained the diversity using PCoA with major contribution of first three axes.

Model-based population structure analysis (STRUCTURE) depicts homogeneous mixture and provides understanding about the introgression in the present population within gene pool, thereby elucidate grouping better than dendrogram. The differentiations at K = 2 revealed almost consistency in pedigree of genotypes with introgression in few genotypes and this mix is quite obvious as breeding genotypes is a continuous process through recombination and outcrossing events [55]. Such introgression will continue to rise in the genetic resources [56]. Yousef et al. [53] showed optimum value of K = 2 while Zhu et al. [6] and Rakshita et al. [46] at K = 4 working with different sets of cauliflower germplasm.

We noticed a good range of diversity, reflected from grouping pattern of cauliflower genotypes. The outcome of the current study demonstrated a substantial genetic diversity among cauliflower genotypes based on molecular markers, their degree of polymorphism and number of detected alleles. The grouping of genotypes based on SSR markers was inconsistent with conventional grouping of the genotypes based on morphological traits and geographic origins. In cauliflower, least SSR diversity was demonstrated by Tonguc and Griffiths [57] that may be the result of selection of genotypes from a narrow gene pool. On the other hand, Astarini et al. [58], Plieske and Struss [59] and Li et al. [19] observed the effectiveness of SSR markers in assessing the diversity range in cauliflower. Hence, it would be imperative to use more markers with their distribution across nine chromosomes to undertake inclusive diversity studies in the cauliflower germplasm. The findings of this study would be utilized to characterize and identify genetic variation in cauliflower accessions for traits of preference by breeders based on farmer/consumer assessment. The diverse genotypes namely, DPCaCMS-1, DPCaf-W4, DPCaf-US, DPCaf-W131W, DPCaf-S121, DPCaf-18, DPCaf-13, DPCaf-29 and DPCaf-CMS5 can be utilized in various improvement programs of cauliflower. The use of more SSR markers will support to identify novel genes that will provide a platform in DNA fingerprinting, genome mapping and gene pyramiding.

Conclusion

Cauliflower is a highly thermos-sensitive crop and its curd initiation and curd developmental stages are under the influence of prevailing temperatures. Therefore, for cauliflower breeders, agro-ecological diversity along with molecular characterization represents a double-edged sword. Based on morphological and molecular characterization, genotypes namely, ‘DPCaCMS-1’, ‘DPCaf-US’, ‘DPCaf-W131W’, ‘DPCaf-S121’, ‘DPCaf-W4’, ‘DPCaf-29’, ‘DPCaf-18’, ‘DPCaf-13’, ‘DPCaY-4’, ‘DPCaf-S121’, ‘DPCaf-30’ and ‘DPCaCMS-5’ were found diverse and these genotypes could be used in future hybridization programmes to either exploit heterosis or isolation of transgressive segregants with desirable horticultural traits in mid-late/late cauliflower genotypes.

Supporting information

S1 Table. List of SSR primers used for molecular studies.

https://doi.org/10.1371/journal.pone.0290495.s001

(DOCX)

S2 Table. Clustering of cauliflower genotypes based on DUS characters.

https://doi.org/10.1371/journal.pone.0290495.s002

(DOCX)

S3 Table. Cluster means for different traits of cauliflower genotypes based on D2 statistic.

https://doi.org/10.1371/journal.pone.0290495.s003

(DOCX)

S4 Table. Principal component analysis for different traits in cauliflower genotypes.

https://doi.org/10.1371/journal.pone.0290495.s004

(DOCX)

S5 Table. Clustering of cauliflower genotypes on the basis of SSR data (NTSYS 2.02).

https://doi.org/10.1371/journal.pone.0290495.s005

(DOCX)

Acknowledgments

The authors would like to thank Dr. R.C. Agrawal, National Coordinator, National Agriculture Higher Education Project (NAHEP) and Dr. Anuradha Agrawal, Project Coordinator for their constant encouragement to publish articles in journals with high impact factor.

References

  1. 1. Izzah NK, Lee, Perumal S, Park JY, Ahn K, Fu D, et al. Microsatellite-based analysis of genetic diversity in 91 commercial Brassica oleracea L. cultivars belonging to six varietal groups. Genetic Resources and Crop Evolution. 2013; 60: 1967–1986.
  2. 2. Jahangir M, Kim HK, Choi YH, Verpoorte R. Health-affecting compounds in Brassicaceae. Comprehensive Reviews in Food Science and Food Safety. 2009; 8: 31–43.
  3. 3. Tang L, Zirpoli GR, Guru K, Moysich KB, Zhang Y, Ambrosone CB. Consumption of raw cruciferous vegetables is inversely associated with bladder 544 cancer risk. Cancer Epidemiol Biomarkers Prevention. 2008; 17: 938–944.
  4. 4. Sharma R, Prasad R. Nutritional evaluation of dehydrated stems powder of cauliflower incorporated in mathri and sev. Journal of Nutrition and Food Sciences. 2018; 8: 1–3.
  5. 5. Anonymous. Indian Horticulture Database. National Horticulture Board. Ministry of Agriculture and Farmers Welfare, Government of India, Gurugram, Haryana, India. 2019. http://nhb.gov.in.
  6. 6. Zhu S, Zhang X, Liu Q, Luo T, Tang , Zhou Y. The genetic diversity and relationships of cauliflower (Brassica oleracea var. botrytis) inbred lines assessed by using SSR markers. PLoS ONE. 2018; 13: e0208551. pmid:30521622
  7. 7. Singh S, Kalia P, Parkash C, Kumar S, Sharma BB, Saha S. Seventy five years of research and development in cauliflower and cabbage: A journey from temperate to tropicalization and aristocrats to commoners. International Journal of Innovative Horticulture. 2022; 11(2): 184–197.
  8. 8. Abbas SJ, Marwat KB, Khan IA, Iqbal M. Molecular analysis of genetic diversity in Brassica species. Pakistan Journal of Botany. 2009; 41: 167–176.
  9. 9. Rana C, Sharma A, Sharma KC, Mittal P, Sinha BN, Sharma VK, et al. Stability analysis of garden pea (Pisum sativum L.) genotypes under North Western Himalayas using joint regression analysis and GGE biplots. Genetic Resources and Crop Evolution. 2021; 68: 999–1010.
  10. 10. Kalia P, Singh S. Advances in Cauliflower (Brassica oleracea L. var. botrytis) Breeding. In: Advances in Plant Breeding Strategies: Vegetable Crops. 2021; Springer. Vol. 10 (In Press).
  11. 11. Dey SS, Bhatia R, Sharma K, Bhardwaj I, Pandey CD, Prakash C, et al. Frequent introgression of European cauliflowers in the present day cultivated Indian cauliflowers and role of Indian genotypes in the evolution of tropical cauliflower. Euphytica. 2019; 215: 23. https://doi.org/10.1007/s10681-019-2346-0
  12. 12. Kumar A, Roy C, Kumar R, Kumar R, Singh VK, Verma RB. Study of genetic divergence for earliness and yield attributing traits in tropical cauliflower (Brassica oleracea var. botrytis L). J. Pharmacogn. Phytochem. 2018; 7(2): 1761–1763.
  13. 13. Chatterjee S, Sharma S, Mukherjee D, Patil S. Estimation of genetic divergence in some mid late and late cauliflower (Brassica oleracea Var. botrytis L.) germplasm. Biosccan, 2018: 13(2): 459–462.
  14. 14. Dey SS, Sharma SR, Parkash C, Barwal RN, Bhatia R. Genetic divergence in snowball cauliflower (Brassica oleracea var. botrytis L.). Indian J Plant Genet Resour. 2011; 24: 48–51.
  15. 15. Hanuman R, Sureja AK, Krishnan SG, Dey SS, Tomar BS. Diversity analysis in snowball cauliflower (Brassica oleracea var. botrytis) using microsatellite markers. Indian J. Agri. Sci. 2016; 86(10): 1299–1304.
  16. 16. Singh S, Bhatia R, Kumar R, Das A, Ghemeray H, Behera TK, et al. Characterization and genetic analysis of OguCMS and doubled haploid based large genetic arsenal of Indian cauliflowers (Brassica oleracea var. botrytis L.) for morphological, reproductive and seed yield traits revealed their breeding potential. Genet. Resour. Crop Evol. 2021; 68(4): 1603–1623.
  17. 17. Cansian R, Echeverrigaray S. Discrimination among cultivars of cabbage using randomly amplified polymorphic DNA markers. Horticultural Science. 2000; 35: 1155–1158.
  18. 18. Sharma A, Sharma S, Kumar N, Rana RS, Sharma P, Kumar P, et al. Morphomolecular genetic diversity and population structure analysis in garden pea (Pisum sativum L.) genotypes using simple sequence repeat markers. PLoS ONE. 2022; 17(9): e0273499. https://doi.org/10.1371/journal.pone.0273499
  19. 19. Li H, Chen X, Yang Y, Xu J, Gu J, Fu J, et al. Development and genetic mapping of microsatellite markers from whole genome shotgun sequences in Brassica oleracea. Mol Breed. 2011; 28: 585–596. https://doi.org/10.1007/s11032-010-9509-y
  20. 20. Clarke JD. Cetyltrimethyl Ammonium Bromide (CTAB) DNA Miniprep for Plant DNA isolation. Cold Spring Harbor Protocol. 2009. pmid:20147112
  21. 21. Louran S, Torp AM, Holme IB, Andersen SB, Jensen BD. Database derived microsatellite markers (SSRs) for cultivar differentiation in Brassica oleracea. Genetic Resources and Crop Evolution. 2007; 54: 1717–1725.
  22. 22. Gao M, Li G, Yang B, Qiu D, Farnham M, Quiro C. High-density Brassica oleracea linkage map: identification of useful new linkages. Theoretical and Applied Genetics. 2007; 115: 277–287. pmid:17592603.
  23. 23. Singh S, Dey SS, Bhatia R, Kumar R, Sharma K, Behera TK. Heterosis and combining ability in cytoplasmic male sterile and double haploid based Brassica oleracea progenies and prediction of heterosis using microsatellites. PLoS ONE. 2019; 14: e0210772.
  24. 24. Rohlf FJ. Numerical Taxonomy and Multivariate Analysis System, Version 2.02. Exeter Software: Setauket, New York. 1993.
  25. 25. Mahalanobis PC. On the generalized distance in statistics. Proceedings National Academy of Sciences of India. 1936; 22: 49–55.
  26. 26. Rao CR. Advanced Statistical Methods in Biometrical Research. John Wiley and Sons Inc. New York Edn. 1. 1952.
  27. 27. Peakall R, Smouse PE. GenAlEx 6.5: Genetic analysis in Excel. Population genetic software for teaching and research-an update. Bioinformatics. 2012; 28: 2537–2539. pmid:22820204
  28. 28. Perrier X, Jacquemoud CJP. DARwin software. 2006.
  29. 29. Anderson JA, Churchill GA, Autrique JE, Tanksley SD, Sorrells ME. Optimizing parental selection for genetic linkage maps. Genome. 1993; 36: 181–186. pmid:18469981
  30. 30. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000; 155: 945–959. pmid:10835412
  31. 31. Sharma A, Sekhon BS, Sharma S, Kumar R. Newly isolated intervarietal garden pea (Pisum sativum L.) progenies (F7) under north western Himalayan conditions of India. Experimental Agriculture. 2020; 56(1): 76–87.
  32. 32. Verma VK, Kalia P, Prasanna BM. Genetic characterization of self-incompatible lines and strategies for heterosis breeding in cauliflower. International Journal of Vegetable Science. 2017; 23: 411–429.
  33. 33. Adeyemo O, Menkir A, Melaku G, Omidiji O. Genetic diversity assessment and relationship among tropical yellow endosperm maize inbred lines using SSR markers. Maydica. 2011; 56:43–49.
  34. 34. Varalakshmi B, Pushpalatha A, Girigowda JR. Genetic diversity in early cauliflower. Indian Journal of Horticulture Science. 2010; 67: 281–283.
  35. 35. Aleem S, Tahir M, Sharif I, Aleem M, Najeebullah M, Nawaz A, et al. Principal component and cluster analyses as a tool in the assessment of genetic diversity for late season cauliflower genotypes. Pakistan Journal of Agricultural Research. 2021; 34: 176–183.
  36. 36. Sharma A, Bhardwaj A, Katoch V, Sharma JD. Assessment of genetic diversity of garden pea (Pisum sativum) as perspective to isolate horticulturally desirable transgressive segregants. Indian Journal of Agricultural Sciences. 2013; 83(12): 1334–1339.
  37. 37. Sekhon BS, Sharma A, Katoch V, Kapila RK, Sood VK. Assessment of genetic diversity in advanced breeding lines derived from intraspecific crosses of garden pea (Pisum sativum L.). Legume Research-An International Journal. 2019; 42(2): 145–152.
  38. 38. Dey SS, Sharma SR, Parkash C, Barwal RN, Bhatia R. Genetic divergence in snowball cauliflower (Brassica oleracea L var. botrytis). Indian Journal of Plant Genetic Resources. 2010; 24: 48–51.
  39. 39. Santhosha HM, Varalakshmi B, Narase, Gowda NC. Genetic diversity in early cauliflower (Brassica oleracea var. botrytis L.) germplasm. Journal of Horticultural Science. 2011; 6: 21–24.
  40. 40. Chatterjee S, Sharma S, Mukherjee D, Patil S. Estimation of genetic divergence in some mid late and late cauliflower (Brassica oleracea L. var. botrytis) germplasm. The Bioscan. 2018; 13: 459–462.
  41. 41. Kumar S, Ali B, Khaldun ABM, Islam SS, Uddin S, Akanda MMJ, et al. Genetic diversity, correlations and path coefficient analysis among the F5 populations of Brassica species. Asian Journal of Advances in Agricultural Research. 2021; 16: 20–31.
  42. 42. Sharma S, Singh Y. Genetic divergence studies in cauliflower (Brassica oleracea L. var. botrytis) under mid hill conditions of Himachal Pradesh. Himachal Journal of Agricultural Research. 2018; 44: 50–53.
  43. 43. Kumar V, Singh DK, Panchbhaiya A, Singh N. Genetic divergence studies in mid-season cauliflower (Brassica oleracea L. var. botrytis) through principal component analysis (PCA) and D2 analysis. International Journal of Chemical Studies. 2017; 5: 989–993.
  44. 44. Singh J, Sharma A, Lata H, Thakur A and Kumar N. Genetic diversity for curd yield and its attributes in late cauliflower. Indian J. Hortic. 2023; 80(2): 136–142.
  45. 45. Rahevar PM, Patel JN, Kumar S, Acharya RR. Morphological, biochemical and molecular characterization for genetic variability analysis of Capsicum annuum. Vegetos. 2019; 32: 131–141. https://doi.org/10.1007/s42535-019-00016-5.
  46. 46. Rakshita KN, Singh S, Verma VK, Sharma BB, Saini N, Iquebal MA, et al. Agro-morphological and molecular diversity in different maturity groups of Indian cauliflower (Brassica oleracea L. var. botrytis) PLoS ONE. 2021; 16: e0260246.
  47. 47. Zhao Z, Gu H, Sheng X, Yu H, Wang J, Zhao J, et al. Genetic diversity and relationships among loose-curd cauliflower and related varieties as revealed by microsatellite markers. Scientia Horticulturae. 2014; 166: 105–110.
  48. 48. Sun GL, William M, Liu J, Kasha KJ, Pauls KP. Microsatellite and RAPD polymorphisms in Ontario cornhybrids are related to the commercial sources and maturity ratings. Mol Breeding. 2001; 7: 13–24.
  49. 49. Vanlalneihi B, Saha P, Kalia P, Singh S, Singh N. Microsatellite marker based intervarietal variability study in Indian cauliflower. Indian Journal of Horticulture. 2019; 76: 300–304.
  50. 50. Yan W, Li J, Zheng D, Friedman C, Wang H. analysis of genetic population structure and diversity in Mallotus oblongifolius using ISSR and SRAP markers. PeerJ. 2019; 7: e7173. pmid:31275758
  51. 51. Levinson G, Gutman GA. Slipped-strand mispairing: a major mechanism for DNA sequence evolution. Mol. Biol. Evol. 1987; 4: 203–221. pmid:3328815
  52. 52. El-Esawi MA, Germaine K, Bourke P, Malone R. Genetic diversity and population structure of Brassica oleracea germplasm in Ireland using SSR markers. Comptes Rendus Biologies. 2016; 339: 133–140. pmid:26995396
  53. 53. Yousef EAA, Müller T, Börner A, Schmid KJ. Comparative analysis of genetic diversity and differentiation of cauliflower (Brassica oleracea var. botrytis) accessions from two ex situ gene banks. PLoS ONE. 2018; 13: e0192062.
  54. 54. Stansell Z, Hyma K, Fresnedo-Ramírez J, Sun Q, Mitchell S, Björkman T, et al. Genotyping-by-sequencing of Brassica oleracea vegetables reveals unique phylogenetic patterns, population structure and domestication footprints. Horticulture Research. 2018; 5: 38. pmid:29977574
  55. 55. Cristov NK, Tsonev S, Todorova V, Todorovska EG. Genetic diversity and population structure analysis- a prerequisite for constructing a mini core collection of Balkan Capsicum annuum germplasm. Biotechnology and Boitechnological Equipment. 2021; 35: 1010–1023.
  56. 56. Park JR, Yang WT, Kwon YS, Kim HN, Kim KM, Kim DH. 2019. Assessment of the genetic diversity of rice germplasms characterized by black-purple and red pericarp color using simple sequence repeat markers. Plants. 2019; 8(11):471. pmid:31689922
  57. 57. Tonguc M, Griffiths PD. Genetic relationships of Brassica vegetables determined using database derived simple sequence repeats. Euphytica. 2004; 137: 193–201. https://doi.org/10.1023/B:EUPH.0000041577.84388.43
  58. 58. Astarini IA, Plummer JA, Lancaster RA, Yan G. Fingerprinting of cauliflower cultivars using RAPD markers. Aus J Agric Res. 2004; 55:117–124. https://doi.org/10.1071/AR03012
  59. 59. Plieske J, Struss D. Microsatellite markers for genome analysis in Brassica. I. Development in Brassica napus and abundance in Brassicaceae species. Theor Appl Genet. 2001; 102: 689–694.