Skip to main content
Advertisement

< Back to Article

Fig 1.

Fine-grained population structure in Ireland.

(A) fineSTRUCTURE clustering dendrogram for 1,035 Irish individuals. Twenty-three clusters are defined, which are combined into cluster groups for clusters that are neighbouring in the dendrogram, overlapping in principal component space (B) and sampled from regions that are geographically contiguous. Details for each cluster in the dendrogram are provided in S1 Fig. (B) Principal components analysis (PCA) of haplotypic similarity, based on ChromoPainter coancestry matrix for Irish individuals. Points are coloured according to cluster groups defined in (A); the median location of each cluster group is plotted. (C) Map of Ireland showing the sampling location for a subset of 588 individuals analysed in (A) and (B), coloured by cluster group. Points have been randomly jittered within a radius of 5 km to preserve anonymity. Precise sampling location for 44 Northern Irish individuals from the People of the British Isles dataset was unknown; these individuals are plotted geometrically in a circle. The map and administrative boundaries were produced using data from the database of Global Administrative Areas (GADM; https://gadm.org). (D) “British admixture component” (ADMIXTURE estimates; k = 2) for Irish cluster groups. This component has the largest contribution in ancient Anglo-Saxons and the SEE cluster. (E) Linear regression of principal component 2 (B) versus British admixture component (r2 = 0.43; p < 2×10−16). Points are coloured by cluster group. (Standard error for ADMIXTURE point estimates presented in S11 Fig.).

More »

Fig 1 Expand

Fig 2.

Genes mirror geography in the British Isles.

(A) fineSTRUCTURE clustering dendrogram for combined Irish and British data. Data principally split into Irish and British groups before subdividing into a total of 50 distinct clusters, which are combined into cluster groups for clusters that formed clades in the dendrogram, overlapped in principal component space (B) and were sampled from regions that are geographically contiguous. Names and labels follow the geographical provenance for the majority of data within the cluster group. Details for each cluster in the dendrogram are provided in S2 Fig. (B) Principal component analysis (PCA) of haplotypic similarity based on the ChromoPainter coancestry matrix, coloured by cluster group with their median locations labelled. We have chosen to present PC1 versus PC4 here as these components capture new information regarding correlation between haplotypic variation across Britain and Ireland and geography, while PC2 and PC3 (Fig 4) capture previously reported splitting for Orkney and Wales, respectively, from Britain [7]. A map of Ireland and Britain is shown for comparison, coloured by sampling regions for cluster groups, the boundaries of which are defined based on the Nomenclature of Territorial Units for Statistics (NUTS 2010), with some regions combined. Sampling regions are coloured by the cluster group with the majority presence in the sampling region; some sampling regions have significant minority cluster group representations as well, for example the Northern Ireland sampling region (UKN0; NUTS 2010) is majorly explained by the NICS cluster group but also has significant representation from the NLU cluster group. The PCA plot has been rotated clockwise by 5 degrees to highlight its similarity with the geographical map of the Ireland and Britain. NI, Northern Ireland; PC, principal component. Cluster groups that share names with groups from Fig 1 (NLU; SMN; CLN; CNN) have an average of 80% of their samples shared with the initial cluster groups. The map and administrative boundaries were produced using data from the database of Global Administrative Areas (GADM; https://gadm.org), note some boundaries have been subsumed or modified to better reflect sampling regions.

More »

Fig 2 Expand

Fig 3.

t-distributed stochastic neighbour embedding (t-SNE) of Irish and British coancestry matrix.

(A) fineSTRUCTURE dendrogram with clusters and cluster groups defined as in Fig 2. (B) Two-dimensional t-SNE embedding of ChromoPainter coancestry matrix, with median locations for cluster groups plotted. As t-SNE is a stochastic method, different runs produce different solutions to the 2-dimensional embedding; shown here is a typical result. t-SNE performed significantly better with the ChromoPainter coancestry matrix than with Hamming distances (identity-by-state) computed over single SNP markers (S9 Fig). The map and administrative boundaries were produced using data from the database of Global Administrative Areas (GADM; https://gadm.org), note some boundaries have been subsumed or modified to better reflect sampling regions.

More »

Fig 3 Expand

Fig 4.

Principal components 2 and 3 of combined Irish and British coancestry matrix.

(A) fineSTRUCTURE clustering dendrogram for combined Irish and British data, with cluster groups defined as in Fig 2. Immediately following the principal inter-island split, Orkney and Wales branch in sequence, consistent with previous observations. (B) Principal component analysis (PCA) of haplotypic similarity based on the ChromoPainter coancestry matrix, coloured by cluster group with their median locations labelled. PC2 captures an Orkney split, while PC3 captures a Welsh split.

More »

Fig 4 Expand

Fig 5.

Inter-island exchange of haplotypes between the north of Ireland and northern Britain.

The boxplots show the distribution of individuals on principal component (PC) 1 for each island and for specific sampling regions (Scotland/Northern Ireland) and cluster groups (SSC and NICS; see Fig 2). A substantial proportion of Northern Irish individuals fall within the expected range for Scottish individuals in PC space and vice versa. This exchange is particularly pronounced for Northern Irish and Scottish individuals that fall within the NICS and SSC cluster groups (Fig 2), respectively.

More »

Fig 5 Expand

Fig 6.

All-Ireland GLOBETROTTER admixture date estimates for European and British surrogate admixing populations.

A summary of the date estimates and 95% confidence intervals for inferred admixture events into Ireland from European and British admixing sources is shown in (A), with ancestry proportion estimates for each historical source population for the two events and example coancestry curves shown in (B). In the coancestry curves Relative joint probability estimates the pairwise probability that two haplotype chunks separated by a given genetic distance come from the two modelled source populations respectively (i.e. FRA(8) and NOR-SG); if a single admixture event occurred, these curves are expected to decay exponentially at a rate corresponding to the number of generations since the event. The green fitted line describes this GLOBETROTTER fitted exponential decay for the coancestry curve. If the sources come from the same ancestral group the slope of this curve will be negative (as with FRA(8) vs FRA(8)), while a positive slope indicates that sources come from different admixing groups (as with FRA(8) vs NOR-SG). The adjacent bar plot shows the inferred genetic composition of the historical admixing sources modelled as a mixture of the sampled modern populations. A European admixture event was estimated by GLOBETROTTER corresponding to the historical record of the Viking age, with major contributions from sources similar to modern Scandinavians and northern Europeans and minor contributions from southern European-like sources. For admixture date estimates from British-like sources the influence of the Norman settlement and the Plantations could not be disentangled, with the point estimate date for admixture falling between these two eras and GLOBETROTTER unable to adequately resolve source and proportion details of admixture event (fit quality FQB< 0.985). The relative noise of the coancestry curves reflects the uncertainty of the British event. Cluster labels (for the European clustering dendrogram, see S4 Fig; for the PoBI clustering dendrogram, see S3 Fig): FRA(8), France cluster 8; NOR-SG, Norway, with significant minor representations from Sweden and Germany; SE_ENG, southeast England; N_SCOT(4) northern Scotland cluster 4.

More »

Fig 6 Expand