Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The use of geographic information system and 1860s cadastral data to model agricultural suitability before heavy mechanization. A case study from Malta

  • Gianmarco Alberti ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Classics and Archaeology, University of Malta, Msida, Malta

  • Reuben Grima,

    Roles Conceptualization, Project administration, Writing – original draft, Writing – review & editing

    Affiliation Department of Conservation and Built Heritage, University of Malta, Msida, Malta

  • Nicholas C. Vella

    Roles Conceptualization, Project administration, Writing – original draft, Writing – review & editing

    Affiliation Department of Classics and Archaeology, University of Malta, Msida, Malta


The present study seeks to understand the determinants of land agricultural suitability in Malta before heavy mechanization. A GIS-based Logistic Regression model is built on the basis of the data from mid-1800s cadastral maps (cabreo). This is the first time that such data are being used for the purpose of building a predictive model. The maps record the agricultural quality of parcels (ranging from good to lowest), which is represented by different colours. The study treats the agricultural quality as a depended variable with two levels: optimal (corresponding to the good class) vs. non-optimal quality (mediocre, bad, low, and lowest classes). Seventeen predictors are isolated on the basis of literature review and data availability. Logistic Regression is used to isolate the predictors that can be considered determinants of the agricultural quality. Our model has an optimal discriminatory power (AUC: 0.92). The positive effect on land agricultural quality of the following predictors is considered and discussed: sine of the aspect (odds ratio 1.42), coast distance (2.46), Brown Rendzinas (2.31), Carbonate Raw (2.62) and Xerorendzinas (9.23) soils, distance to minor roads (4.88). Predictors resulting having a negative effect are: terrain elevation (0.96), slope (0.97), distance to the nearest geological fault lines (0.09), Terra Rossa soil (0.46), distance to secondary roads (0.19) and footpaths (0.41). The model isolates a host of topographic and cultural variables, the latter related to human mobility and landscape accessibility, which differentially contributed to the agricultural suitability, providing the bases for the creation of the fragmented and extremely variegated agricultural landscape that is the hallmark of the Maltese Islands. Our findings are also useful to suggest new questions that may be posed to the more meagre evidence from earlier periods.


The study of past landscapes and the way they have evolved over long periods of time is key to understanding the changing relationship between humans and their environment. Approaches to the study of long-term landscape change is increasingly making use of an ever-growing array of different tools and data-sets, ranging from environmental reconstruction to ethnographic comparison, from to surface survey to map regression. The present study was undertaken in the context of the five-year ERC-funded FRAGSUS project, which examines fragility and sustainability in small island contexts, with a focus of human-environment interactions in prehistoric Malta. The sheer density of human activity on Malta over the past 7,000 years makes the reconstruction of past environments extremely challenging, as the evidence is at best fragmentary, and often obliterated by subsequent erosion and anthropogenic activity [13]. The challenge increases exponentially for more remote periods in time.

While the primary focus of FRAGSUS is the prehistoric environment, a sound understanding of the subsequent evolution of the landscape was considered essential and useful for a number of key reasons: first, because they form part of the same landscape palimpsest that can only be understood in diachronic terms; second, to allow more informed predictions of where evidence of earlier landscapes may be preserved; third, because different cultural responses in better documented periods may suggest new questions that may be posed to the more meagre evidence from earlier periods, and enrich their interpretations.

Archival records preserved from the early modern period onwards include census records and cadastral maps that may contain detailed records of ownership, productivity and yield of land, range of crops and size of herds, as well as human demography. This record, which tends to be increasingly rich in coverage and detail in more recent centuries, offers ample opportunities for the historical reconstruction of early modern landscapes. In a study of another Mediterranean small island context [4], it has been noted how the uneven nature of such records may give rise to a form of analytical exceptionalism which privileges better-documented periods, and that there is an obvious risk that the unusually detailed historical & ethnographic evidence available for the late eighteenth to twentieth centuries will lead to some gross methodological differences compared to earlier periods that may skew our practical interpretations. These risks notwithstanding, the same researchers conclude that the abundant historical records and standing remains of the late eighteenth to twentieth centuries are a resource that it would be foolish to ignore [4].

Bearing the above pitfalls in mind, but also mindful of the opportunities presented by early modern archival records, the present study examined a comprehensive survey undertaken by the British colonial government in Malta in the 1860s to produce a highly accurate and detailed terrier, or cabreo. It consists of around 750 ink and watercolour drawings of all Crown property in Malta [5]. Bound in three large-format volumes with plans of government properties on the island of Malta, and a fourth with properties on the island of Gozo, these cadastral records, referred to below collectively as the cabreo, are today held in the National Archives of Malta in Rabat. The records of rural properties contained in the cabreo offer a rare and detailed glimpse into the organisation of the productive landscape in the early modern period. The detailed documentation of the productivity or land quality of different parcels of land, described in detail below, presents the researcher with a number of interesting challenges and opportunities. How did the different conditions prevailing in different parts of the landscape influence the recorded productivity of the land? And could the information on productivity recorded for government-owned land be generalised for the wider landscape? How enduring was the influence of these factors on the favourability of land for agriculture? Or in other words, could an improved understanding of the variability in land quality in the early modern landscape shed light on and pose useful questions for the study of earlier landscapes?

The work presented here attempts to provide the basis to address these questions by developing a GIS-based model that aims at understanding to what extent (if any) topographic and cultural factors could have influenced the agricultural quality recorded in the cabreo, so allowing the historic information to be generalised for the entire landscape. The process required to achieve this also entailed some innovative applications of GIS and statistical methods to accommodate historical spatially-referenced data, which are also interesting from a methodological point of view. As a matter of fact, to the best of the authors’ knowledge, the present work is the first attempt at using cabreo data for the purpose of model building in a GIS environment.

The study area

The Maltese archipelago is located in the central Mediterranean, about 90 km south of Sicily (Fig 1).

Fig 1. Maltese archipelago.

Digital Terrain Model of the Maltese Islands, with inset showing its position relative to Sicily and southern Italy. Image created by the authors in ESRI’s ArcGIS 10.1; no copyrighted material was used.

The combined surface area of the archipelago is a mere 316 square km, largely made up by the two principal islands of Malta and Gozo. The present study is focussed on Malta, the largest island in the group. In spite of the small size of the island, the landscape is highly varied and fragmented. The northwest of Malta is characterised by a series of parallel ridges, which are separated by sheltered valleys that have been prime agricultural land at least since the early modern period. The west of the island is characterised by windswept uplands, while the centre and southeast are made up of gently rolling hills that are rather more favourable for agriculture. Several of the factors that may influence land quality, discussed in detail below, are evidently enduring features of the landscape, which may have also influenced land use in much earlier periods.

Materials: Cabreo data and GIS

Cabreo maps [6,7] were made available in digital format (.tiff files) through the courtesy of the National Archives of Malta. The material dates to the mid-1800s and documents state-owned properties. The parcels of land are different in size and were consistently shaded in different colours (Fig 2).

Fig 2. Cabreo map.

Example of a 1800s cadastral map (courtesy of the National Archives of Malta) showing the extent and features of a state-owned parcel. Colours indicate different agricultural qualities. A hand-written legend is at the right-hand side of the map and provides a description of features such as water reservoirs, crop types, and presence of farmhouses and animals. See also Fig 3. Reprinted under a CC BY license, with permission from Dr Charles Farrugia (CEO and National Archivist, National Archives of Malta).

After careful examination, it has been ascertained that colours correspond to different classes of agricultural productivity quality, as devised by the agrimensori (land surveyors) who were in charge of the creation of the cadastral maps [8]. Land productivity was either assessed on the basis of written records provided by the landowners or estimated by the land surveyors themselves. Through a quite complex calculation of the ratio between the amount of crops sown and that eventually harvested [9], the land surveyors were able to classify the productivity by means of a 5-tiered scheme, which was employed in the production of the cadastral maps. Green was consistently used for parcels whose agricultural yield was described as good, while dark brown featured parcels with an exposed rocky bank and labelled as bad. The full qualitative scale used in the cabreo ranges as follows, in descending order: buona (good), mediocre (mediocre), cattiva (bad), inferiore (lower), infima (lowest). Besides the key to the colour classification, an accompanying hand-written caption at the margin of each map recorded information about the type of crops grown, the presence of farmhouses and stables, the number and types of water facilities, and the presence of animals.

As a preliminary step, the dataset has been scrutinized in order to have a general understanding of the quality of the documentation itself. The whole dataset comprised 550 parcels, including maps of (a) extremely small tenements, (b) tenements that no longer exist due to modern urbanization, and (c) isolated buildings or farmhouses. Unfortunately, some of the images (d) showed colours that varied from those in the colour code noted above for land productivity quality. After excluding the maps falling in the mentioned four mentioned categories (a-d), the total sample of maps remaining was 250. Due to time constrains, and since maps were to be georeferenced and digitized, it was decided to draw a more manageable sub-sample in which maps from each of the three principal geographical regions of Malta could have the same probability of being chosen (namely, the low-lying hills and plains of south-east to central Malta, the parallel ridges and valleys in the northwest of the island, and the uplands in the western region). Random sampling was performed, stratified across the three regions, and 20 random maps for each macro-area were thus obtained. The fraction of the sub-sample relative to the parent dataset was arbitrarily set at 25%. The total area covered by the cabreo sub-sample is 6.70 sq km, corresponding to 2.72% of the area of the island of Malta, the largest island in the Maltese archipelago (246 sq km). The percentage rises up to 3.56% if the extent of areas urbanized today (58 sq km) is subtracted from the total are of the study region. The maps have been given spatial references (using ESRI’s ArcGIS 10.1) against georeferenced 1940s survey sheets used as base map (scale: 6 inches to 1 mile), and have been digitized by using polygons (Fig 3).

Fig 3. Cabreo maps georeferencing and digitization.

Example of cadastral map georeferencing and digitization in GIS. The digitized version of the map is in the inset. Polygons represent sectors recorded as having different agricultural qualities. Quality ranges from buona (good, in Italian) to inferiore (inferior, in Italian). Image created by the authors in ESRI’s ArcGIS 10.1; no copyrighted material was used (see also Fig 2).

Different information has been stored in the attribute table of the polygon layer, the most significant for the purposes of the present study being the agricultural quality class registered for each parcel or part thereof. A total of 318 polygons were used (Fig 4).

Fig 4. Cabreo maps sample.

Sample of cabreo maps used for model building. Urban areas, in their 1895 and modern extent, are also shown. Image created by the authors in ESRI’s ArcGIS 10.1; no copyrighted material was used.


Logistic regression

Logistic regression (hereafter LR) is widely used in different research fields, spanning from social to hard sciences [1016]. It finds extensive use in GIS-based studies [1011,1719] since it allows modelling of the relation between a nominal dependent variable and independent variables (i.e., predictors) of different types (nominal and/or continuous). The reader is referred to the existing literature for an in-depth treatment of the topic [2022]. LR makes it possible to estimate the probability that a particular outcome of a dependent nominal variable (y) will occur based on information from one or more predictors (xm). The technique ultimately finds the equation that best predicts the probability p of getting a particular value of y, with p taking values from 0.0 to 1.0. If m is the number of predictors, the general form of the logistic regression model is:

Unlike the least-squares method used in linear regression, logistic regression finds the intercept (β0) and slopes (also termed logistic regression’s coefficients; β 1, β 2β m) of the best-fitting equation by means of the maximum-likelihood method, which is a computer-intensive technique that finds the values of the parameters under which you would be most likely to get the observed results [23]. The LR equation consists of values of the predictors plus weights estimated by the model to predict the outcome of the dependent variable [24]. Once logistic regression has been run, and the intercept and coefficients have been found, it is possible to calculate the probability of the outcome of y by plugging those parameters and any known value of the predictors into the logistic regression equation.

The model’s coefficients can be meaningful interpreted once they are exponentiated and thus expressed in terms of odds ratio [22]. An exponentiated coefficient of 1 leaves the odds for the positive outcome of the dependent variable unchanged, while a coefficient greater or smaller than 1 increases or decreases the odds respectively. For instance, if the presence of, say, a landslide is modelled as dependent on the terrain slope, and assuming that the latter has an estimated coefficient of 0.1871, its odds ratio is 1.206 (i.e., e0.1871). This indicates that a 1-unit increase in slope increases the odds of landslide by a factor of 1.206. In case of categorical predictors, the following interpretation holds. Let’s assume that the presence of landslides is modelled as dependent also on the type of soil, and the latter categorical predictor has three levels, soil A, B, and C. Usually, one of the levels is used as reference category and used as baseline for comparison. In other words, assuming that the reference level is soil A, an exponentiated coefficient of, say, 1.5 for soil B and of 0.50 for soil C indicate that B increases the odds for landslide by 1.5 relative to soil A, while C decreases the odds by 0.50 relative to the same reference level.

In this study, the analysis of the area under the ROC curve is used to assess the discriminatory power of the model [20,2527]. It plots the proportion of cases correctly classified as a positive outcome of the dependent variable (sensitivity) versus the proportion of cases incorrectly classified as a negative outcome (1 minus specificity) for the entire range of possible cut-off points on the model’s probability. The area under the ROC curve (AUC) provides an overall measure of the model’s ability to discriminate between the two outcomes of the dependent variable [20] in the dataset on which the model has been trained: the more the curve deviates from 45°, the higher is the model’s power. As a rule of thumb, the AUC value can be classified as follows [20]: discriminatory ability no better than chance (0.5), poor (0.5–0.7), acceptable (0.7–0.8), excellent (0.8–0.9), outstanding (0.9–1.0).

Rationale behind the choice of the model’s variables

A choice has been made as to what variables could be meaningfully used in this study. While, as touched upon earlier, the dependent variable is the quality of the land productivity as classified in the cabreo data, a critical choice needed to be made regarding the use of the whole cabreo qualitative scale. It was decided to collapse the cabreo classes into two broad ones, i.e. optimal (corresponding to the good class) vs. non-optimal quality (comprising the mediocre, bad, low, and lowest classes). This dichotomization was considered appropriate to this study’s goal of understanding which predictors are likely to have contributed to optimum agricultural yield. Of course, the model could be further refined in future stages of this research, making use of the whole cabreo classification, or collapsing categories in a different fashion.

A total of 17 predictors have been used in the model. We must acknowledge that modelling agricultural suitability is not a simple task and there is no unique set of criteria to be taken into consideration when studying agricultural potential [28]. A complex interplay between at least socio-cultural, economic, climatic, environmental, and topographic factors may indeed affect the inherent properties of the land in the context of its use as well as of its abandonment [28,29]. While they cannot be considered the only determinants of agricultural suitability, the predictors used in this study were deemed useful on the basis of the literature review and data availability (Table 1 and Fig 5).

Fig 5. Logistic regression model: Predictors.

Selection of predictors used in the cabreo model: (A) slope, (B) elevation, (C) sine of the aspect, (D) cosine of the aspect, (E) Topographic Wetness Index, (F) distance to the nearest geological fault-line, (G) distance to the coast-line, (H) distance to the nearest main, (I) secondary, (J) minor 1895 road, and (K) to the nearest footpath, (L) distance to the nearest 1895 urban area. See also Table 1. Image created by the authors in ESRI’s ArcGIS 10.1; no copyrighted material was used.

Table 1. Dependent and independent variables used for the logistic regression model.

While factors such as salinity and sodicity, soil texture, soil depth, winds, precipitation, and climate variability are likely to influence land quality in terms of suitability for agriculture [28,3033], the first eight variables listed in Table 1 have been considered potentially important for modelling agricultural suitability for their connection with soil moisture and water availability. Indeed they are widely used in the available literature [28,3436]. Moisture and availability of water are key factors for the understanding of the distribution patterns of vegetation and human activities in the Maltese archipelago [37,38]. As the table summarizes, some of the predictors affect the water flow, which influences the processes of soil erosion and deposition, which in turn affect soil depth and fertility. Terrain attributes can contribute to predict a significant portion of the spatial organization of sediments and microclimate [39], soil moisture [40], and soil properties [41]. As different authors point out [26,3133,4248], variation in slope, aspect, relative elevation, and curvature, affect the distribution of moisture near the land surface, with slope influencing infiltration, drainage, and runoff. Steeper slopes are likely to be drier than flat areas due to lower infiltration rates and higher surface runoff, and are also likely to have shallower soils. The amount of solar energy is also higher on steep slopes, since the quantity of solar radiation per unit area of the land surface decreases as the slope decreases. Steeper slopes are obviously more difficult to cultivate relative to more gentle sloping terrains [28,49]. Elevation is associated with lower temperatures and lower moisture content [28,42,44]. In fact, elevation can affect water retention since terrains at greater elevation may have more soil water draining down and are subject to receive less water from upslope [44,46]. Aspect (i.e., slope orientation) influences solar radiation, and hence evapotranspiration, soil moisture, and soil nutrients [41,42,46,50]. Slopes with different orientation are differentially subject to sunlight and prevailing winds. For these reasons, aspect is taken into consideration as an assessment criterion for the selection of the land to be used for agriculture [28,49]. Curvature, which can be further characterized as planform (i.e., perpendicular to the slope direction) and profile (i.e., in the direction of the slope) [51], influences convergence/divergence and acceleration/deceleration of rainwater flow since the latter is related to the convexity or concavity of the terrain. Where acceleration of flow occurs (convex profile curvature) erosion will be higher and soil water content lower, whereas in areas with concave profile curvature erosion is lower [52] and deceleration of flow allows water to accumulate [43]. A Topographic Wetness Index (TWI) has also been taken into account. A combination of parameters such as flow-accumulation and slope of a given cell was used to give an indication of the tendency of water to accumulate at any point of the area under study [43,51]. High values correspond to converging flat terrains, while low values are typical of steep and diverging areas [5355].

The Euclidean distance from the main geological faults has been used as a proxy for fresh water availability. Fault lines may facilitate infiltration of rainwater into the ground [56], also providing fissures and micro-fractures along which the water retained by geological clayish layers may escape [57]. The relation between springs and fault lines has been tested in a preliminary step of the present study. The distance to the nearest fault line of 49 Ghajn toponyms (place-names indicating the actual presence of a spring; located on the basis of an earlier work [58]) has been compared to the distance of 49 random points. The toponyms show a tendency to be closer to the fault lines (mean distance: 251 m) compared to random points (mean distance: 423 m). The mean difference in distance (172 m) is statistically significant (permuted p value based on 999 permutations: 0.011). As for the distance from the coastline, it has been considered as a predictor since lands close to the coast could be more subject to the negative effects of sea-spray and salt-laden air [37].

Soil type (Fig 6) has been chosen as a predictor (comprising 5 levels) since soils provide essential nutrients to plants and their different physical characteristics allow water and air to infiltrate, roots to explore, and biota to thrive [59].

Fig 6. Soil types.

Extent and distribution of soil types in Malta. Produced by georeferencing and digitizing the map published in [60]. Image created by the authors in ESRI’s ArcGIS 10.1; no copyrighted material was used.

Soils vary in their ability to drain and retain water and nutrients at a different pace, and are therefore differentially suitable for agriculture. The soil classification used for this study is that devised by Lang [60,61] in his detailed analysis of the soils of the Maltese Islands. He provided descriptions of the soils and of their distribution, mapping differences in chemistry, physics, and biology. His work was intended as an aid to agricultural planning and development in the study area. Interestingly, he also provided some remarks on the agricultural suitability of the different soil types, basing his observations on his first-hand knowledge of the Maltese man-made landscape [60,62].

In order to assess the possible influence of land accessibility on the agricultural quality, the distances to the nearest road (classified as main, secondary, or minor) and to the nearest footpath have also been taken into account as predictors (Figs 5 and 7).

Fig 7. 1895 road network in Malta.

Road network in Malta in 1895, broken down by road type. Footpaths and the 1895 urban area are also shown. Image created by the authors in ESRI’s ArcGIS 10.1; no copyrighted material was used.

Road networks may provide an important driving force in shaping the relation between human activity and spatial phenomena, such as urban growth [63], forest clearance and agricultural abandonment [12,64,65], land use changes and vegetation dynamics [66,67], and the spatial organization of land use [68]. The use of these predictors allows the importance of this cultural factor to be compared to that of environmental and topographic ones [64]. For the same reason [68], the distance to the nearest urban area (as it stood in 1895) has also been included as a predictor. Finally, the geographic coordinates have been entered as predictors (see later on).

Modelling strategy and sample size

Each independent variable was entered into ArcGIS (10.1) as a raster layer. The elevation layer is a Digital Terrain Model (1 m cell size) derived from LiDAR [69] data made available through an agreement signed between the University of Malta and the former Malta Environment and Planning Authority in 2013. Slope, aspect, and curvature layers were obtained from the DTM by using the appropriate tools in ArcGIS Spatial Analyst. Aspect was split into two components, aspect-cosine and aspect-sine [46,51,70], producing values from 1 to -1. North-facing slopes have aspect-cosine tending to 1, whereas south-facing ones tend to -1. East-facing slopes have aspect-sine tending to 1, whereas west-facing slopes will tend to -1. The TWI raster was produced in ArcGIS using the Geomorphometry and Gradient Metrics toolbox created by Jeffrey Evans and colleagues [71]. The raster layers expressing the distance from the coastline, geological faults, road types, and 1800s urban areas, were generated on the basis of vector data created by digitazing those features against different survey sheets, which have been preliminarily georeferenced. The 1993 geological map of the Maltese Islands (scale 1:25,000) [57] has been used for digitizing the fault lines, the geological formations (which have not been used as predictors, for the reasons explained later on in this paragraph), and the coast-line. For the road network and urban areas, the map of the Island of Malta (sketched and compiled by captain E. M. Woodward, Leicestershire Regiment D.A.A.G.) has been used (scale 1:21,120), dating to 1895. Finally, Lang’s map (scale 1:31,680) charting the distribution and extent of soil types in Malta has been used to feed soil information into GIS.

A set of random points (n = 16,643) with a minimum spacing of 20m was generated within the polygon layer representing the cabreo maps. Each point has been used as a sampling location [11] and has been given values of both the dependent variable and the predictors. The spacing of the points was chosen to alleviate spatial autocorrelation [72], which poses issues to traditional statistical methods in terms of inference, coefficients estimation, and assessment of the relative importance of predictors [7276]. Two additional steps were taken beside the 20m points spacing. First, a 25% random sub-sample of points (n = 3,897) was drawn from the larger universe of sampling points to further increase the between-points distance relative to the full dataset [63]. The subjective choice of the 25% fraction was made in an attempt to find a balance between: a) dealing with a more manageable sample, which could also be less prone to assure statistical significance to otherwise small and unimportant effects, just as a result of the increase in power due to the very large sample size [20,22]; b) assuring an adequate sample size to properly perform LR (as discussed later on). Secondly, the geographic coordinates of each sampling point have been entered in the model as predictors. Besides using an auto-covariate term as predictor [7274,76,77], adding geographic coordinates as extra predictors is well documented in the literature as a method to alleviate spatial autocorrelation [63,7883]. The presence of spatial autocorrelation in the model’s residuals has been nonetheless tested [8488] to formally check if the fitted model and the estimated coefficients can be considered reliable.

Predictors have been preliminarily checked for the presence of a strong correlation (i.e., collinearity) among them [89]. Pearson’s r has been calculated between pairs of predictors, and 0.70 has been considered as a critical threshold [8991]. Table 2 shows that there is no critical collinearity among predictors.

Variance Inflation Factor was also considered since no pair of predictors is critically correlated but there are several variables tied by interdependencies [90]. VIF shows how much the variance of the coefficient estimate is being inflated by multicollinearity [90]. In our case (Table 3), no predictor has a VIF larger than 10, which is the critical value suggested in the literature [89,90].

Geology has not been used as predictor since, in a preliminary step of the present study, it proved (as expected) strongly correlated to soil types (Table 4).

Table 4. Cross-tabulation of Maltese soil types (after [60]) (rows) against geological formation (columns); the association is as expected strong and significant (see text).

This was assessed using chi-square test (chi-square: 2864.67, df: 12, p: <0.0001) and Pearson’s phi coefficient (0.86) [92].

As for sample size for LR [20], we followed Peduzzi et al. [93] who provide a guideline for calculating the minimum sample size: considering at least 10 observations per predictor, divided by the proportion of negative (‘non-optimal’ quality; n = 2,965) or positive cases (‘optimal’ quality; n = 932), whichever is smaller. In our case, the minimum size is 708 observations (i.e., 10x17:0.24). The sample used in this study is more than five times larger than the minimum necessary for a model with 17 predictors. The dataset on which the model has been built is available in a tab-delimited.txt file (which can be easily imported into any statistical program) provided as Supporting Information (S1 Text. Dataset).

Predictors selection and model validation

The “best” model was selected via the backward stepwise procedure implemented by D. Rizopoulos’s boot.StepAIC R package [94]. While mixed opinions exist about stepwise procedures [20,9597], the aim of such an approach is to isolate a parsimonious model. The principle of parsimony suggests avoiding models with unnecessary complexities, and to give preference to simpler models (in comparative terms) that explain the greatest amount of data variability with the lowest level of complexity [98,99]. The package implements the model selection devised by Austin and Tu [100], widely used in literature [101104]. The assessment of how much the selected model is able to generalize outside the training data (i.e., model validation [20,27,105,106]) has been performed by means of internal validation, following the method described by Arboretti Giancristofaro-Salmaso [27], which has been implemented in R [107,108]. Further details about both procedures are provided as Supporting Information (S2 Text. Predictors selection and model validation).


Eleven predictors out of 17 candidates can be considered truly independent ones since they were selected in practically all the 1000 bootstrap resamples (Table 5).

Table 5. Percentage of times in which each candidate predictor was selected using the backward stepwise model selection in 1000 bootstrap samples, following the method devised by [100] and implemented in R by [94].

The six excluded predictors (planform curvature, TWI, profile curvature, distance to the nearest urban area, cosine of the aspect, distance to the nearest main road) were selected only in tiny fractions of the samples, and their estimated coefficients were unstable, indicating that they do not contribute to the prediction of the outcome of the dependent variable (Table 6).

Table 6. Percentage of times in which each candidate predictor resulted having a positive or negative coefficient during the backward stepwise model selection in 1000 bootstrap samples.

The model comprising the selected predictors is statistically significant: the p value of the difference between the null and the full model is well below 0.01, indicating that the predictors significantly affect the outcome of the dependent variable. The discriminatory power of the model can be considered excellent/outstanding, according to the 5-tiered scale described earlier: the AUC value is 0.92. As for model validation (Fig 8A), the fitting distribution of AUC is excellent, with a median value which is practically identical to the AUC for the original full dataset, and a minimum value of 0.91 which still points to an excellent/outstanding discriminatory power.

Fig 8. Logistic regression model diagnostic.

(A) Internal validation following the procedure devised by [27] and implemented in R [108]; ‘AUC training’ provides an estimate of the performance of the model in the population of all the theoretical training samples; ‘AUC testing’ represents an estimate of the model’s performance on new and independent data. The latter indicates that the model excellently discriminates both on the original sample and outside it. (B) Spatial correlogram, returned by the SAM program (v. 4.0), showing a negligible spatial autocorrelation among the model’s residuals.

The validation distribution of AUC is excellent, with about 50% of the values falling between 0.91 and 0.92, and nearly 25% of them being larger than 0.89. This indicates that the model excellently discriminates both on the original sample and outside it. No spatial autocorrelation proves present in the model residuals. The spatial correlogram returned by the SAM 4.0 program [109] shows that the values of Moran’s I are extremely close to 0 along all the 22 distance classes (Fig 8B). Their statistical significance is just the result of the very large sample size.

The details of the fitted model are summarized in Table 7.

The estimated coefficients and the constant were fed into ArcGIS via the Raster Calculator facility in order to produce a raster representing the fitted model. The raster has been given a colour scale reflecting the probability for ‘optimal’ land quality, ranging from the lowest (red = 0.0 probability) to the highest (green = 1.0 probability) (Fig 9).

Fig 9. Fitted logistic regression model.

Fitted cabreo model; colours represent the probability (ranging between 0.0 and 1.0) for optimal agricultural quality. Probability values have been categorized into five classes spanning from very low to very high. The location and extent of public spaces in mid-1800s Malta, and the location of farmhouses in which stables are recorded, is also shown. The latter information derives from the sample of cabreo maps described in the text. Image created by the authors in ESRI’s ArcGIS 10.1; no copyrighted material was used.

As for the predictors related to topography, elevation and slope have a negative effect on the chances for optimal agricultural quality: when they increase by 1 unit, the odds of optimal agricultural quality decrease by a factor of 0.96 and 0.97 respectively (or, put another way, the odds are 0.96 and 0.97 times as small for an additional metre of elevation and degree of slope, respectively). The sine of the aspect turns out to have a positive effect according to the model: as it increases (i.e., turning from west to east), the odds of optimal agricultural quality increase by 1.42. The distance to the coastline and to the nearest geological fault have an opposite effect on the outcome of the dependent variable. As the distance to the coastline increases by 1 unit, the odds of optimal agricultural quality increase by 2.42, whereas 1-unit increase in distance to the nearest fault line decreases the odds by 0.09.

The fitted model also allows to assess the effect of different soil types on the outcome of the dependent variable. Brown Rendzinas soils are associated with an increase in the odds of optimal agricultural quality by a factor of 2.31, and the same holds true for Carbonate Raw and Xerorendzinas soils, by a factor of 2.62 and 9.23 respectively. On the other hand, Terra Rossa soils are associated with a decrease in the odds by 0.46.

As for the cultural predictors related to land accessibility (i.e., distance to the road network), the distance to the nearest secondary road and to the nearest footpath was found to have a negative effect on the odds of optimal agricultural quality. As the model indicates, a 1-unit increase in those variables translates into a decrease of the odds by 0.19 and 0.41 respectively. On the other hand, a 1-unit increase in the distance to the nearest minor road is associated with an increase in the odds of optimal agricultural quality by a factor of 4.88.


The model made it possible to isolate a host of variables that had an influence, either positive or negative, on the suitability for agriculture at the time when the cabreo was created. The literature reviewed earlier in this work provides grounds to understand the negative impact of elevation and slope. At least three factors may account for the negative contribution of elevation. As already mentioned, relative elevation affects water retention: terrain at higher elevation drain more readily and receives less water from upslope. In addition, terrain at lower elevations is generally better sheltered against the negative effects of winds [37]. Furthermore, changes in elevation go hand-in-hand with changes in temperature. The negative impact of the slope on agricultural quality is consistent with many aspects of the reviewed literature. Steep slopes can in fact be considered a handicap [32] for agricultural areas not only because they hamper the use of mechanical devices [30], but also because slope influences solar radiation, water infiltration rates, surface runoff, moisture, erosion, and depth of soils. The practice of terracing aims at reducing some of these negative effects [110,111]. The modelled increase in the odds of optimal agricultural quality as the terrain starts to gently slope down is consistent with Rolé’s [38] remarks describing an increase in productivity as ones moves from mid-slope to the valley, which is also characterised by deeper soils [62]. This was already stressed by Lang [61], who noted that alluvial flats, flatter lands and (notably) damper valley bottoms were usually heavily cultivated in Malta. An interesting parallel is provided, for instance, by northwestern Keos (in the Aegean area) where in the early 1900s agricultural suitability was shaped by different factors, including physical ones. As Cherry et al. [68] argue, valley bottoms were preferred because they were more accessible, easier to irrigate, and characterised by deeper soils.

The influence of eastern aspects as opposed to western ones, with the former being more suitable for agriculture, turns out to be an interesting result. On the grounds of what was reviewed earlier in this study, it can be posited that east-facing slopes were more favourable for being relatively cooler than those exposed to south and west, so retaining more soil moisture. This can be considered a crucial factor for agriculture especially in a climate like that of the Maltese archipelago, where summers are hot and dry, and springs are characterised by rainfall deficit [62], with a resulting high rate of evapo-transpiration [112]. On the other hand, the decrease in the odds of optimal agricultural quality associated with west-facing slopes is consistent with the fact that slopes facing west and southwest receive a greater amount of solar radiance [43], resulting in drier conditions and in a different microclimate at ground level. Given that young plants and seedlings are killed by heat stresses [31], e.g. by temperature exceeding 38 degrees Celsius [37], it makes perfect sense that the cooler slopes have a positive effect on the chances for optimal agricultural quality. Eastern exposures also benefit from morning sun that allows plants to dry from dew or rain sooner than those on western slopes. Moreover, in the study area, east-facing slopes are more sheltered from the prevailing winds. Meteorological data gathered between 1997–2006 indicate that the most frequent direction is from northwest, followed in frequency by westerly winds [113].

The positive impact of the distance to the coastline can be explained in the light of what was touched upon previously. Being relatively distant from the coast implies being less prone to sea-spray and salt-laden air. On the other hand, the model shows that the distance to the nearest geological fault line has a negative impact on the odds of optimal agricultural quality. Since this predictor has been used as proxy for fresh water availability, the model seems to coherently indicate that being progressively far from a fault line translates into a decrease in the chances to have access to fresh water. This can be easily considered a crucial factor for agricultural development in an arid climate such as the Maltese one [62].

Another interesting achievement of the study is the possibility of assessing the contribution of different soil types to the agricultural quality as recorded in the cabreo. The model confirms many of the empirical observations made by Lang in the 1960s, and there seems to be a good correspondence between his remarks on land productivity and the agricultural suitability as predicted by the fitted model. The soils defined by Lang as giving satisfactory crops were Xerorendzinas, Brown Rendzinas, and Carbonate Raw [61]. The first two belong to those Rendzina soils that, elsewhere in Europe [114,115], are regarded as having favourable physical properties, characterised by high water infiltration rates when wet and high water-holding capacity, resulting in high biological activity and high natural fertility. According to the model, they are associated with an increase in the odds of optimal agricultural quality, which is consistent with Lang’s remarks. The latter are consistent with the model’s results also in relation to Terra Rossa soils, which were rather dry, compact, and difficult to cultivate and were usually left uncultivated, or used for bird catching or sheep/goat grazing [61]. Notably, the model proposed in this study pointed to this type of soil as a negative factor for optimal agricultural quality. It is worth stressing that the areas for which the model estimated a low probability of optimal agricultural quality can be nonetheless thought of as being potentially good for purposes other than agriculture. Indeed, historical and ethnographic sources from Malta and elsewhere reveal that thin-soiled and scrub-covered karstland was used for a variety of purposes that turned an apparently unproductive landscape into an important part of the agrarian economy. They actually provided grazing grounds for sheep and goats, quarried stone for construction, brushwood for fuel, apart from herbs, greens, wild game, and flowering plants for bee pasture [116118]. It is also worthy of note that in our cabreo sample farmhouses incorporating pens for traction animals and others (stalle, or stables) tend to occur in areas with predicted low probability for optimal agriculture, or on the fringe of good agricultural zones (see the aforementioned Fig 9). Moreover, public spaces (spazi pubblici) or wasteland occur amongst such uncultivated areas. If this form of opportunistic exploitation of a common resource and resulting economic return is often hard to assess, the investment in demarcating such apparently unproductive areas with rubble walls and ensuring access to them for humans and herds through walled paths or tracks is hard to miss. As a matter of fact, our data indicate that there is a tendency for stables to be located close to public spaces. A randomized test, performed with the aid of the PASSaGE v.2 program [119], shows that the average minimum distance is 635 m, which is significantly smaller (p: <0.01) than the randomized average minimum distance of 1141 m, calculated across 999 permutations. It would appear, therefore, that there is a ‘symbiosis’ between the location of farmhouses incorporating stables, on the one hand, and these public spaces on the other. Indeed, we are currently investigating the latter aspect in the framework of the same project to which reference has been made earlier on in this study. Employing the fitted cabreo model as constrain, we will use GIS to isolate potential foraging routes from stables toward those areas known from ethnographic accounts for being used as pastures. Among other things, the likely routes (generated via Least-Cost Path analysis) will be compared to information derived from interviews of local shepherds and to evidence regarding the spatial distribution of disappeared villages originally connected to the movement of flocks across the landscape [120]. All in all, our research into the pastoral foraging landscape will aim at exploring the ways in which zones flagged by the cabreo model as non optimal for agriculture could have been used for other aspects of the economic exploitation of the Maltese landscape.

The model’s results regarding land accessibility are also interesting. The analysis formally shows that accessibility had an effect on agricultural suitability; in particular, the more distant a plot was from secondary roads, the lesser the odds of optimal agricultural quality. This makes sense inasmuch the secondary road network can be thought of as allowing a gradual shift from urbanized areas to more peripheral zones, going deeper into the landscape relative to the main road system (which, remarkably, turned out not to have a significant contribution to the model), and providing access to the countryside. Seen from this perspective, it is not by chance if being distant from secondary roads (i.e., being in less accessible plots of land) decreases the odds of optimal agricultural quality, according to the model. While this proves an interesting new acquisition for the study area, it is something that has been stressed in ethnoarchaeological studies of rural settlements and land use elsewhere in the Mediterranean. For instance, Cherry et al. [68] have stressed that in northwestern Keos land accessibility was one of the factors influencing the decision to cultivate a particular land as of early 1900s, before the use of motorized transport became widespread. This situation now finds an interesting parallel in mid-1800s Malta.

The issue of land accessibility and its interpretation holds true for the distance to the nearest footpath, which has a similar negative effect: as seen, the increase in distance to footpaths is associated with a decrease in the odds for optimal land quality. It must be noted that the model’s results for the distance to the nearest minor road seem counterintuitive, and deserve comments. If seen from the standpoint of the secondary road network and of its ability to make the landscape accessible, one would expect minor roads to have an effect similar to that of secondary roads. Yet, unlike secondary roads, the distance to the nearest minor road is associated with an increase in odds, i.e. the larger the distance the larger the odds for optimal quality. A review of the literature shows that proximity to roads may alter the chemical composition of adjoining soils, leading to a decrease in fertility [121123]. Besides chemical factors, physical ones may also account for a decrease in agricultural suitability: S. Vella [62], for instance, argues that roads increase the quantity and velocity of runoff, enhancing soil erosion and damage to rubble walls. She goes on saying that loose materials used for surfaced tracks release gravel, which may increase the erosive ability of runoff. From this standpoint, the model result as to distance to the nearest minor road may prove less counterintuitive, and may help explaining the modelled increase in the odds of optimal agricultural quality as one moves away from minor roads. Such a scenario may explain what is happening in the immediate vicinity of a road. However, it may well be that minor roads occur in areas of garrigue and karstland which in the model correspond to an area less favourable for agriculture. Optimal land quality would be found further away from these, downslope or in valley bottoms, areas highlighted by the model for their agricultural suitability.

All in all, our findings shed considerable new light on some of the questions posed in the introduction to this paper. One of the striking characteristics displayed both by the archival evidence and by the resulting model is the wide variability in land quality that is evident even over small distances. The rugged profile of the island presents dramatically different micro-environments which typically range from exposed plateaux, to well-watered but steep clay slopes, to more sheltered valley bottoms with rich and deep alluvial and colluvial fills. Each of these micro-environments may present very different agricultural affordances. As stressed, some may be unsuitable for crop cultivation, but ideal for grazing for sheep and goat. The fragmented and variable nature of the Maltese landscape is interestingly an enduring characteristic, which would have been no less variable in more remote periods, and even during prehistory. Each of these environments may of course have undergone considerable transformations over long periods of time, resulting in very different agricultural affordances in different periods. However this does not alter the basic fact that the Maltese landscape, like many small island environments, presents a range of different agricultural opportunities in close juxtaposition. Mixed subsistence strategies, such as those combining animal grazing and food collecting on more inhospitable areas, with crop cultivation in more sheltered and favourable zones, appear to be better suited to such an environment. This characteristic should be taken into account in any research efforts to reconstruct prehistoric and ancient subsistence strategies in such landscapes.


This work has sought to use for the first time mid-1800s cabreo maps as a basis to develop a statistical model in a GIS environment to understand possible determinants of agricultural quality in Malta before heavy mechanization. Logistic regression modelling has made it possible to gauge the effect of a host of topographic and cultural factors on the agricultural quality, enabling us to build a predictive model for the entire study area. It has been therefore possible to isolate a negative contribution of some topographic factors such as elevation, slope, distance to the nearest geological fault line (used as proxy for fresh water availability). Other factors, such as distance to the coast, turned out to have a positive effect on the chances for optimal agricultural quality. The model has also indicated that different soil types had different effects on agricultural quality, with Terra Rossa proving to have a negative influence on agriculture relative to other types of soils. This proved to be consistent with observations published almost a century later, hence pointing to an interesting continuity in some constrains to optimal agriculture in the Maltese landscape. Remarkably, the analysis also showed that cultural factors, such as roads network and related mobility and landscape accessibility, had a constraining role as well, in a way consistent with findings in other Mediterranean islands. It has also been stressed that wide zones flagged as non-optimal for agriculture by the model must have been nonetheless good for a host of other activities such as grazing grounds, quarrying, fuel procurement, as both documentary evidence and the location of farmhouses incorporating stables indeed indicate. They were therefore part of a broader economic picture. All in all, our model has shown that different factors are likely to have shaped the agricultural landscape of the Maltese Islands. A host of topographic and cultural factors, the latter related to human mobility and landscape accessibility, did contribute to differential agricultural suitability. They provided the bases to the creation of that fragmented and extremely variegated agricultural landscape that is the hallmark of the Maltese Islands.

Supporting information

S2 Text. Predictors selection and model validation.



We acknowledge with thanks the ERC Advanced Grant 323727 FRAGSUS that supported this work. We wish to express our warmest gratitude to the Academic Editor, Dr Hugo Rebelo, for the competence and patience shown during the long editorial process through which this study has gone. We also thank the anonymous reviewers for their constructive and encouraging comments, which allowed us to significantly improve the quality and structure of the final version of the article. We are grateful to Dr Charles Farrugia, National Archivist (Malta), for permitting the procurement of copies of the cabreo documentation for the research presented here, and for granting (on November 23 2017) the authorization to use the cabreo map represented in Fig 2. Prof. Saviour Formosa and Dr Timmy Gambin, both of the University of Malta, facilitated the donation of LiDAR data and orthophotos, together with computer facilities, as part of the European project ERDF156 Developing National Environmental Monitoring Infrastructure and Capacity, from the former Malta Environment and Planning Authority. We would also like to acknowledge the Principal Investigator of the FRAGSUS project, Prof. Caroline Malone (Queen’s University, Belfast, UK) for her interest and support in seeing this research through to publication. The authors are solely responsible for any error or misunderstanding that may remain.


  1. 1. Fenech K. Human-induced Changes in the Environment and Landscape of the Maltese Islands from the Neolithic to the 15th Century AD as inferred from a scientific study of sediments from Marsa, Malta. Oxford: Archaeopress; 2007.
  2. 2. Mottershead D, Farres P, Pearson A. The changing Maltese soil environment: evidence from the ancient cart tracks at San Pawl Tat-Tarġa, Naxxar. Geol Soc London, Spec Publ. 2010;331: 219–229.
  3. 3. Marriner N, Gambin T, Djamali M, Morhange C, Spiteri M. Geoarchaeology of the Burmarrad ria and early Holocene human impacts in western Malta. Palaeogeogr Palaeoclimatol Palaeoecol. 2012;339–341: 52–65.
  4. 4. Bevan A, Conolly J. Mediterranean Islands, Fragile Communities and Persistent Landscapes. Cambridge: Cambridge University Press; 2013.
  5. 5. Vella NC, Spiteri M. Documentary sources for a study of the Maltese landscape. Storja.: 16–29.
  6. 6. Ginori Lisci L. Cabrei in Toscana. Raccolte di mappe prospetti e vedute sec. XVI-sec. XIX. Firenze: Cassa di Risparmio di Firenze; 1978.
  7. 7. Caucci von Saucken P, Critien JE, Dean M, Eroli G, Sottani R, Velatta A. Cabrevatio Bonorum. Priorati, Baliaggi e Commende dell’Ordine di Malta. Perugia: Benucci Editore; 1997.
  8. 8. Spiteri M, Borg D. The formation of the architect-engineer, perito and agrimensore, and their regulation by the Order of St Johnon eighteen century Malta. J Baroque Stud. 2015;1: 129–173.
  9. 9. Filonzi FA. Pratiche matematiche divise in tre trattati. Ancona: Stamepria di Michelangelo Sartori; 1775.
  10. 10. Yesilnacar E, Topal T. Landslide susceptibility mapping: A comparison of logistic regression and neural networks methods in a medium scale study, Hendek region (Turkey). Eng Geol. 2005;79: 251–266.
  11. 11. Wang L, Sawada K, Moriguchi S. Landslide Susceptibility Mapping by Using Logistic Regression Model with Neighborhood Analysis: A Case Study in Mizunami City. Int J Geomate. 2011;1: 99–104. Available:
  12. 12. Arekhi S. Modeling spatial pattern of deforestation using GIS and logistic regression: A case study of northern Ilam forests, Ilam province, Iran. African J Biotechnol. 2011;10: 16236–16249.
  13. 13. Alberti G. Modeling Group Size and Scalar Stress by Logistic Regression from an Archaeological Perspective. PLoS One. 2014;9: e91510. Available: pmid:24626241
  14. 14. Westcott KL, Brandon RJ. Practical Application of GIS for Archaeologist. A Predictive Modeling Kit. London: Taylor & Francis; 2000.
  15. 15. Porčić M. House Floor Area as a Correlate of Marital Residence Pattern: A Logistic Regression Approach. Cross-Cultural Res. 2010;44: 405–424.
  16. 16. Warren R, Asch D. A Predictive Model of Archaeological Site Location in the Eastern Prairie Peninsula. Practical Applications of Gis for Archaeologists A Predictive Modeling Kit. London: Taylor & Francis; 2000. pp. 7–36.
  17. 17. Zhang ZX, Zhang HY, Zhou DW. Using GIS spatial analysis and logistic regression to predict the probabilities of human-caused grassland fires. J Arid Environ. Elsevier Ltd; 2010;74: 386–393.
  18. 18. Ayalew L, Yamagishi H. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan. Geomorphology. 2005;65: 15–31.
  19. 19. Costanzo D, Chacón J, Conoscenti C, Irigaray C, Rotigliano E. Forward logistic regression for earth-flow landslide susceptibility assessment in the Platani river basin (southern Sicily, Italy). Landslides. 2013;
  20. 20. Hosmer DW, Lemeshow S, Sturdivant R. Applied Logistic Regression. Third. Hoboken, NJ, USA: John Wiley & Sons, Inc.; 2000.
  21. 21. Allison PD. Logistic Regression Using the SAS System: Theory and Application. Cary: Wiley-Blackwell; 2001.
  22. 22. Pampel FC. Logistic Regression: A Primer. London: SAGE; 2000.
  23. 23. McDonald J. Handbook of Biological Statistics. Baltimore: Sparky House Publishing; 2009.
  24. 24. Cokluk O. Logistic Regression: Concept and Application. Educ Sci Theory Pract. 2010;10: 1397–1407. Available:
  25. 25. Peng CJ, Lee KL, Ingersoll GM. An Introduction to Logistic Regression Analysis and Reporting. J Educ Res. 2002;96: 3–14.
  26. 26. Park HA. An introduction to logistic regression: from basic concepts to interpretation with particular attention to nursing domain. J Korean Acad Nurs. 2013;43: 154–64. pmid:23703593
  27. 27. Arboretti Giancristofaro R, Salmaso L. Model performance analysis and model validation in logistic regression. Statistica. 2003;63: 375–396. Available:
  28. 28. Akıncı H, Özalp AY, Turgut B. Agricultural land use suitability analysis using GIS and AHP technique. Comput Electron Agric. 2013;97: 71–82.
  29. 29. Prishchepov AA, Müller D, Dubinin M, Baumann M, Radeloff VC. Determinants of agricultural land abandonment in post-Soviet European Russia. Land use policy. Elsevier Ltd; 2013;30: 873–884.
  30. 30. Van Orshoven J, Terres JM, Toth T. Updated common bio-physical criteria to define natural constraints for agriculture in Europe [Internet]. Luxembourg; 2013. Available:
  31. 31. Van Orshoven J, Terres J, Eliasson A. Common bio-physical criteria to define natural constraints for agriculture in Europe. Luxembourg; 2008.
  32. 32. Smit A, Brouwer F. Natural handicaps in Dutch agricultural areas: assessment of less favoured areas based on biophysical criteria. Alterra-report. Wageningen; 2009.
  33. 33. Clarke MJ. Akha Feasting. In: Dietler M, Hayden B, editors. Feasts Archaeological and Ethnographic Perspective on Food, Politics, and Power. Washington-London; 2001. pp. 144–167.
  34. 34. AbdelRahman MAE, Natarajan A, Hegde R. Assessment of land suitability and capability by integrating remote sensing and GIS for agriculture in Chamarajanagar district, Karnataka, India. Egypt J Remote Sens Sp Sci. 2016;19: 125–141.
  35. 35. Bandyopadhyay S, Jaiswal RK, Hegde VS, Jayaraman V. Assessment of land suitability potentials for agriculture using a remote sensing and GIS based approach. Int J Remote Sens. 2009;30: 879–895.
  36. 36. Ahmed GB, Shariff ARM, Balasundram SK, Fikri bin Abdullah A. Agriculture land suitability analysis evaluation based multi criteria and GIS approach. IOP Conf Ser Earth Environ Sci. 2016;37: 12044.
  37. 37. Schembri P. The Maltese Islands: climate, vegetation and landscape. GeoJournal. 1997;41: 115–125.
  38. 38. Rolé A. The Terraced Landscapes of the Maltese Islands Malta. In: Pedroli B, Van Doom A, De Blust G, Paracchini M, Wescher D, Bunce F, editors. Europe’s living landscape Essays on exploring our identity in the countryside. KNNV Uitgeverij; 2007. pp. 405–420.
  39. 39. Collerton H, Hock B, Hawke R, Payn T. Digital terrain modelling for site productivity assessment and stand management in plantation forestry. J For. 2008;53: 33–41.
  40. 40. Western AW, Grayson RB, Blöschl G, Willgoose GR, McMahon TA. Observed spatial organization of soil moisture and its relation to terrain indices. Water Resour Res. 1999;35: 797–810.
  41. 41. Begum F, Bajracharya RM, Sharma S, Sitaula BK. Influence of slope aspect on soil physico-chemical and biological properties in the mid hills of central Nepal. Int J Sustain Dev World Ecol. 2010;17: 438–443.
  42. 42. Famiglietti J, Rudnicki J, Rodell M. Variability in surface moisture content along a hillslope transect: Rattlesnake Hill, Texas. J Hydrol. Elsevier; 1998;210: 259–281.
  43. 43. Sulebak JR, Tallaksen LM, Erichsen B. Estimation of areal soil moisture by use of terrain data. Geogr Ann Ser A, Phys Geogr. 2000;82: 89–105. Available:
  44. 44. Taghizadeh Mehrjardi R, Akbarzadeh A, Biniyaz M, Mahmoodi S, Zare Chahouki M. Studying the effect of elevation and edaphic variables on vegetation composition in Khezrabad rangelands using principal component analysis (PCA). Analele Univ di Oradea, Fasc Biol. 2009;16: 155–161. Available: Taghizadeh.pdf
  45. 45. Tromp-van Meerveld HJ, McDonnell JJ. On the interrelations between topography, soil depth, soil moisture, transpiration rates and species distribution at the hillslope scale. Adv Water Resour. 2006;29: 293–310.
  46. 46. Qiu Y, Fu B, Wang J, Chen L. Spatial variability of soil moisture content and its relation to environmental indices in a semi-arid gully catchment of the Loess Plateau, China. J Arid Environ. 2001;49: 723–750.
  47. 47. Chang JH. Climate and Agriculture. An Ecological Survey. New Brunswick-London: Aldine Transaction; 2009.
  48. 48. Rosenberg NJ, Blad BL, Verma SB. Microclimate: The Biological Environment. New York: Wiley; 1983.
  49. 49. Stewart K. Storey’s Guide to Growing Organic Vegetables & Herbs for Market- Site & Crop Selection—Planting, Care & Harvesting—Business Basics. Storey Publishing; 2013.
  50. 50. Reid I. The influence of slope orientation upon the soil moisture regime, and its hydrogeomorphological significance. J Hydrol. 1973;19: 309–321.
  51. 51. Butler D, Malanson G, Walsh S, Fagre D. The Changing Alpine Treeline: The Example of Glacier National Park, MT, USA. First. Amsterdam: Elsevier Science; 1 edition; 2009.
  52. 52. Sensoy H, Kara O. Slope shape effect on runoff and soil erosion under natural rainfall conditions. iForest—Biogeosciences For. 2014;7: 110–114.
  53. 53. Palmisano A. Settlements Patterns and Interactions in The West Bank highlands in the Iron Age I period. University College of London. 2009.
  54. 54. Andresen J. Topographic Wetness Index and Prehistoric Land Use. In: Posluschny A, Lambers K, Herzog I, editors. Layers of Perception Proceedings of the 35th International Conference on Computer Applications and Quantitative Methods in Archaeology (CAA). Bonn: Dr. Rudolf Habelt GmbH; 2008. pp. 405–410. Available:
  55. 55. Carrer F. An ethnoarchaeological inductive model for predicting archaeological site location. A case-study of pastoral settlement patterns in the Val di Fiemme and Val di Sole (Trentino, Italian Alps). J Anthropol Archaeol. 2013;32: 54–62.
  56. 56. De Ketelaere D, Spiteri A, Vella J. The Natural Heritage of the Island of Gozo, Malta. In: Evelpidou N, de Figueiredo T, Mauro F, Tecim V, Vassilopoulos A, editors. Natural Heritage from East to West Case studie from 6 EU Countries. Berlin-Heidelberg: Springer; 2010. pp. 231–256.
  57. 57. Pedely M, Hughes Clarke M, Galea P. Limestone Isles in a Crystal Sea. The geology of the Maltese Islands. San Gwann: Publisher Enterprise Group; 2002.
  58. 58. Grima R. Monuments in search of a landscape: the landscape context of monumentality in late Neolithic Malta. University College London. 2005.
  59. 59. Schaetzl RJ, Anderson S. Soils: Genesis and Geomorphology. Cambridge: Cambridge University Press; 2005.
  60. 60. Lang DM. Soils of Malta and Gozo. London: Her Majesty’s Stationery Office; 1960.
  61. 61. Lang DM. Soils of Malta and Gozo. In: Bowen-Jones H, Dewdney J, Fisher W, editors. Malta Background for Development. Durham: Department of Geography, Durham Colleges; 1961. pp. 83–98.
  62. 62. Vella S. Soil information in the Maltese Islands. In: Zdruli P, Steduto P, Lacirignola C, Montanarella L, editors. Soil resources of Southern and Eastern Mediterranean countries. Bari: CIHEAM; 2001. pp. 171–191. Available:
  63. 63. Hu Z, Lo CP. Modeling urban growth in Atlanta using logistic regression. Comput Environ Urban Syst. 2007;31: 667–688.
  64. 64. Flinn KM, Vellend M, Marks PL. Environmental causes and consequences of forest clearance and agricultural abandonment in central New York, USA. J Biogeogr. 2005;32: 439–452.
  65. 65. Ludeke AK, Maggio RC, Reid LM. An analysis of anthropogenic deforestation using logistic regression and GIS. J Environ Manage. 1990;31: 247–259.
  66. 66. Foster R. Land-use history (1730–1990) and vegetation dynamics in central New England, USA. J Ecol. 1992;80: 753–771. Available:
  67. 67. Serneels S, Lambin EF. Proximate causes of land-use change in Narok district, Kenya: A spatial statistical model. Agric Ecosyst Environ. 2001;85: 65–81.
  68. 68. Cherry JF, Davis JL, Mantzourani E. Landscape Archaeology as Long-Term History. Nothern Keos in the Cycladic Islands. Los Angeles: Institute of Archaeology, University of California; 1991.
  69. 69. Shan J, Toth CK. Topographic Laser Ranging and Scanning. Principles and Processing. Boca Raton: CRC Press; 2008.
  70. 70. Jenness J. Some thoughts on analyzing topographic habitat characteristics. Flagstaff; 2007.
  71. 71. Evans J, Oakleaf J, Cushman S, Theoblad D. A Toolbox for Surface Gradient Modeling. Jeffrey S. Evans spatial ecology website [Internet]. 2014. Available:
  72. 72. Miller J, Franklin J, Aspinall R. Incorporating spatial dependence in predictive vegetation models. Ecol Modell. 2007;202: 225–242.
  73. 73. Dormann CF. Effects of incorporating spatial autocorrelation into the analysis of species distribution data. Glob Ecol Biogeogr. 2007;16: 129–138.
  74. 74. Bo Y-C, Song C, Wang J-F, Li X-W. Using an autologistic regression model to identify spatial risk factors and spatial risk patterns of hand, foot and mouth disease (HFMD) in Mainland China. BMC Public Health. 2014;14: 358. pmid:24731248
  75. 75. Beale CM, Lennon JJ, Yearsley JM, Brewer MJ, Elston D a. Regression analysis of spatial data. Ecol Lett. 2010;13: 246–264. pmid:20102373
  76. 76. Wang F. Quantitative Methodsand Socio-Economic Applications in GIS. Boca Raton: CRC Press; 2014.
  77. 77. Augustin NH, Mugglestone MA, Buckland ST. An autologistic model for the spatial distribution of wildlife. J Appl Ecol. 1996;33: 339–347.
  78. 78. de Frutos Á, Olea PP, Vera R. Analyzing and modelling spatial distribution of summering lesser kestrel: The role of spatial autocorrelation. Ecol Modell. 2007;200: 33–44.
  79. 79. Matsui T, Yagihashi T, Nakaya T, Tanaka N, Taoda H. Climatic controls on distribution of Fagus crenata forests in Japan. J Veg Sci. 2004;15: 57–66.
  80. 80. Preisler HK, Rappaport NG, Wood DL. Regression Methods for Spatially Correlated Data: An Example Using Beetle Attacks in a Seed Orchard. 1997;43: 71–77.
  81. 81. Franklin J. Predicting the distribution of shrub species in southern California from climate and terrain-derived variables. J Veg Sci. 1998;9: 733–748.
  82. 82. Bustamante J, Seoane J. Predicting the distribution of four species of raptors (Aves: Accipitridae) in southern Spain: Statistical models work better than existing maps. J Biogeogr. 2004;31: 295–306.
  83. 83. Yost AC, Petersen SL, Gregg M, Miller R. Predictive modeling and mapping sage grouse (Centrocercus urophasianus) nesting habitat using Maximum Entropy and a long-term dataset from Southern Oregon. Ecol Inform. 2008;3: 375–386.
  84. 84. Diniz-Filho JAF, Bini LM, Hawkins BA. Spatial autocorrelation and red herrings in geographical ecology. Glob Ecol Biogeogr. 2003;12: 53–64.
  85. 85. Warren TL, Betts MG, Diamond AW, Forbes GJ. The influence of local habitat and landscape composition on cavity-nesting birds in a forested mosaic. For Ecol Manage. 2005;214: 331–343.
  86. 86. Michel JF, Dray S, De La Rocque S, Desquesnes M, Solano P, De Wispelaere G, et al. Modelling bovine trypanosomosis spatial distribution by GIS in an agro-pastoral zone of Burkina Faso. Prev Vet Med. 2002;56: 5–18. pmid:12419596
  87. 87. Klute DS, Lovallo MJ, Tzilkowski WM. Autologistic regression Modeling of American Woodcock Habitat Use with Spatially dependent data. In: Scott JM, Heglund P, Morrison ML, editors. Predicting Species Occurrences: Issues of Accuracy and Scale. Island Press; 2002. pp. 335–343.
  88. 88. Nuñez MA, Medley KA. Pine invasions: climate predicts invasion success; something else predicts failure. Divers Distrib. 2011;17: 703–713.
  89. 89. Dormann CF, Elith J, Bacher S, Buchmann C, Carl G, Carré G, et al. Collinearity: A review of methods to deal with it and a simulation study evaluating their performance. Ecography (Cop). 2013;36: 027–046.
  90. 90. Midi H, Sarkara SK, Ranaa S. Collinearity diagnostics of binary logistic regression model. J Interdiscip Math. 2010;13: 253–267.
  91. 91. Tabachnik BG, Fidell LS. Using Multivariate Statistics. 6th ed. Upper Saddle River: Pearson; 2013.
  92. 92. Muir SW, Berg K, Chesworth B, Klar N, Speechley M. Balance impairment as a risk factor for falls in community-dwelling older adults who are high functioning: a prospective study. Phys Ther. 2010;90: 338–347. pmid:20056721
  93. 93. Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A Simulation Study of the Number of Events per Variable in Logistic Regression Analysis. J Clin …. 1996;49: 1373–1379. Available:
  94. 94. Rizopoulos D. Package ‘bootStepAIC’ [Internet]. 2009. Available:
  95. 95. Hilbe JM. Logistic Regression Models. Boca Raton: CRC Press; 2009.
  96. 96. Agresti A. Categorical Data Analysis. Second. Hoboken: Wiley; 2002.
  97. 97. Menard S. Applied Logistic Regression Analysis. Second. Thousand Oaks: Sage; 2002.
  98. 98. Eve SJ, Crema ER. A house with a view? Multi-model inference, visibility fields, and point process analysis of a Bronze Age settlement on Leskernick Hill (Cornwall, UK). J Archaeol Sci. 2014;43: 267–277.
  99. 99. Beh EJ, Lombardo R. Correspondence Analysis: Theory, Practice and New Strategies [Internet]. Chichester: Wiley; 2014. Available:,+Practice+and+New+Strategies#0
  100. 100. Austin PC, Tu J V. Statistical Bootstrap Methods Practice for Developing Predictive Models. Am Stat. 2004;58: 131–137.
  101. 101. Luder MT, Pittet I, Berchtold A, Akré C, Michaud PA, Surís JC. Associations between online pornography and sexual behavior among adolescents: Myth or reality? Arch Sex Behav. 2011;40: 1027–1035. pmid:21290259
  102. 102. Pittet I, Berchtold a, Akré C, Michaud P, Surís J-C. Are adolescents with chronic conditions particularly at risk for bullying? Arch Dis Child. 2010;95: 711–716. pmid:19307194
  103. 103. Cooke CR, Shah C V, Gallop R, Bellamy S, Ancukiewicz M, Eisner MD, et al. A simple clinical predictive index for objective estimates of mortality in acute lung injury. Crit Care Med. 2009;37: 1913–1920. pmid:19384214
  104. 104. Dunn DC, Stewart K, Bjorkland RH, Haughton M, Singh-Renton S, Lewison R, et al. A regional analysis of coastal and domestic fishing effort in the wider Caribbean. Fish Res. 2010;102: 60–68.
  105. 105. Harrell FE, Lee KL, Mark DB. Tutorial in biostatistics multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15: 361–387. pmid:8668867
  106. 106. Steyerberg EW, Harrell FE, Borsboom GJJM, Eijkemans MJC, Vergouwe Y, Habbema JDF. Internal validation of predictive models. J Clin Epidemiol. 2001;54: 774–781. pmid:11470385
  107. 107. R Team Core. R: A language and environment for statistical computing. [Internet]. Vienna: R Foundation for Statistical Computing; 2013. Available:
  108. 108. Alberti G. ‘model.valid’: R function for cross-validated AUC. 2016.
  109. 109. Rangel TF, Diniz-Filho JAF, Bini LM. SAM: a comprehensive application for Spatial Analysis in Macroecology. Ecography (Cop). 2010;33: 46–50.
  110. 110. Dorren L, Rey F. A review of the effect of terracing on erosion. Soil Conserv Prot Eur. 2004; 97–108. Available:
  111. 111. Sandor JA, Hawley JW, Schiowitz RH, Gersper PL. Soil-geomorphic setting and change in prehistoric agricultural terraces in the Mimbres Area, New Mexico. In: Mack G, Witcher J, Lueth V, editors. Geology of the Gila Wilderness—Silver City area. Socorro: New Mexico Geological Society; 2008. pp. 167–176. Available:
  112. 112. Anderson EW. The wied: a representative Mediterranean landform. GeoJournal. 1997;41: 111–114.
  113. 113. Galdies C. The Climate of Malta: statistics, trends and analysis 1951–2010. Valletta; 2011.
  114. 114. Shishkov T, Kolev N. The Soils of Bulgaria. Dordrecht Heidelberg New York London: Springer Netherlands; 2014.
  115. 115. Bašić F. Soils of Croatia. Dordrecht Heidelberg New York London: Springer; 2013.
  116. 116. Wettinger G. Agriculture in Malta in the Late Middle Ages. In: Buhagiar M, editor. Proceedings of History Week. Malta: The Historical Society; 1981. pp. 1–48.
  117. 117. Blouet B. The Changing Landscape of Malta during the rule of the Order of St John of Jerusalem 1530–1798. University of Hull. 1963.
  118. 118. Forbes H. The uses of the uncultivated landscape in modern Greece: a pointer to the value of the wilderness in antiquity? In: Shipley G, Salmon J, editors. Human Landscapes in Classical Antiquity: Environment and Culture. London-New York: Routledge; 1996. pp. 68–97.
  119. 119. Rosenberg MS, Anderson CD. PASSaGE: Pattern Analysis, Spatial Statistics and Geographic Exegesis. Version 2. Methods Ecol Evol. 2011;2: 229–232.
  120. 120. Dalli C. From medieval dar al-Islam to contemporary Malta: rahal toponymy in a wider western Mediterranean context. Isl Stud J. 2016;11: 369–380.
  121. 121. Park SJ, Cheng Z, Yang H, Morris EE, Sutherland M, McSpadden Gardener BB, et al. Differences in soil chemical properties with distance to roads and age of development in urban areas. Urban Ecosyst. 2010;13: 483–497.
  122. 122. Akbar KF, Hale WHG, Šerá B, Ashraf I. Phytometric Assessment of Fertility of Roadside Soils and Its Relationship with Major Nutrients. 2012;21: 1141–1145.
  123. 123. Iwara AI, Gani BS, Adeyemi JA, Ewa EE. Effect of road construction on adjoining soil properties in Tinapa Resort, south-southern Nigeria. Open J Adv Eng Tech. 2013;1: 42–48.