From the Bottom Up: Assessing the Spectral Ability of Common Multispectral Sensors to Detect Surface Archaeological Deposits Using Field Spectrometry and Advanced Classifiers in the Shashi-Limpopo Confluence Area

Thabeng, Olaotse L.; Merlo, Stefania; Adam, Elhadi

doi:10.1007/s10437-020-09372-z

Original Article
Published: 11 March 2020

Volume 37, pages 25–49, (2020)

African Archaeological Review

Abstract

This paper investigates the ability of six common multispectral sensors (GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5, and WorldView-2) to map archaeological sites typically inhabited by the farming communities of Southern Africa and characterized by surface features such as middens, non-vitrified dung, and vitrified dung. To achieve this, hyperspectral data collected in the field using a GER-1500 field spectroradiometer were resampled to the spectral resolutions of the selected sensors using the spectral library resampling tool in ENVI. Mean decrease in accuracy was used to assess the importance of both hyperspectral wavelengths and each band allocated to a multispectral sensor in discriminating the selected archaeological classes. Two predictive models based on the resampled hyperspectral data were developed in R using support vector machine (SVM) and random forest (RF) classifiers. The results demonstrate that data resampled to the resolution of common multispectral sensors can predict surface archaeological features using RF and SVM classifiers. Important bands for predicting sites are mostly in the visible and shortwave infrared regions of the electromagnetic spectrum. The best performance was achieved with data resampled to the resolution of the Sentinel-2 sensor, which attained 81.90% and 92.38% accuracy with the RF and SVM classifiers, respectively. The predictions indicate the relevance of field spectroscopy studies for better understanding the spectral models critical for archaeological site detection.

Résumé

Cet article étudie la capacité de six courants capteurs multispectraux (GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5 et WorldView-2) les plus appropriés pour la cartographie des sites archéologiques habités par les communautés agricoles de l’Afrique australe. Ces sites ont des caractéristiques de surface spécifiques, telles que des amas, de bouse non vitrifiée et de bouse vitrifiée. Pour y parvenir, des données hyperspectrales ont été recueillies sur le terrain à l’aide d’un spectroradiomètre de champ GER-1500. Les données ont ensuite été rééchantillonnées aux résolutions spectrales de les capteurs sélectionnés. Cela s’est fait à l’aide de l’outil de rééchantillonnage des bibliothèques spectrales integré dans l’ENVI. La diminution moyenne de la precision a été utilisée pour évaluer l’importance des longueurs d’onde hyperspectrales et de chaque bande attribuée à un capteur multispectral, afin de distinguer les classes archéologiques susmentionnées. Deux modèles prédictifs basés sur les données hyperspectrales rééchantillonnées ont été développés dans le R, en utilisant des algorithmes classificateurs « support vector machine » (SVM) et « random forest » (RF). Les résultats ont montré que les données rééchantillonnées à la résolution des capteurs multispectraux courants permettent de prédire les caractéristiques archéologiques de surface à l’aide de classificateurs RF et SVM. Les bandes importantes pour la prédiction des sites étaient principalement dans les régions de l’infrarouge visible et à ondes courtes du spectre électromagnétique. Les meilleures performances ont été obtenues avec des données rééchantillonnées à la résolution du capteur Sentinel-2, qui ont atteint une précision de 81,90% et 92,38% dans les classificateurs RF et SVM. Les prédictions indiquent la pertinence des études de spectroscopie de terrain pour la compréhension des modèles spectraux les plus importants pour la détection des sites archéologiques.

Introduction

Africa is rich with heritage that documents human history from early primates up to recent complex societies (Connah 2004; Haaland 1995; Lange 2007; MacDonald 2013; Mattingly et al. 2007; Phillipson 2005; Shaw et al. 1993; Stahl 1994). Heritage sites on the continent are faced with dangers posed by both anthropogenic and natural threats such as mining activities, urban development, looting, flooding, erosion, and fires (Chirikure 2013; Kankpeyeng and DeCorse 2004; Khandlhela and May 2006; Lasaponara et al. 2016; Musyoki et al. 2016; Nienaber et al. 2008; Parcak 2015; Schmidt and McIntosh 1996; Smith 2012). In addition, heritage management institutions in Africa face several challenges, including a lack of funds, which often leads to inadequate surveying, documentation, and monitoring of heritage sites (Chirikure 2013; Mabulla 2001; McIntosh 1993). Site surveying, documentation, and monitoring in some regions are also hampered by inaccessibility due to factors such as the presence of dangerous wild animals, conflicts, and property rights (Biagetti et al. 2017; Mabulla 2001; Thabeng et al. 2019).

The identification and documentation of archaeological features in Africa has traditionally been done through fieldwalking surveys (Fleisher and LaViolette 1999; Hitchner 1995; Huffman 2009a, 2011; McIntosh and McIntosh 1993). Fieldwalking surveys offer the surveyor an opportunity to identify, appreciate, and record finer details of different types of archaeological sites on the ground and to provide contextual records of archaeological materials (Foard 1977; Reid and Segobye 2000). Their limitation is that they are time-consuming, costly, and difficult to carry out over large areas (Banning et al. 2006; Corrie 2011; Hitchings et al. 2013). As a relatively cheap, fast, and systematic alternative, heritage managers and researchers have devised analytical techniques to predict the locations of archaeological sites over large and/or inaccessible areas within a short period of time (i.e., “predictive models”). These are based either on a sample of a region or on fundamental assumptions about human behavior (Danese et al. 2014; Keay et al. 2014; Kohler and Parker 1986; Lasaponara et al. 2014; Verhagen and Whitley 2012). Traditional models predict the location of archaeological sites based on the spatial analysis of environmental variables and/or other sites (Danese et al. 2014; Sharafi et al. 2016), while remote sensing predictive models exploit the spectral contrast between features and their surroundings (Corrie 2011). Although landscapes carry the cumulative traces of human–environment interactions, anthropogenic activities can have localized, long-lasting impacts on the soil’s physical and chemical properties, thus making certain areas distinct from their surroundings (Oonk et al. 2009; Wilson et al. 2008). For example, negative vegetation marks have been identified as indicators of the presence of subsurface archaeological features such as walls (Hejcman and Smrž 2010). 
This is because the presence of walls in the soil makes it more compact and less moisture-retentive, which results in stunted vegetation growth (Gojda and Hejcman 2012). Highly moisture-retentive features such as ditches have been linked with positive vegetation marks (Featherstone et al. 1999; Reeves 1936). On the other hand, surface archaeological features can be identified based on their physical characteristics such as form (De Laet et al. 2007; Mason 1968; Sadr 2016) and ecological indicators (Denbow 1979; Reid 2016). Lastly, soil chemical and physical characteristics strongly influence the spectral behavior of soils and can be discriminated through spectral imaging (Ben-Dor 2002). As a result, several studies (Agapiou et al. 2012b, 2014a; Altaweel 2005; Beck 2007; Crawford 1923; Klehm et al. 2019; Mason 1968; Opitz and Herrmann 2018; Parcak 2007) have successfully employed remote sensing techniques to identify a number of archaeological site indicators. However, due to limited funding and incomplete site databases, research on the applicability of predictive remote sensing in an African context remains sporadic (Denbow 1979; Klehm et al. 2019; Mason 1968; Sadr and Rodier 2012).

Remote sensing data can be captured using broadband (multispectral) and narrowband (hyperspectral) sensors housed on handheld, airborne, and spaceborne platforms (Bradbury et al. 2013; Cavalli et al. 2013; Doneus et al. 2014; Mutanga et al. 2015; Schmidt and Skidmore 2003). At present, there are several multispectral satellite sensors with different spatial and spectral characteristics providing large volumes of data with great potential for the identification of archaeological sites; the challenge is to identify the most suitable sensors for studying different archaeological features (Agapiou et al. 2014a; Parcak 2007). This is because, in addition to optimum environmental conditions, the ability to detect archaeological materials using remote sensing depends on the spatial and spectral resolutions of the sensor (Beck 2007). Spatial resolution represents the area on the ground that each pixel in an image covers and is a measure of the smallest object that can be resolved by the sensor (Liang et al. 2012). Higher spatial resolution means each pixel represents a smaller square of ground, with higher chances of detecting small archaeological features. In multispectral imagery (that is, datasets containing more than one spectral band), spectral resolution is the width of each band (wavelength range) of the electromagnetic spectrum in the dataset, and it measures the ability of the sensor to resolve features in the electromagnetic spectrum (Lillesand et al. 2008). Since different surface materials can be distinguished by comparing their spectral responses (reflected radiation) over distinct wavelength ranges, the narrower the bandwidth, the higher the ability of a sensor to make this distinction. Often, a trade-off between spatial and spectral resolution is needed for the identification of desired surface features. As such, a number of studies have compared the accuracies of different satellites in detecting archaeological features (Fowler 2002; Parcak 2007). This approach can be time-consuming and expensive, especially when using commercial satellite images.

Hyperspectral data offer high spectral resolution by capturing narrow bands across visible, near-infrared, and shortwave infrared portions of the electromagnetic spectrum. This high spectral resolution permits the identification of distinctive attributes of different features (Agapiou et al. 2012b; Cavalli et al. 2007; Cerra et al. 2018). As a result, many studies have used field and laboratory hyperspectral data to pilot investigations on the potential application of remote sensing principles in various fields including the analysis of soil’s physical and chemical properties (Cozzolino and Moron 2003; Nocita et al. 2014; Sørensen and Dalsgaard 2005), vegetation health (Dhau et al. 2018b; Kokaly 2001), spectral identification of different vegetation species (Adam et al. 2009; Cochrane 2000), and spectral discrimination of archaeological sites (Agapiou et al. 2010, 2012b; Melillos et al. 2018). Hyperspectral data has also been used to investigate the ability of planned multispectral satellite sensors to detect vegetation indices associated with buried archaeological features (Agapiou et al. 2014b). Currently, there are very few studies aimed at identifying spectral bands suitable for discriminating surface archaeological features in current operational multispectral sensors (Thabeng et al. 2019).

The use of hyperspectral sensors for discriminating different features has some limitations such as high computational demands and the large data redundancy due to the strong correlation between the spectral features (Burger and Gowen 2011; Doneus et al. 2014; Feng et al. 2016; Metternicht et al. 2010; Sibanda et al. 2016). Additionally, there are no operational airborne and spaceborne sensors matching the very high spectral resolution of hand-held spectrometers. As a result, numerous studies have resampled field and laboratory hyperspectral data acquired over small areas to the spectral resolutions of existing multispectral and hyperspectral sensors, in order to investigate applications for soil analysis (Nawar et al. 2014), vegetation studies (Adam et al. 2012; Mansour et al. 2012), and archaeology (Agapiou et al. 2014a). The major limitation of resampling using field and lab spectroscopy data is that these data have a high signal-to-noise ratio (SNR), which is impossible to achieve with imagery from airborne and spaceborne sensors (Mutanga et al. 2015). SNR is a measure that compares the level of a desired signal to the level of background noise and indicates, in remote sensing, how much of the recorded signal that appears as a pixel is useable information vs. unwanted distortion or noise. However, several studies (Mansour et al. 2012; Mutanga et al. 2015) have found no significant difference between the results obtained from resampling fine resolution data and those from the actual satellite image.

This study seeks to identify the most suitable multispectral sensors for mapping archaeological sites previously occupied by farming communities. This is done by resampling in situ hyperspectral data to the spectral resolutions of the most common multispectral sensors (namely GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5, and WorldView-2). The study was carried out in the Mapungubwe Cultural Landscape, an area of Southern Africa occupied by farming communities since the beginning of the first millennium AD (Huffman 2008; Huffman and Du Piesanie 2011). Unique surface features and a distinct settlement organization, known as the Central Cattle Pattern (CCP) (Hanisch 2002), further described below, make this landscape ideal for testing. The specific objectives of the paper are (i) to identify the optimum spectral resolution for predicting archaeological sites (middens, non-vitrified dung, and vitrified dung) using in situ hyperspectral data resampled to different remote sensing multispectral platforms; (ii) to compare the prediction accuracies for middens, non-vitrified dung, and vitrified dung achieved when the resampled data are classified with RF and SVM; and (iii) to identify the importance of the different bands allocated in different multispectral sensors in predicting archaeological sites (middens, non-vitrified dung, and vitrified dung) using the RF algorithm.

Materials and Methods

Study Area and Archaeological Context

The Mapungubwe Cultural Landscape is a UNESCO-listed heritage area situated where the Shashi and Limpopo rivers meet in the province of Limpopo, South Africa (Fig. 1). The Shashi-Limpopo Confluence Area (SLCA) forms the boundaries of three countries: Botswana to the west, South Africa to the south, and Zimbabwe to the north. Geologically, the SLCA lies within the Limpopo mobile belt, which joins the Zimbabwe and Kaapvaal cratons (Chinoda et al. 2009). This area is characterized by igneous and sedimentary rocks of the Karoo supergroup (Bordy and Catuneanu 2002). Erosion is rampant, particularly in areas closer to the river channels, thus forming sandstone ridges and outcrops, which cover most parts of the SLCA, with a sparse distribution of volcanic intrusions (Götze et al. 2008; Hanisch 1981). Generally, soils in the Limpopo mobile belt include clays and sands originating from the Karoo system.

The Mapungubwe Cultural Landscape was occupied by different farming communities, which combined cultivation with herding and the smelting and forging of iron (Mitchell 2013) in two distinctive periods. The first occupation occurred during the early centuries of the first millennium AD (Huffman 2008; Huffman and Du Piesanie 2011) and the second from AD 900 onwards (Calabrese 2000; Eloff and Meyer 1981; Huffman 2000; Vogel and Calabrese 2000). A model of settlement organization known as the Central Cattle Pattern (CCP) (Hanisch 2002; Huffman 1982, 1986) has been used to describe the structure of the villages and the worldview of their inhabitants, both reflecting the centrality of cattle in the life of these communities. The main features of the CCP are (1) a central cattle byre (also called a kraal) with elite burials and storage pits for grains; (2) an area next to the kraal where men would gather; and (3) an outer residential zone characterized by huts arranged according to seniority (Fagan 1964; Huffman 2000, 2001, 2009b). Social and political changes in the Mapungubwe Cultural Landscape took place during the early centuries of the second millennium AD (AD 1000–1300), with the development of class distinction and sacred leadership (Huffman 2000; Meyer 2000). The chief/king was physically separated from the commoners at the beginning of the thirteenth century AD with the occupation of Mapungubwe Hill (Huffman 2009b). This led to changes in the organization of the main settlements whereby the traditional centrality of the cattle byre was abandoned and stone walls were built to seclude rulers from the commoners in major settlements (Huffman 2000; Meyer 2000). However, the CCP continued in the satellite settlements occupied by commoners (Huffman 2000).

Mapungubwe societies traded with merchants along the Indian Ocean coast (Huffman 2000; Meyer 2000; Pwiti 2005). Materials such as glass beads and marine shells were exchanged for metals, salt, ivory, and animal skins from the interior polities such as Toutswe and Bosutswe (Denbow 1990; Huffman 2000; Klehm et al. 2019; Klehm 2017; Koleini et al. 2016). At the peak of its power, the leadership of Mapungubwe is believed to have dominated societies up to 200 km away (Huffman 1982). A shift of power came towards the end of the thirteenth century AD, as the Mapungubwe Kingdom collapsed and the Great Zimbabwe Kingdom became dominant in the region (Calabrese 2000; Denbow 1990; Huffman 2009a; Klehm et al. 2019). However, trade between the societies in the Mapungubwe Cultural Landscape and the east coast merchants continued into the historic period (Huffman 2012).

Archaeologically, the most distinct features that remain in the Mapungubwe Cultural Landscape are cattle byres, marked by deposits of vitrified and/or non-vitrified dung (Huffman 2009b; Meyer 2000). Non-vitrified dung deposits consist of unburned dung (Huffman et al. 2013). Vitrified dung is a glassy biomass slag with high deposits of nitrates and phosphates formed by burning thick dung deposits at very high temperatures, usually in the region of 1100 °C (Peter 2001; Thy et al. 1995). The causes of dung vitrification are debated. Thy et al. (1995) posit that, for vitrification to occur, dung may have been burned by veld fires or lightning at very high temperatures, in an environment conducive to internal combustion. Other researchers (Huffman et al. 2013; Peter 2001) argue that vitrification results from the intentional burning of byres, most likely for cleansing purposes. Generally, the sites appear as bare patches within the savanna woody vegetation, in some cases barren and grayish-white in color (in particular when the dung is vitrified) and in other cases covered by grass, predominantly Cenchrus ciliaris (Denbow 1979; Mothulatshipi 2008). The distinct spectral signature, large size, and centrality of cattle byres, already examined by remote sensing-based studies in the region (Denbow 1979), make them an ideal indicator for the prediction of a household or village, depending on the scale at which the study is carried out.

Pits, grain bins, and middens are the other major features characterizing many sites (Huffman 2007). Middens include the discarded remains of materials such as broken potsherds, animal bones, beads and other artifacts, and the ashes from fireplaces (Chirikure et al. 2014; Huffman 2012). While pits and grain bins are small features of sub-meter sizes (not easily detectable by any optical remote sensing images), middens, which can differ in size depending on the duration and density of site occupation (Eloff and Meyer 1981), are generally larger than a few meters and could easily serve as another excellent site indicator.

Given the distinct spatial and spectral characteristics of these archaeological features, their detection through the analysis of multispectral remote sensing imagery could have major implications that go beyond the construction of predictive models. These features are not only useful for determining site location and settlement patterns but can also be associated with sociopolitical factors such as site hierarchy and/or use (Denbow 1986; Huffman 1986, 2000, 2001, 2009b; Manyanga 2007; Meyer 2000; Mothulatshipi 2008). Expanding the knowledge of location and size of sites over vast areas, from local scales to regional landscapes, is fundamental for gaining insight into political hierarchies of contemporaneous settlements (Huffman 1986) and diachronic population aggregation and environmental strategies. This is especially true for understanding the role of small sites in hinterland locales (Antonites and Ashley 2016; Klehm 2017; Klehm and Ernenwein 2016).

Field Data Collection

A total of 356 soil surface samples (at a depth of 0–20 cm) were collected in February 2017 and packed in zip-lock plastic bags for spectral measurements in the laboratory. This procedure followed the traditional method of acquiring reproducible, stable, and accurate spectral measurements for the analysis of soil spectral characteristics (Ben-Dor et al. 2017; Stevens et al. 2010). Between 60 and 117 samples were collected for each category: non-sites, archaeological soils characterized by middens, and vitrified and non-vitrified dung deposits. A purposive sampling method was used during the fieldwork data collection by visiting archaeological sites that were known to be characterized by dung deposits and middens (Huffman 2009a, 2011). Non-site soil samples served as a control; these were collected at some distance from the targeted archaeological features in order to avoid possible contamination that could come from wind and water erosion. Although this measure does not guarantee that the collected soils are non-archaeological, the procedure ensured that the control soils were distinct and distant from the targeted archaeological features—byres and middens.

Lab Spectral Measurements and Resampling

A portable field spectrometer (FieldSpec® 4) was used to measure the reflectance spectra of vitrified dung, non-vitrified dung, midden, and non-site soils in a controlled environment. This was done in order to minimize the atmospheric effects caused by weather conditions. The Analytical Spectral Device (ASD) captures visible-near infrared and shortwave infrared spectral data between 350 and 2500 nm, at a bandwidth of 1.4 nm in the visible-near infrared region (350–1000 nm) and 1.1 nm in the shortwave infrared region (1001–2500 nm) (Analytical Spectral Devices, Inc. 2018). These very narrow spectral channels have been successfully resampled to the resolution of broadband sensors (Castaldi et al. 2016; Mutanga et al. 2015). The spectrometer was calibrated using a white Spectralon reference panel before taking measurements of a new sample and thereafter every 10–15 measurements to offset any change in atmospheric conditions (Analytical Spectral Devices, Inc. 2018). Soil samples were flattened on a black plastic plate to create a smooth surface. The spectral measurements were then taken directly from the soil surface of each sample at nadir position with a 10-mm field of view using a Hi-Brite contact probe fitted with a 100 W halogen reflector lamp (Ben-Dor et al. 2015; Ogen et al. 2017). Between 60 and 117 samples were collected from non-sites, middens, vitrified dung, and non-vitrified dung sites in the field (see Table 2 below). Three spectral measurements were taken per sample by randomly moving the probe over the soil surface, in order to obtain a representative reflectance spectrum for the sample. The spectral measurements were then averaged to represent the absolute spectral reading of the soil class of interest (Fig. 2).

Table 1 Spectral characteristics showing band description, bandwidth, band center, and spatial resolution of different multispectral sensors

Hyperspectral data measured in the lab were then converted to an ASCII file containing 10-nm-wide band spacing using wavelengths between 350 and 2500 nm. The resultant hyperspectral data contained in the ASCII file was averaged to mimic, through resampling, the spectral resolutions of common multispectral sensors using the resampling spectral library function inherent within Environment for Visualizing Images (ENVI) software (v. 5.4). The resampling tool in ENVI employs a Gaussian model with a full width at half maximum (FWHM) equal to the specified band spacing to resample the data (Dhau et al. 2018a; Oumar and Mutanga 2010; Verrelst et al. 2013). The hyperspectral data were resampled to the spectral resolutions of a selection of popular multispectral sensors (GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5 and WorldView-2) using band centers in Table 1. Bands between 350 and 400 nm and 2400–2500 nm were removed from the data before resampling, as these bands are affected by noise (Castaldi et al. 2016).
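The resampling step can be pictured as a Gaussian-weighted average of the narrow hyperspectral channels around each broadband center. The sketch below is a minimal standard-library Python illustration of that idea, not ENVI's implementation and not the R workflow used in the study. The function names, the two band definitions, and the linear test spectrum are all hypothetical and chosen only to make the arithmetic easy to follow.

```python
import math

def gaussian_weights(wavelengths, center, fwhm):
    """Gaussian spectral response sampled at the given wavelengths (nm)."""
    # Convert full width at half maximum to a standard deviation.
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    return [math.exp(-0.5 * ((w - center) / sigma) ** 2) for w in wavelengths]

def resample_band(wavelengths, reflectance, center, fwhm):
    """Weighted mean reflectance for one simulated broadband channel."""
    weights = gaussian_weights(wavelengths, center, fwhm)
    total = sum(weights)
    return sum(w * r for w, r in zip(weights, reflectance)) / total

# Hypothetical 10-nm spectrum from 400 to 2400 nm (noisy ends already removed).
wavelengths = list(range(400, 2401, 10))
# Made-up reflectance rising linearly with wavelength, for illustration only.
reflectance = [0.1 + 0.0001 * (w - 400) for w in wavelengths]

# Illustrative band definitions (center, FWHM in nm); NOT taken from Table 1.
bands = {"red": (665.0, 30.0), "swir": (1610.0, 90.0)}
resampled = {name: resample_band(wavelengths, reflectance, c, f)
             for name, (c, f) in bands.items()}
```

Because the test spectrum is linear and the Gaussian weights are symmetric, each simulated band should come out very close to the reflectance at its own center, which gives a quick sanity check before real band centers and widths are substituted.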

The resulting resampled satellite datasets were divided into training (70%) and test (30%) datasets (Table 2). Thereafter, the datasets were used as input variables in RF and SVM classifiers to test if their spectral resolutions are suitable for predicting archaeological sites.
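A 70:30 division of this kind can be sketched as follows. This is a minimal standard-library Python illustration rather than the R code used in the study. It assumes the split was made within each soil class (a stratified split), which the text does not state explicitly, and the class sizes, function name, and random seed are hypothetical.

```python
import random

def stratified_split(labels, train_fraction=0.7, seed=42):
    """Return (train_indices, test_indices), split separately within each class."""
    rng = random.Random(seed)
    by_class = {}
    for index, label in enumerate(labels):
        by_class.setdefault(label, []).append(index)
    train, test = [], []
    for indices in by_class.values():
        rng.shuffle(indices)                      # random order within the class
        cut = round(train_fraction * len(indices))
        train.extend(indices[:cut])               # first 70% go to training
        test.extend(indices[cut:])                # remaining 30% go to testing
    return sorted(train), sorted(test)

# Hypothetical class sizes; the actual counts are those reported in Table 2.
labels = ["non-site"] * 100 + ["midden"] * 60 + ["vitrified dung"] * 80
train_idx, test_idx = stratified_split(labels)
```

Splitting within each class keeps the class proportions the same in both subsets, which matters when some classes have far fewer samples than others.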

Table 2 Training and validation dataset for all the soil classes created by splitting the field data into 70:30

Data Classification

Although conventional parametric classifiers such as Maximum Likelihood remain the preferred method for many applied remote sensing studies, including archaeological ones (e.g., De Laet et al. 2009), this study used Random Forest (RF) and Support Vector Machines (SVM) to classify all soil classes (sites and non-sites). Parametric classifiers are readily available in conventional image processing software packages (Yu et al. 2014), whereas there is still uncertainty about how to use and implement machine-learning techniques effectively (Maxwell et al. 2018). Nevertheless, RF and SVM classification algorithms have been shown to provide better classification performance (higher accuracy) than traditional, statistically based, parametric procedures (Ahmad et al. 2010; Belgiu and Drăguţ 2016; Chagas et al. 2016; Grimm et al. 2008; Maxwell et al. 2018; Mountrakis et al. 2011; Pal and Mather 2003). Moreover, these machine-learning classifiers can model complex class signatures characterized by many predictor variables (high-dimensional feature space) and are non-parametric; that is, they do not make assumptions about the data distribution (Maxwell et al. 2018) and can accept limited training datasets (Rodriguez-Galiano et al. 2012; Shao and Lunetta 2012). Furthermore, RF and SVM have a high generalization capacity, which makes it possible to apply them to incomplete or noisy (error-prone) databases (Rodriguez-Galiano et al. 2012; Rodriguez-Galiano and Chica-Rivas 2014; Shao and Lunetta 2012). These characteristics are advantageous for archaeological site prediction, particularly in complex archaeological landscapes that may contain a high number of land covers, with low interclass separability, and/or limited access for the collection of training data.

Random Forest

RF is a non-parametric machine learning classification algorithm developed by Breiman (2001). The algorithm uses an ensemble of classification and regression trees for prediction. It grows each tree without pruning until its nodes reach purity, using a random subset of predictor variables at each node (Adam et al. 2017). Each tree in the forest then contributes a single vote for the predicted class, with the majority vote deciding the class. RF requires the optimization of the number of trees (ntree) and the number of predictor variables considered at each node (mtry) in order to improve the classification accuracy (Genuer et al. 2010; Mureriwa et al. 2016). Each tree is built on a bootstrap sample drawn at random, with replacement, from the training data (Breiman 1996; Rodriguez-Galiano et al. 2012). This sampling technique uses about two-thirds of the data to train each tree and the remaining third to assess the generalization error and the importance of each variable in classification (Belgiu and Drăguţ 2016). The held-out data are referred to as the Out-Of-Bag (OOB) sample.
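The two-thirds to one-third division and the voting rule can be illustrated with a few lines of standard-library Python. This is a sketch of the general bootstrap and majority-vote mechanics only, not of the randomForest package used in the study. The sample size, seed, function names, and the five tree votes are hypothetical.

```python
import random
from collections import Counter

def bootstrap_sample(n_samples, rng):
    """Draw n_samples indices with replacement; return (in_bag, out_of_bag)."""
    in_bag = [rng.randrange(n_samples) for _ in range(n_samples)]
    # Samples never drawn form the Out-Of-Bag set for this tree.
    out_of_bag = sorted(set(range(n_samples)) - set(in_bag))
    return in_bag, out_of_bag

def majority_vote(tree_votes):
    """Class predicted by the forest: the label with the most tree votes."""
    return Counter(tree_votes).most_common(1)[0][0]

rng = random.Random(0)
n_samples = 1000  # hypothetical training-set size
in_bag, out_of_bag = bootstrap_sample(n_samples, rng)
oob_fraction = len(out_of_bag) / n_samples  # expected to be near exp(-1), roughly 0.37

# Hypothetical votes cast by five trees for a single soil sample.
prediction = majority_vote(
    ["midden", "non-site", "midden", "vitrified dung", "midden"]
)
```

Sampling with replacement leaves each sample out of a given tree's bag with probability (1 - 1/n)^n, which approaches exp(-1) for large n. This is where the "remaining third" comes from.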

One major advantage of RF over other machine learning algorithms, such as artificial neural networks and SVM, is its inherent ability to measure the importance of each candidate predictor in the classification process. This advantage has been demonstrated in a number of studies where RF was used for dimensionality reduction and variable selection in various domains such as bioinformatics (Díaz-Uriarte and De Andres 2006; Farhat et al. 2016; Wu et al. 2008), ecology (Brieuc et al. 2015; Wei et al. 2010), remote sensing (Mutanga et al. 2012), and medical imaging (Lebedev et al. 2014). RF provides two measures of variable importance. The first, Gini importance, measures the contribution of each predictor in keeping the nodes pure in a forest. The second, mean decrease in accuracy, is calculated using the RF internal measure of accuracy. RF assesses the importance of each variable in the final model by measuring the decrease in accuracy, by means of the OOB error, when the values of that variable are randomly permuted in the OOB sample while the other variables remain unchanged (Breiman 2001). The error is expected to rise if the variable is important for the prediction of the forest. The importance of the predictor variable y_j can be defined as follows:

$$ \mathrm{MDA}\left({y}_j\right)=\frac{1}{ntree}\sum \limits_{t=1}^{ntree}\left({ap}_{tj}-{a}_{tj}\right) $$

Whereby:

ntree is the number of trees of the RF,
ap_tj is the OOB error of tree t after randomly permuting the values of the predictor variable y_j, and
a_tj is the OOB error of tree t before randomly permuting the values of the predictor variable y_j.
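As a worked illustration of the formula, the sketch below averages, over all trees, the difference between each tree's OOB error after and before permutation. It is plain Python written for this example, not the randomForest implementation. The function name and the per-tree error values are hypothetical.

```python
def mean_decrease_accuracy(errors_permuted, errors_original):
    """MDA for one predictor: mean over trees of (permuted OOB error - original OOB error)."""
    if len(errors_permuted) != len(errors_original):
        raise ValueError("need one pair of OOB errors per tree")
    ntree = len(errors_original)
    return sum(ap - a for ap, a in zip(errors_permuted, errors_original)) / ntree

# Hypothetical per-tree OOB errors for a forest of four trees.
a_t = [0.10, 0.12, 0.08, 0.10]                 # before permutation
ap_informative = [0.30, 0.35, 0.25, 0.30]      # after permuting an informative band
ap_uninformative = [0.10, 0.13, 0.08, 0.09]    # after permuting an uninformative band

mda_informative = mean_decrease_accuracy(ap_informative, a_t)
mda_uninformative = mean_decrease_accuracy(ap_uninformative, a_t)
```

With these made-up numbers the informative band yields an MDA of about 0.20, while the uninformative band yields a value of about zero, since permuting it changes the OOB error only by chance.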

The resulting value for each predictor variable can then be used to assess its importance in relation to the others in the prediction process. In this study, mean decrease in accuracy was used to measure the importance of hyperspectral data and resampled satellite bands in predicting non-sites, middens, non-vitrified dung, and vitrified dung. The mtry and ntree parameters were optimized using grid search and 10-fold cross-validation in the e1071 library of the R statistical package, version 3.4.1 (Meyer et al. 2017). The resampled hyperspectral data were then classified in R using the randomForest package, which is based on the original RF algorithm developed by Breiman and Cutler (2007).

Support Vector Machines

The SVM classification algorithm has previously been used to classify land cover data from satellite sensors (Adam et al. 2014; Ustuner et al. 2015). This is because of its robust generalization ability and its capacity to deal with noise effects and achieve high classification accuracies (Shao and Lunetta 2012). SVMs are non-parametric classifiers and therefore do not assume normality within training statistics. In this study, SVM was used to predict the soil classes using resampled satellite bands. SVM is a kernel-based algorithm that predicts classes by finding the hyperplane that optimally separates two classes in high-dimensional feature space (Chen and Lin 2006; Zhu and Blumberg 2002). The most used SVM kernels are the polynomial, sigmoid, linear, and radial basis function (RBF) kernels (Ben-Hur and Weston 2010; Lin and Lin 2003; Pal and Mather 2005). An RBF kernel was used to classify the data in this study because of its ability to handle nonlinear relations between class labels and attributes (Hsu et al. 2003). The RBF is defined as follows:

k(x, x¹) = exp( − γ‖x − x¹‖²)

where x and x¹ are two points from the training data and γ is the kernel function parameter, whose default value is 1/(data dimension). The RBF kernel requires two user-defined parameters to run the SVM model: the regularization parameter (C) and the kernel function parameter (γ). The regularization parameter regulates the accepted level of misclassification errors by determining the margin between class boundaries (Li et al. 2015). The kernel function parameter defines the width of the Gaussian kernel. Because these parameters influence the overall classification accuracy, the model needs to be run with optimum parameters in order to obtain good classification accuracy (Hsu et al. 2003). In this study, pairs of C and γ parameters were optimized using 10-fold cross-validation and grid search. This method tests various combinations of C and γ parameters and chooses the one that attains the best cross-validation accuracy. The model follows the procedure described below:

1. Consider a grid space of (C, γ) with log₂C ∈ {−5, −3, …, 13} and log₂γ ∈ {−13, −11, …, 3}.
2. For each pair of C and γ parameters in the search space, carry out 10-fold cross-validation on the training set.
3. Select a pair of C and γ, which will result in the best overall cross-validation classification rate.
4. Train a model using the selected best combination of parameters (C, γ).
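
The steps above can be sketched in a few lines of Python. This is an illustration of the search logic only, not the e1071 implementation used in the study. The names make_grid, k_fold_indices, grid_search, and toy_accuracy are hypothetical, and toy_accuracy is a placeholder score standing in for "train an SVM on the training folds and score it on the held-out fold".

```python
import itertools
import math
import random


def make_grid(log2_c=range(-5, 14, 2), log2_gamma=range(-13, 4, 2)):
    """Step 1: all (C, gamma) pairs on the exponential grid."""
    return [(2.0 ** c, 2.0 ** g) for c, g in itertools.product(log2_c, log2_gamma)]


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle the sample indices and deal them into k nearly equal folds."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def grid_search(n_samples, fold_accuracy, k=10):
    """Steps 2 and 3: return the (C, gamma) pair with the best mean CV accuracy.

    fold_accuracy(C, gamma, train_idx, test_idx) is supplied by the caller; it
    should fit a model on train_idx and return its accuracy on test_idx.
    """
    folds = k_fold_indices(n_samples, k)
    best_pair, best_score = None, float("-inf")
    for c, gamma in make_grid():
        scores = []
        for i, test_idx in enumerate(folds):
            train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
            scores.append(fold_accuracy(c, gamma, train_idx, test_idx))
        score = sum(scores) / k
        if score > best_score:
            best_pair, best_score = (c, gamma), score
    return best_pair, best_score


def toy_accuracy(c, gamma, train_idx, test_idx):
    """Placeholder score peaking at C = 2**3, gamma = 2**-1 (not a real classifier)."""
    return 1.0 / (1.0 + abs(math.log2(c) - 3) + abs(math.log2(gamma) + 1))


best_pair, best_score = grid_search(n_samples=100, fold_accuracy=toy_accuracy)
print(len(make_grid()), "candidate pairs; best pair:", best_pair)
```

Step 4 then amounts to refitting the classifier once on the full training set with the returned pair. The grid defined in step 1 contains 10 values of C and 9 values of γ, so 90 pairs are evaluated, each with 10 model fits.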

The optimization of parameters and the classification of the resampled hyperspectral data were done using the e1071 package in R version 3.4.1 (Meyer et al. 2017).
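
To make the RBF kernel defined earlier in this section and its default γ concrete, the following sketch evaluates it for pairs of spectra. The function name rbf_kernel and the three 4-band spectra are hypothetical, illustrative values; they are not measurements from this study.

```python
import math


def rbf_kernel(x, x1, gamma=None):
    """k(x, x1) = exp(-gamma * ||x - x1||^2); gamma defaults to 1 / (data dimension)."""
    if len(x) != len(x1):
        raise ValueError("points must have the same number of bands")
    if gamma is None:
        gamma = 1.0 / len(x)
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, x1))
    return math.exp(-gamma * sq_dist)


# Hypothetical 4-band reflectance spectra (illustrative values only).
spectrum_a = [0.10, 0.20, 0.30, 0.40]
spectrum_b = [0.10, 0.20, 0.30, 0.40]
spectrum_c = [0.30, 0.40, 0.50, 0.60]

same = rbf_kernel(spectrum_a, spectrum_b)  # identical spectra
diff = rbf_kernel(spectrum_a, spectrum_c)  # every band differs by 0.2
print(same, diff)
```

Identical spectra give a kernel value of 1, and the value decays toward 0 as the spectra move apart. A larger γ makes that decay faster, which is why γ is tuned jointly with C.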

Accuracy Assessment

Classification accuracy was assessed by means of a confusion matrix constructed from a holdout dataset, which was created by randomly dividing the resampled data into 70% training data and 30% test data (see Table 1 above). The confusion matrix enables the assessment of the classification of each class by giving the user’s accuracy and the producer’s accuracy (Congalton 1991). User’s accuracy shows the proportion of samples assigned to a class that belong to that class in reality. It is obtained by dividing the number of correctly predicted samples in a class by the row total. Producer’s accuracy, on the other hand, measures the proportion of reference samples of a class that were correctly predicted. It is obtained by dividing the number of correctly predicted samples in a class by the column total. The confusion matrix also provides the overall accuracy, which is the percentage of correctly classified test samples across all classes. Cohen’s kappa coefficient was used to assess the agreement between the reference data and the classifier because of its ability to compensate for chance agreement (Rosenfield and Fitzpatrick-Lins 1986). Cohen’s kappa coefficient is defined as follows:

$$ K=\frac{\Pr \left(\mathrm{o}\right)-\Pr \left(\mathrm{c}\right)}{1-\Pr \left(\mathrm{c}\right)} $$

where Pr(o) is the observed agreement and Pr(c) is the agreement expected by chance. Perfect agreement corresponds to a kappa value (K) of one, and values close to one indicate almost perfect agreement (McHugh 2012; Rosenfield and Fitzpatrick-Lins 1986).
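
The accuracy measures described in this section can be computed from a confusion matrix as in the following sketch. It assumes that rows hold the predicted classes and columns the reference classes, in line with the row and column totals described above. The function name accuracy_metrics and the 2-class matrix are hypothetical and are not taken from the tables in this study.

```python
def accuracy_metrics(matrix):
    """User's, producer's, and overall accuracy plus Cohen's kappa.

    matrix[r][c] counts samples predicted as class r whose reference class is c.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(matrix[r][c] for r in range(n)) for c in range(n)]
    diagonal = [matrix[i][i] for i in range(n)]

    users = [d / rt if rt else 0.0 for d, rt in zip(diagonal, row_totals)]
    producers = [d / ct if ct else 0.0 for d, ct in zip(diagonal, col_totals)]
    pr_o = sum(diagonal) / total  # observed agreement (overall accuracy)
    pr_c = sum(rt * ct for rt, ct in zip(row_totals, col_totals)) / total ** 2
    kappa = (pr_o - pr_c) / (1 - pr_c)  # undefined when pr_c equals 1
    return {"users": users, "producers": producers, "overall": pr_o, "kappa": kappa}


# Hypothetical 2-class confusion matrix (illustrative values only).
example = [[40, 10],
           [5, 45]]
metrics = accuracy_metrics(example)
print(metrics)
```

In this made-up example the overall accuracy is 0.85, the chance agreement is 0.50, and kappa is therefore 0.70, which shows how kappa discounts the share of agreement that would be expected by chance.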

Results

Optimization of RF and SVM

The optimization results of the RF parameters (mtry and ntree) for the different sensors are shown in Fig. 3. In general, the lowest error rates achieved by the optimum mtry and ntree combinations for spectral data resampled to the resolutions of the various sensors range between 0.120 and 0.168 (Fig. 3). The optimum mtry and ntree combination for Sentinel-2 achieved the lowest OOB error rate, 0.120. The best mtry and ntree combination for the hyperspectral data resampled to the resolution of each satellite sensor was used to classify the corresponding data with the RF algorithm.

Exponentially growing sequences of C and γ values were assessed using grid search in order to select the best parameter combinations for classifying the datasets resampled to the spectral resolutions of the different sensors. The optimization yielded varying optimum combinations of C and γ for classifying data resampled to the resolutions of GeoEye (C = 1000 and γ = 1), Landsat 8 OLI (C = 100 and γ = 1), RapidEye (C = 100 and γ = 1), Sentinel-2 (C = 1000 and γ = 0.1), SPOT 5 (C = 1000 and γ = 1), and WorldView-2 (C = 100 and γ = 1) sensors in the SVM classifier with the RBF kernel.

Band Importance

The RF algorithm was used to assess the relative importance of each resampled band in predicting the classes of midden, non-vitrified dung, non-sites, and vitrified dung. Although the most important bands fall in different portions of the electromagnetic spectrum for the different sensors (Fig. 4), they are generally located within the visible spectrum. The green band (545 nm) was the most important band in discriminating midden, vitrified dung, non-vitrified dung, and non-sites in the SPOT 5 satellite sensor. This band combines wavelengths in the blue with those in the green part of the electromagnetic spectrum (Fig. 4). The SWIR band in the SPOT 5 sensor was the second most important band. The blue band was the most important band for satellites that capture data in the blue portion of the electromagnetic spectrum, such as GeoEye (480 nm), RapidEye (470 nm), Sentinel-2 (490 nm), and WorldView-2 (480 nm). Landsat 8 OLI was the only satellite that, despite having a blue band, had its most important band in discriminating midden, non-vitrified dung, non-sites, and vitrified dung located in the SWIR (1610 nm). The red band was the least important band for discriminating these soil classes for Landsat 8 OLI, SPOT 5, and GeoEye, while the red edge bands were the least important for WorldView-2, Sentinel-2, and RapidEye, which capture data in the red edge wavelengths. Overall, SPOT 5 had the bands with the highest importance for discriminating midden, vitrified dung, non-vitrified dung, and non-sites. Most bands of the Sentinel-2 sensor, which captures data in most regions of the electromagnetic spectrum, have low mean decrease in accuracy values compared with those of other sensors (Fig. 4).

The mean decrease in accuracy in RF was also used to assess the relationship between the hyperspectral bands that are important for discriminating midden, non-vitrified dung, non-sites, and vitrified dung soil classes and the location of the bands of the different sensors. The important bands for classification using hyperspectral data are spread across the visible, near-infrared, and shortwave infrared portions of the electromagnetic spectrum (350–2500 nm), as shown in Fig. 5. However, there are notable peaks in the visible and shortwave infrared portions of the electromagnetic spectrum at 350–576 nm, 1292–1380 nm, 1575–1748 nm, and 1801–1808 nm. All the satellite sensors have bands located in different areas of the visible spectrum. New satellite sensors with a spatial resolution finer than 5 m do not have bands covering the shortwave infrared region, which also contains some bands that are important in classifying the midden, non-vitrified dung, non-sites, and vitrified dung (Fig. 5). Only medium resolution sensors (Landsat 8 OLI, Sentinel-2, and SPOT 5) have bands that capture data in the SWIR portion of the electromagnetic spectrum. The SWIR bands of these sensors coincide with hyperspectral bands that are important for discriminating midden, non-vitrified dung, non-sites, and vitrified dung. This corroborates the importance of these bands in archaeological classification as shown in Fig. 4.

Classification Accuracy

The classification of the midden, non-vitrified dung, non-sites, and vitrified dung sites was performed using RF and SVM on the hyperspectral data resampled to the spectral resolutions of the GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5, and WorldView-2 sensors. The error matrices for the output of each classifier were built using a holdout sample created by randomly dividing the resampled laboratory data into 70% for training and 30% for validation. SVM achieved higher classification accuracies than RF for all datasets.

Accuracy assessment of the RF classifier, which was done using the validation data, gave overall accuracies of 78.10, 80.00, 72.38, 81.90, 77.14, and 77.14% and kappa coefficients of 0.7030, 0.7276, 0.6262, 0.7529, 0.6877, and 0.6905 when classifying hyperspectral data resampled to the spectral resolutions of GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5, and WorldView-2, respectively (Fig. 6). The lowest overall accuracy (72.38%) and kappa coefficient (0.6262) were attained with the data resampled to the spectral resolution of the RapidEye sensor, while Sentinel-2 achieved the highest overall accuracy (81.90%) and kappa coefficient (0.7529) (Table 3; Fig. 6). Sentinel-2 attained high producer’s and user’s accuracies of 82.86 and 78.38%, respectively, for NVD. RapidEye attained a producer’s accuracy of 60.00% and a user’s accuracy of 75.00% for the same class (Tables 3 and 4). However, mixed results were attained when classifying MD: data resampled to the RapidEye sensor resolution achieved a higher producer’s accuracy (64.00%) than data resampled to Sentinel-2 (60.00%) (Table 4), whereas Sentinel-2 achieved a higher user’s accuracy (65.22%) than RapidEye (50.00%) (Tables 3 and 4).

Table 3 Error matrices of RF classification results for non-vitrified dung (NVD), midden (MD), non-sites (NS), and vitrified dung (VD) based on the holdout sample for hyperspectral data resampled to the spectral resolutions of RapidEye and Sentinel-2 sensors

Table 4 RF classification accuracies of non-vitrified dung (NVD), midden (MD), non-sites (NS), and vitrified dung (VD) achieved using a holdout sample from hyperspectral data resampled to resolutions of different multispectral sensors

Table 5 Error matrices of SVM classification results for non-vitrified dung (NVD), midden (MD), non-sites (NS), and vitrified dung (VD) based on the holdout sample for hyperspectral data resampled to the spectral resolutions of RapidEye and Sentinel-2 sensors

The SVM classifier achieved overall accuracies of 86.67, 92.38, 82.86, 92.38, 91.43, and 86.67% and kappa coefficients of 0.8188, 0.8967, 0.7663, 0.8963, 0.8836, and 0.8192 when using a holdout sample from the data resampled to the spectral resolutions of GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5, and WorldView-2 sensors, respectively (Fig. 7; Table 6). Sentinel-2 achieved a high overall classification accuracy of 92.38% and a kappa coefficient of 0.8963, while RapidEye attained the lowest overall classification accuracy of 82.86% and a kappa coefficient of 0.7663 (Table 5). MD achieved a producer’s accuracy of 68.00% and a user’s accuracy of 68.00% for hyperspectral data resampled to the RapidEye sensor, while user’s and producer’s accuracies of 84.00 and 87.50% were achieved for the data resampled to the spectral resolution of the Sentinel-2 sensor. The same user’s accuracy of 100% was achieved for VD from the data resampled to the spectral resolutions of RapidEye and Sentinel-2, while producer’s accuracies of 83.33 and 94.44% were attained for the same datasets, respectively (Tables 5 and 6). Landsat 8 OLI also achieved a very high overall classification accuracy of 92.38%, equal to that of Sentinel-2, when using the SVM classifier (Table 6; Fig. 7). However, their producer’s and user’s accuracies for NVD and MD were different (Table 6). Further results on producer’s and user’s accuracies for the SVM classifier are provided in Table 6.

Table 6 SVM classification accuracy of non-vitrified dung (NVD), midden (MD), non-sites (NS), and vitrified dung (VD) achieved using a holdout sample from hyperspectral data resampled to resolutions of different multispectral sensors

Discussion

Following recent improvements in both the spatial and spectral resolutions of satellite sensors, numerous studies have reported successful mapping of subsurface and surface archaeological material using various multispectral remote sensing data (Agapiou et al. 2012a; Beck et al. 2007; Lasaponara and Masini 2006; Masini and Lasaponara 2007; Melillos et al. 2018; Parcak 2007; Schuetter et al. 2013; Thabeng et al. 2019). Archaeological materials produce localized signatures that alter soil chemical and physical properties differently. On the one hand, this is the very reason why spectral variations of soils (and vegetation growing on them) can be used to discriminate archaeological features from their surroundings. On the other hand, these variations are not universal and cannot be uncritically used to predict archaeological sites in different contexts. As such, creating spectral libraries specific to local archaeological features and testing the potential of different sensors before acquiring imagery for predictive classification and further analyses of data is necessary. This is particularly important when, in the context of limited access to funding, there is the necessity of acquiring commercial imagery at a high cost. This study is an example of the methods available for identifying the best spectral bands, and thus the most suitable multispectral sensors, for detecting archaeological sites characterized by known archaeological surface features. This specific case study tested the separability of middens, vitrified dung, and non-vitrified dung, all of which characterize Southern African Iron Age sites interpreted as farming community settlements. Nevertheless, the methods presented here can be applied in other time periods and regions around the world, anywhere distinctive archaeological features expressed by soil or vegetation variations can also be measured on the ground.

The importance of each wavelength (350–2500 nm) in resampled hyperspectral data was effectively assessed using mean decrease in accuracy in RF. Results are in line with studies of the spectral characteristics of soils showing that VIS (400–700 nm), NIR (700–1100 nm), and SWIR (1100–2500 nm) spectral regions all serve as powerful tools for recognizing soils qualitatively and quantitatively (Ben-Dor et al. 2009). These wavelengths are commonly associated with soil color, which is influenced by a number of soil chromophores, that is, parameters or substances (chemical or physical) that significantly affect the shape and nature of a soil spectrum, such as minerals, organic content, water, and particle size (Ben-Dor et al. 1997). This study found that wavelengths in the visible and shortwave infrared regions are the most important in discriminating natural soils and archaeological sites characterized by midden, non-vitrified dung, and vitrified dung deposits in all the resampled multispectral sensor data, followed by the near-infrared region, while the resampled bands in the red and red edge are the least important in the classification of the aforementioned soils. Furthermore, the additional bands in the new Very High Resolution (VHR) satellites, such as the yellow band in WorldView-2, are of limited importance (Fig. 4). The blue band is the most important variable for predicting the targeted archaeological classes using hyperspectral data resampled to GeoEye, RapidEye, Sentinel-2, and WorldView-2. However, the green band, which combines the wavelengths from the blue and the green sections of the electromagnetic spectrum in the SPOT 5 sensor, is the most important across all bands (Fig. 5). The SWIR bands from the SPOT 5, Landsat 8 OLI, and Sentinel-2 sensors also show very high importance in the discrimination of the different deposits under study.

Despite their high spatial resolution, the results in this study show that new VHR multispectral sensors (GeoEye and WorldView-2) do not have the best bands for detecting these archaeological deposits. These sensors only capture data in the visible and near-infrared regions; they do not have bands in the SWIR that are sensitive to the important soil characteristics discussed above.

Overall, Sentinel-2, SPOT 5, and Landsat 8 OLI sensors detect the highest number of important spectral bands, making them the most suitable in discriminating midden, non-vitrified dung, and vitrified dung deposits from their surrounding natural soils. This is also supported by the high prediction accuracies they attained when using SVM (91.43–92.38%) and RF (77.14–81.90%) classifiers to predict the abovementioned features. Notably, Sentinel-2 achieved the highest classification accuracies in SVM (92.38%) and RF (81.90%) for the archaeological features discussed here because of its high spectral resolution, which captures data across wide portions of the electromagnetic spectrum. This is in line with findings by Cavalli et al. (2007) that bands in SWIR are important in detecting soil characteristics related to archaeological remains. The high classification accuracies achieved by Landsat 8 OLI, Sentinel-2, and SPOT 5 are an important development for archaeological heritage managers and researchers, particularly on the African continent where funding is often limited. In fact, imagery captured by Sentinel-2 and Landsat 8 OLI is free and readily available via World Wide Web portals. As a result, these sensors are potentially cost-effective for survey, documentation, and monitoring of archaeological sites over large areas.

Although the spectral resolutions of Landsat 8 OLI, Sentinel-2, and SPOT 5 are ideal for the detection of archaeological features in the study area, their lower spatial resolutions (2.5–15 m panchromatic; 10–60 m multispectral) might pose a challenge for scaling models up to satellite sensors. Two main issues can be distinguished: the size of single features and their proximity in space. The diameters of individual byres and middens in the study area typically range from 3 to 18 m (Huffman pers. comm.). This is smaller than the minimum size of features that can be discriminated by the above satellites, which have a spatial resolution between 20 and 30 m in the SWIR region of the spectrum (Myint et al. 2011; Thabeng et al. 2019). The second issue is linked to the proximity of middens and byres (Calabrese 2000; Huffman 2009b), which are likely to be spectrally confused when captured in a single cell larger than 10 m.
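
The scale mismatch can be illustrated with a rough back-of-the-envelope calculation. The sketch below assumes, purely for illustration, that a byre or midden is a circle lying entirely within one square pixel; real features are irregular and usually straddle pixel boundaries. The function name pixel_fraction is hypothetical.

```python
import math


def pixel_fraction(diameter_m, pixel_m):
    """Share of one square pixel's area covered by a circular feature (capped at 1)."""
    feature_area = math.pi * (diameter_m / 2.0) ** 2
    return min(1.0, feature_area / pixel_m ** 2)


for diameter in (3, 18):    # diameter range reported for byres and middens
    for pixel in (20, 30):  # SWIR pixel sizes quoted in the text
        share = pixel_fraction(diameter, pixel)
        print(f"{diameter} m feature in a {pixel} m pixel: {share:.1%} of the pixel")
```

Under these assumptions even an 18 m feature fills only about 64% of a 20 m pixel, and a 3 m feature fills under 2%, so the recorded signal would largely be a mixture of the deposit and the surrounding soil.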

As such, this study recommends two avenues for further research using actual satellite imagery. The first approach will be to compare the potential of very high spatial resolution multispectral satellites (WorldView-2 and GeoEye) and lower spatial resolution satellites (Landsat 8 OLI and Sentinel-2) in detecting archaeological sites characterized by surface features. This is because, in general, the spatial resolutions (0.46 m panchromatic; 1.84 m multispectral) of VHR satellite sensors have the potential to capture individual features with diameters as small as 4 m (Thabeng et al. 2019). On the other hand, low spatial resolution satellite sensors have the spectral ability to detect soil characteristics, which show reflectance differences within the visible and SWIR bands as discussed above. The second approach for detecting the middens and non-vitrified and vitrified dung deposits will be to assess the potential of data fusion combining the spectral abilities of low spatial resolution satellite sensors and the spatial abilities of very high spatial resolution satellite sensors.

In general, the results of this study show that RF and SVM classifiers can accurately predict midden, non-vitrified dung, and vitrified dung materials based on their spectral characteristics. These results agree with those of other research using RF and SVM together in a number of spectral mapping applications for vegetation species (Ghosh et al. 2014; Sesnie et al. 2010), vegetation health (Abdel-Rahman et al. 2014), agriculture (Duro et al. 2012), land cover (Adam et al. 2014; Noi and Kappas 2018), soil texture (Bousbih et al. 2019), and archaeology (Thabeng et al. 2019). Comparison of the accuracies achieved by the two classifiers revealed that SVM achieved higher overall classification accuracies than RF in all datasets. For example, the RF classifier achieved overall accuracies of 78.10, 80.00, 72.38, 81.90, 77.14, and 77.14% when classifying hyperspectral data resampled to the spectral resolutions of GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5, and WorldView-2. Meanwhile, the SVM classifier achieved overall accuracies of 86.67, 92.38, 82.86, 92.38, 91.43, and 86.67% for the same datasets. This is in line with other studies (Adam et al. 2014; Sesnie et al. 2010) that attained variable classification accuracies between the two classifiers when dealing with similar data samples. However, other researchers (Pelletier et al. 2016; Thabeng et al. 2019) found that, even though there may be variations in their results, RF and SVM classifiers complement each other because of their different classification abilities.

RF and SVM also varied in their predictions of individual classes. In general, the RF classifier showed considerably more confusion between middens and non-vitrified dung than the SVM classifier (Tables 3 and 5). RF had its lowest producer’s accuracy (52.00%) and user’s accuracy (54.17%) for hyperspectral data resampled to the resolution of the SPOT 5 sensor. The lowest producer’s and user’s accuracies attained by the SVM classifier stood at 68.00% each and were from hyperspectral data resampled to the RapidEye sensor. The confusion between MD and NVD might be a result of chemical similarities between the two (Thabeng et al. 2019). Another reason might be signature confusion caused by post-depositional processes such as erosion, which is rampant in the study area and mixes the two deposits, which are located in close proximity to each other. Sesnie et al. (2010) posit that SVM is a superior method for solving complex classification problems, which would make it a better predictor of archaeological sites in the study area, particularly those characterized by the presence of MD and NVD in close proximity. Both classifiers achieved the highest user’s accuracy (100%) for VD across all the datasets.

In sum, the approach presented is a cost-effective manner of using information on the spectral characteristics of archaeological surface features (acquired through field spectroscopy in limited sample areas) to gain insight on the potential of different sensors and classifiers to predict archaeological sites. As remote sensing sensors all have different spectral and spatial resolutions, the selection of the appropriate data for archaeological applications is challenging. This assessment step helps to maximize the time and effort put into image treatment and classification. Doing so in advance of image acquisition will aid the exploration of vast and understudied areas in Africa and across the globe. This method can be applied anywhere as a cost-effective way of identifying sensors with spectral abilities to predict the archaeological material of interest. If the results are positive, the mapping of archaeological sites could be scaled up to data from operational sensors on satellite platforms. Properly identifying appropriate images and classifiers for the detection of archaeological features leads to accurate and reliable predictive maps. These maps have the potential to capture the archaeological landscapes in a synoptic manner that would be challenging (or even unachievable) by other means (Klehm et al. 2019).

Conclusions

This study investigates the possibility of discriminating archaeological features using hyperspectral data resampled to the spectral resolutions of the most widely used multispectral sensors (GeoEye, Landsat 8 OLI, RapidEye, Sentinel-2, SPOT 5, and WorldView-2). The following findings can be reported:

The bands within the visible and SWIR portions of the electromagnetic spectrum are the most important for predicting the natural soils and archaeological sites characterized by middens, vitrified dung, and non-vitrified dung—all of which are in line with the spectral resolutions of Sentinel-2, SPOT 5, and Landsat 8 OLI. This, therefore, makes them the most suitable sensors for detecting archaeological sites. Their lower spatial resolution is nevertheless a limitation and restricts the detection to larger byres and middens.
The high classification accuracies achieved in this study demonstrate that multispectral sensors have the ability to detect middens, non-vitrified dung, and vitrified dung. In general, higher classification accuracies were achieved by the SVM classifier than by the RF classifier. The highest classification accuracies were achieved when classifying data resampled to the resolution of the Sentinel-2 sensor using both RF (81.90%) and SVM (92.38%). Landsat 8 OLI matched the highest accuracy (92.38%) when using the SVM classifier.
The green band and SWIR bands in SPOT 5 satellite sensor were the most important bands in discriminating between midden, vitrified dung, non-vitrified dung, and non-sites. The blue band is the most important discrimination band in the GeoEye, RapidEye, Sentinel-2, and WorldView-2 sensors. Other important bands included the SWIR bands in Landsat 8 OLI and Sentinel-2 and the NIR bands in sensors without SWIR bands.
VHR satellite sensors, although characterized by lower spectral resolution, still possess important bands for discriminating archaeological and non-archaeological features; they also achieved high classification accuracies in this study. These sensors offer the potential of overcoming the limitation of higher spectral resolution sensors that can only detect larger archaeological features.

The results of this study have revealed the prospects for discriminating middens, natural soils, vitrified dung, and non-vitrified dung by means of hyperspectral data resampled to the resolutions of multispectral satellite sensors. This, in turn, offers an opportunity to scale this approach up to spaceborne sensors for mapping and monitoring archaeological features. This is an important development for archaeological researchers and heritage managers because it can facilitate satellite image identification and classification for predictive modelling across vast regions. This will reduce the challenges inherent to traditional fieldwalking surveys, including high costs, restricted land access, long surveying periods, wild animals, and active war zones. This is one of the earliest studies to assess the potential of several sensors to detect surface archaeological material. Although hyperspectral data resampled to the spectral resolutions of low spatial resolution sensors (Landsat 8 OLI, Sentinel-2, and SPOT 5) achieved relatively high classification accuracies, this study recommends further research using actual images to assess the potential of very high spatial resolution satellites (GeoEye and WorldView-2) in detecting archaeological sites characterized by surface features. As with other remote sensing applications, the trade-offs among multispectral sensors, in terms of their spectral and spatial resolutions, need careful consideration and assessment within the context of the features being studied.

References

Abdel-Rahman, E. M., Mutanga, O., Adam, E., & Ismail, R. (2014). Detecting Sirex noctilio grey-attacked and lightning-struck pine trees using airborne hyperspectral data, random forest and support vector machines classifiers. ISPRS Journal of Photogrammetry and Remote Sensing, 88, 48–59.
Adam, E., Mutanga, O., Rugege, D., & Ismail, R. (2009). Field spectrometry of papyrus vegetation (Cyperus papyrus L.) in swamp wetlands of St Lucia, South Africa. IEEE, IV, 260–263.
Adam, E., Mutanga, O., Rugege, D., & Ismail, R. (2012). Discriminating the papyrus vegetation (Cyperus papyrus L.) and its co-existent species using random forest and hyperspectral data resampled to HYMAP. International Journal of Remote Sensing, 33, 552–569.
Adam, E., Mutanga, O., Odindi, J., & Abdel-Rahman, E. M. (2014). Land-use/cover classification in a heterogeneous coastal landscape using RapidEye imagery: Evaluating the performance of random forest and support vector machines classifiers. International Journal of Remote Sensing, 35, 3440–3458.
Adam, E., Deng, H., Odindi, J., Abdel-Rahman, E. M., & Mutanga, O. (2017). Detecting the early stage of Phaeosphaeria leaf spot infestations in maize crop using in situ hyperspectral data and guided regularized random forest algorithm. Journal of Spectroscopy, 2017. https://doi.org/10.1155/2017/6961387.
Agapiou, A., Hadjimitsis, D., Sarris, A., Themistocleous, K., & Papadavid, G. (2010). Hyperspectral ground truth data for the detection of buried architectural remains. In M. Ioannides, D. Fellner, A. Georgopoulos, & D. G. Hadjimitsis (Eds.), Digital heritage: Lecture notes in computer science (Vol. 6436, pp. 318–331). New York: Springer.
Agapiou, A., Hadjimitsis, D. G., & Alexakis, D. D. (2012a). Evaluation of broadband and narrowband vegetation indices for the identification of archaeological crop marks. Remote Sensing, 4, 3892–3919.
Agapiou, A., Hadjimitsis, D. G., Alexakis, D., & Sarris, A. (2012b). Observatory validation of Neolithic tells (‘Magoules’) in the Thessalian plain, Central Greece, using hyperspectral spectroradiometric data. Journal of Archaeological Science, 39, 1499–1512.
Agapiou, A., Alexakis, D. D., & Hadjimitsis, D. G. (2014a). Spectral sensitivity of ALOS, ASTER, IKONOS, LANDSAT and SPOT satellite imagery intended for the detection of archaeological crop marks. International Journal of Digital Earth, 7, 351–372.
Agapiou, A., Alexakis, D. D., Sarris, A., & Hadjimitsis, D. G. (2014b). Evaluating the potentials of Sentinel-2 for archaeological perspective. Remote Sensing, 6, 2176–2194.
Ahmad, S., Kalra, A., & Stephen, H. (2010). Estimating soil moisture using remote sensing data: A machine learning approach. Advances in Water Resources, 33, 69–80. https://doi.org/10.1016/j.advwatres.2009.10.008.
Altaweel, M. (2005). The use of ASTER satellite imagery in archaeological contexts. Archaeological Prospection, 12, 151–166.
Analytical Spectral Devices, Inc. (2018). FieldSpec 4 Standard-Res Spectroradiometer. Available at www.asdi.com/products-and-services/fieldspec-spectroradiometers/fieldspec-4-standard-res. Accessed 20 Nov 2018.
Antonites, A., & Ashley, C. Z. (2016). The mobilities turn and archaeology: New perspectives on socio-political complexity in thirteenth-century northern South Africa. Azania: Archaeological Research in Africa, 51, 469–488. https://doi.org/10.1080/0067270X.2016.1249586.
Banning, E. B., Hawkins, A. L., & Stewart, S. T. (2006). Detection functions for archaeological survey. American Antiquity, 71, 723–742.
Beck, A. R. (2007). Archaeological site detection: The importance of contrast. In Proceedings of the 2007 Annual Conference of the Remote Sensing and Photogrammetry Society, Newcastle University, Sept. 11–14, 2007 (pp. 307–312). Newcastle: The Remote Sensing and Photogrammetry Society.
Beck, A., Philip, G., Abdulkarim, M., & Donoghue, D. (2007). Evaluation of Corona and Ikonos high resolution satellite imagery for archaeological prospection in western Syria. Antiquity, 81, 161–175.
Belgiu, M., & Drăguţ, L. (2016). Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24–31.
Ben-Dor, E. (2002). Quantitative remote sensing of soil properties. Advances in Agronomy, 75, 173–243. https://doi.org/10.1016/S0065-2113(02)75005-0.
Ben-Dor, E., Inbar, Y., & Chen, Y. (1997). The reflectance spectra of organic matter in the visible near-infrared and short wave infrared region (400–2500 nm) during a controlled decomposition process. Remote Sensing of Environment, 61, 1–15.
Ben-Dor, E., Chabrillat, S., Demattê, J. A. M., Taylor, G. R., Hill, J., Whiting, M. L., & Sommer, S. (2009). Using imaging spectroscopy to study soil properties. Remote Sensing of Environment, 113(Suppl. 1), S38–S55. https://doi.org/10.1016/j.rse.2008.09.019.
Ben-Dor, E., Ong, C., & Lau, I. C. (2015). Reflectance measurements of soils in the laboratory: Standards and protocols. Geoderma, 245, 112–124.
Ben-Dor, E., Granot, A., & Notesco, G. (2017). A simple apparatus to measure soil spectral information in the field under stable conditions. Geoderma, 306, 73–80. https://doi.org/10.1016/j.geoderma.2017.06.025.
Ben-Hur, A., & Weston, J. (2010). A user’s guide to support vector machines. In O. Carugo & F. Eisenhaber (Eds.), Data mining techniques for the life sciences (pp. 223–239). New York: Humana Press.
Biagetti, S., Merlo, S., Adam, E., Lobo, A., Conesa, F. C., Knight, J., Bekrani, H., Crema, E. R., Alcaina-Mateos, J., & Madella, M. (2017). High and medium resolution satellite imagery to evaluate late Holocene human–environment interactions in arid lands: A case study from the Central Sahara. Remote Sensing, 9, 351.
Bordy, E. M., & Catuneanu, O. (2002). Sedimentology of the Beaufort-Molteno Karoo fluvial strata in the Tuli Basin, South Africa. South African Journal of Geology, 105, 51–66.
Bousbih, S., Zribi, M., Pelletier, C., Gorrab, A., Lili-Chabaane, Z., Baghdadi, N., Ben Aissa, N., & Mougenot, B. (2019). Soil texture estimation using radar and optical data from Sentinel-1 and Sentinel-2. Remote Sensing, 11, 1520. https://doi.org/10.3390/rs11131520.
Bradbury, G., Mitchell, K., & Weyrich, T. (2013). Multi-spectral material classification in landscape scenes using commodity hardware. In R. Wilson, E. Hancock, A. Bors, & W. Smith (Eds.), Computer analysis of images and patterns (pp. 209–216). Berlin: Springer.
Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123–140.
Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32.
Breiman, L., & Cutler, A. (2007). Random forests-classification description. Berkeley: Department of Statistics. Available at https://www.stat.berkeley.edu/~breiman/RandomForests/cc_home.htm. Accessed 24 Nov 2018.
Brieuc, M. S. O., Ono, K., Drinan, D. P., & Naish, K. A. (2015). Integration of Random Forest with population-based outlier analyses provides insight on the genomic basis and evolution of run timing in Chinook salmon (Oncorhynchus tshawytscha). Molecular Ecology, 24, 2729–2746.
Burger, J., & Gowen, A. (2011). Data handling in hyperspectral image analysis. Chemometrics and Intelligent Laboratory Systems, 108, 13–22.
Calabrese, J. A. (2000). Interregional interaction in southern Africa: Zhizo and Leopard’s Kopje relations in northern South Africa, southwestern Zimbabwe, and eastern Botswana, AD 1000 to 1200. African Archaeological Review, 17, 183–210.
Castaldi, F., Palombo, A., Santini, F., Pascucci, S., Pignatti, S., & Casa, R. (2016). Evaluation of the potential of the current and forthcoming multispectral and hyperspectral imagers to estimate soil texture and organic carbon. Remote Sensing of Environment, 179, 54–65.
Cavalli, R. M., Colosi, F., Palombo, A., Pignatti, S., & Poscolieri, M. (2007). Remote hyperspectral imagery as a support to archaeological prospection. Journal of Cultural Heritage, 8, 272–283.
Cavalli, R. M., Licciardi, G. A., & Chanussot, J. (2013). Detection of anomalies produced by buried archaeological structures using nonlinear principal component analysis applied to airborne hyperspectral image. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 6, 659–669. https://doi.org/10.1109/JSTARS.2012.2227301.
Cerra, D., Agapiou, A., Cavalli, R. M., & Sarris, A. (2018). An objective assessment of hyperspectral indicators for the detection of buried archaeological relics. Remote Sensing, 10, 500.
Chagas, C. d. S., de Carvalho Jr., W., Bhering, S. B., & Calderano Filho, B. (2016). Spatial prediction of soil surface texture in a semiarid region using random forest and multiple linear regressions. CATENA, 139, 232–240. https://doi.org/10.1016/j.catena.2016.01.001.
Chen, Y.-W., & Lin, C.-J. (2006). Combining SVMs with various feature selection strategies. In I. Guyon, M. Nikravesh, S. Gunn, & L. A. Zadeh (Eds.), Feature extraction: Studies in fuzziness and soft computing (Vol. 207, pp. 315–324). Berlin: Springer.
Chinoda, G., Moyce, W., Matura, N., & Owen, R. (2009). Baseline report on the geology of the Limpopo Basin Area. WaterNet Working Paper 7.
Chirikure, S. (2013). Heritage conservation in Africa: The good, the bad, and the challenges. South African Journal of Science, 109, 1–3. https://doi.org/10.1590/sajs.2013/a003.
Chirikure, S., Manyanga, M., Pollard, A. M., Bandama, F., Mahachi, G., & Pikirayi, I. (2014). Zimbabwe culture before Mapungubwe: New evidence from Mapela Hill, South-Western Zimbabwe. PLoS One, 9, e111224.
Cochrane, M. A. (2000). Using vegetation reflectance variability for species level classification of hyperspectral data. International Journal of Remote Sensing, 21, 2075–2087. https://doi.org/10.1080/01431160050021303.
Congalton, R. G. (1991). A review of assessing the accuracy of classifications of remotely sensed data. Remote Sensing of Environment, 37, 35–46.
Connah, G. (2004). Forgotten Africa: An introduction to its archaeology. Abingdon: Routledge.
Corrie, R. K. (2011). Detection of ancient Egyptian archaeological sites using satellite remote sensing and digital image processing. Proc. SPIE 8181, Earth Resources and Environmental Remote Sensing/GIS Applications II, 81811B (26 October 2011). International Society for Optics and Photonics. https://doi.org/10.1117/12.898230.
Cozzolino, D., & Moron, A. (2003). The potential of near-infrared reflectance spectroscopy to analyse soil chemical and physical characteristics. The Journal of Agricultural Science, 140, 65–71.
Crawford, O. G. S. (1923). Air survey and archaeology. Geographical Journal, 342–360.
Danese, M., Masini, N., Biscione, M., & Lasaponara, R. (2014). Predictive modeling for preventive archaeology: Overview and case study. Open Geosciences, 6. https://doi.org/10.2478/s13533-012-0160-5.
De Laet, V., Paulissen, E., & Waelkens, M. (2007). Methods for the extraction of archaeological features from very high-resolution Ikonos-2 remote sensing imagery, Hisar (Southwest Turkey). Journal of Archaeological Science, 34, 830–841.
De Laet, V., Paulissen, E., Meuleman, K., & Waelkens, M. (2009). Effects of image characteristics on the identification and extraction of archaeological features from Ikonos-2 and Quickbird-2 imagery: Case study Sagalassos (Southwest Turkey). International Journal of Remote Sensing, 30, 5655–5668. https://doi.org/10.1080/01431160802705821.
Denbow, J. R. (1979). Cenchrus ciliaris: An ecological indicator of Iron Age middens using aerial photography in eastern Botswana. South African Journal of Science, 75, 405–408.
Denbow, J. R. (1986). A new look at the later prehistory of the Kalahari. The Journal of African History, 27, 3–28.
Denbow, J. R. (1990). Congo to Kalahari: Data and hypotheses about the political economy of the western stream of the Early Iron Age. African Archaeological Review, 8, 139–175.
Dhau, I., Adam, E., Mutanga, O., Ayisi, K., Abdel-Rahman, E. M., Odindi, J., & Masocha, M. (2018a). Testing the capability of spectral resolution of the new multispectral sensors on detecting the severity of grey leaf spot disease in maize crop. Geocarto International, 33, 1223–1236. https://doi.org/10.1080/10106049.2017.1343391.
Dhau, I., Adam, E., Mutanga, O., & Ayisi, K. K. (2018b). Detecting the severity of maize streak virus infestations in maize crop using in situ hyperspectral data. Transactions of the Royal Society of South Africa, 73, 8–15. https://doi.org/10.1080/0035919X.2017.1370034.
Díaz-Uriarte, R., & De Andres, S. A. (2006). Gene selection and classification of microarray data using random forest. BMC Bioinformatics, 7, 3.
Doneus, M., Verhoeven, G., Atzberger, C., Wess, M., & Ruš, M. (2014). New ways to extract archaeological information from hyperspectral pixels. Journal of Archaeological Science, 52, 84–96.
Duro, D. C., Franklin, S. E., & Dubé, M. G. (2012). A comparison of pixel-based and object-based image analysis with selected machine learning algorithms for the classification of agricultural landscapes using SPOT-5 HRG imagery. Remote Sensing of Environment, 118, 259–272.
Eloff, J. F., & Meyer, A. (1981). The Greefswald sites. In E. A. Voigt (Ed.), Guide to archaeological sites in the northern and eastern Transvaal (pp. 7–22). Pretoria: Transvaal Museum.
Fagan, B. (1964). The Greefswald sequence: Bambandyanalo and Mapungubwe. The Journal of African History, 5, 337–361. https://doi.org/10.1017/S0021853700005053.
Farhat, M. R., Sultana, R., Iartchouk, O., Bozeman, S., Galagan, J., Sisk, P., Stolte, C., Nebenzahl-Guimaraes, H., Jacobson, K., & Sloutsky, A. (2016). Genetic determinants of drug resistance in Mycobacterium tuberculosis and their diagnostic value. American Journal of Respiratory and Critical Care Medicine, 194, 621–630.
Featherstone, R., Horne, P., Macleod, D., & Bewley, R. (1999). Aerial reconnaissance over England in summer 1996. Archaeological Prospection, 6, 47–62. https://doi.org/10.1002/(SICI)1099-0763(199906)6:2<47::AID-ARP113>3.0.CO;2-Y.
Feng, J., Jiao, L., Liu, F., Sun, T., & Zhang, X. (2016). Unsupervised feature selection based on maximum information and minimum redundancy for hyperspectral images. Pattern Recognition, 51, 295–309.
Fleisher, J., & LaViolette, A. (1999). Elusive wattle-and-daub: Finding the hidden majority in the archaeology of the Swahili. Azania: Archaeological Research in Africa, 34, 87–108. https://doi.org/10.1080/00672709909511473.
Foard, G. (1977). Systematic fieldwalking and the investigation of Saxon settlement in Northamptonshire. World Archaeology, 9, 357–374.
Fowler, M. J. F. (2002). Satellite remote sensing and archaeology: A comparative study of satellite imagery of the environs of Figsbury Ring, Wiltshire. Archaeological Prospection, 9, 55–69.
Genuer, R., Poggi, J.-M., & Tuleau-Malot, C. (2010). Variable selection using random forests. Pattern Recognition Letters, 31, 2225–2236.
Ghosh, A., Fassnacht, F. E., Joshi, P. K., & Koch, B. (2014). A framework for mapping tree species combining hyperspectral and LiDAR data: Role of selected classifiers and sensor across three spatial scales. International Journal of Applied Earth Observation and Geoinformation, 26, 49–63.
Gojda, M., & Hejcman, M. (2012). Cropmarks in main field crops enable the identification of a wide spectrum of buried features on archaeological sites in Central Europe. Journal of Archaeological Science, 39, 1655–1664.
Götze, A. R., Cilliers, S. S., & Bezuidenhout, H. (2008). Analysis of the vegetation of the sandstone ridges (Ib land type) of the north-eastern parts of the Mapungubwe National Park, Limpopo Province, South Africa. Koedoe: African Protected Area Conservation and Science, 50, 72–81.
Grimm, R., Behrens, T., Märker, M., & Elsenbeer, H. (2008). Soil organic carbon concentrations and stocks on Barro Colorado Island — Digital soil mapping using random forests analysis. Geoderma, 146, 102–113. https://doi.org/10.1016/j.geoderma.2008.05.008.
Haaland, R. (1995). Sedentism, cultivation, and plant domestication in the Holocene middle Nile region. Journal of Field Archaeology, 22, 157–174.
Hanisch, E. O. M. (1981). Schroda: A Zhizo site in the northern Transvaal. In E. A. Voigt (Ed.), Guide to archaeological sites in the northern and eastern Transvaal (pp. 37–53). Pretoria: Transvaal Museum.
Hanisch, E. O. M. (2002). Schroda: The archaeological evidence. In J. A. Van Schalkwyk (Ed.), Sculptured in clay: Iron Age figurines from Schroda, Limpopo Province, South Africa (pp. 20–39). Pretoria: National Cultural History Museum.
Hejcman, M., & Smrž, Z. (2010). Cropmarks in stands of cereals, legumes and winter grape indicate sub-soil archaeological features in the agricultural landscape of Central Europe. Agriculture, Ecosystems & Environment, 138, 348–354. https://doi.org/10.1016/j.agee.2010.06.004.
Hitchings, P. M. N., Abu Jayyab, K., Bikoulis, P., & Banning, E. B. (2013). A Bayesian approach to archaeological survey in North-West Jordan. Antiquity, 87, 336.
Hitchner, R. B. (1995). Historical text and archaeological context in Roman North Africa: The Albertini tablets and the Kasserine survey. In D. B. Small (Ed.), Methods in the Mediterranean: Historical and archaeological views on texts and archaeology (pp. 124–142). Leiden: E. J. Brill.
Hsu, C.-W., Chang, C.-C., & Lin, C.-J. (2003). A practical guide to support vector classification. Taiwan: Department of Computer Science and Information Engineering. Available at https://www.csie.ntu.edu.tw/~cjlin/papers/guide/guide.pdf. Accessed 24 Nov 2018.
Huffman, T. N. (1982). Archaeology and ethnohistory of the African Iron Age. Annual Review of Anthropology, 11, 133–150.
Huffman, T. N. (1986). Iron Age settlement patterns and the origins of class distinctions in southern Africa. Advances in World Archaeology, 5, 291–338.
Huffman, T. N. (2000). Mapungubwe and the origins of the Zimbabwe culture. Goodwin Series, 8, 14–29.
Huffman, T. N. (2001). The Central Cattle Pattern and interpreting the past. Southern African Humanities, 13, 19–35.
Huffman, T. N. (2007). Handbook to the Iron Age: The archaeology of pre-colonial farming societies in southern Africa. Pietermaritzburg: University of KwaZulu-Natal Press.
Huffman, T. N. (2008). Climate change during the Iron Age in the Shashe-Limpopo Basin, southern Africa. Journal of Archaeological Science, 35, 2032–2047.
Huffman, T. N. (2009a). Origins of Mapungubwe project, progress report 2008 prepared for De Beers, the NRF, SAHRA and SANParks, Unpublished, pp. 1–65.
Huffman, T. N. (2009b). Mapungubwe and Great Zimbabwe: The origin and spread of social complexity in southern Africa. Journal of Anthropological Archaeology, 28, 37–54.
Huffman, T. N. (2011). Origins of Mapungubwe project: Progress report, 2009–2010. Johannesburg: Archaeological Resources Management, School of Geography, Archaeology & Environmental Studies, University of the Witwatersrand.
Huffman, T. N. (2012). Historical archaeology of the Mapungubwe area: Boer, Birwa, Sotho-Tswana and Machete. Southern African Humanities, 24, 33–59.
Huffman, T. N., & Du Piesanie, J. (2011). Khami and the Venda in the Mapungubwe landscape. Journal of African Archaeology, 9, 189–206.
Huffman, T. N., Elburg, M., & Watkeys, M. (2013). Vitrified cattle dung in the Iron Age of southern Africa. Journal of Archaeological Science, 40, 3553–3560.
Kankpeyeng, B. W., & DeCorse, C. R. (2004). Ghana’s vanishing past: Development, antiquities, and the destruction of the archaeological record. African Archaeological Review, 21, 89–128. https://doi.org/10.1023/B:AARR.0000030786.24067.19.
Keay, S. J., Parcak, S. H., & Strutt, K. D. (2014). High resolution space and ground-based remote sensing and implications for landscape archaeology: The case from Portus, Italy. Journal of Archaeological Science, 52, 277–292.
Khandlhela, M., & May, J. (2006). Poverty, vulnerability and the impact of flooding in the Limpopo Province, South Africa. Natural Hazards, 39, 275–287.
Klehm, C. E. (2017). Local dynamics and the emergence of social inequality in Iron Age Botswana. Current Anthropology, 58, 604–633. https://doi.org/10.1086/693960.
Klehm, C. E., & Ernenwein, E. G. (2016). Iron Age transformations at Mmadipudi Hill, Botswana: Identifying spatial organization through electromagnetic induction survey. African Archaeological Review, 33, 45–59.
Klehm, C., Barnes, A., Follett, F., Simon, K., Kiahtipes, C., & Mothulatshipi, S. (2019). Toward archaeological predictive modeling in the Bosutswe region of Botswana: Utilizing multispectral satellite imagery to conceptualize ancient landscapes. Journal of Anthropological Archaeology, 54, 68–83. https://doi.org/10.1016/j.jaa.2019.02.002.
Kohler, T. A., & Parker, S. C. (1986). Predictive models for archaeological resource location. Advances in Archaeological Method and Theory, 9, 397–452.
Kokaly, R. F. (2001). Investigating a physical basis for spectroscopic estimates of leaf nitrogen concentration. Remote Sensing of Environment, 75, 153–161.
Koleini, F., Prinsloo, L. C., Biemond, W. M., Colomban, P., Ngo, A., Boeyens, J. C. A., & van der Ryst, M. M. (2016). Towards refining the classification of glass trade beads imported into Southern Africa from the 8th to the 16th century AD. Journal of Cultural Heritage, 19, 435–444. https://doi.org/10.1016/j.culher.2015.11.003.
Lange, D. (2007). The emergence of social complexity in the southern Chad Basin towards 500 BC: Archaeological and other evidence. Borno Museum Society Newsletter, 1, 68–71.
Lasaponara, R., & Masini, N. (2006). Identification of archaeological buried remains based on the normalized difference vegetation index (NDVI) from QuickBird satellite data. IEEE Geoscience and Remote Sensing Letters, 3, 325–328.
Lasaponara, R., Leucci, G., Masini, N., & Persico, R. (2014). Investigating archaeological looting using satellite images and GEORADAR: The experience in Lambayeque in North Peru. Journal of Archaeological Science, 42, 216–230.
Lasaponara, R., Elfadaly, A., & Attia, W. (2016). Low cost space technologies for operational change detection monitoring around the archaeological area of Esna-Egypt. In O. Gervasi, B. Murgante, S. Misra, A. M. A. C. Rocha, C. M. Torre, D. Taniar, B. O. Apduhan, E. Stankova, & S. Wang (Eds.), Computational science and its applications – ICCSA 2016, lecture notes in computer science (pp. 611–621). New York: Springer.
Lebedev, A. V., Westman, E., Van Westen, G. J. P., Kramberger, M. G., Lundervold, A., Aarsland, D., Soininen, H., Kłoszewska, I., Mecocci, P., & Tsolaki, M. (2014). Random Forest ensembles for detection and prediction of Alzheimer’s disease with a good between-cohort robustness. NeuroImage: Clinical, 6, 115–125.
Li, W., Chen, C., Su, H., & Du, Q. (2015). Local binary patterns and extreme learning machine for hyperspectral imagery classification. IEEE Transactions on Geoscience and Remote Sensing, 53, 3681–3693.
Liang, S. J., Wang, J., & Jiang, B. (2012). A systematic view of remote sensing. In S. Liang, X. Li, & J. Wang (Eds.), Advanced remote sensing (pp. 1–31). New York: Academic Press.
Lillesand, T., Kiefer, R. W., & Chipman, J. (2008). Remote sensing and image interpretation. New York: Wiley.
Lin, H.-T., & Lin, C.-J. (2003). A study on sigmoid kernels for SVM and the training of non-PSD kernels by SMO-type methods. Neural Computation, 1–32.
Mabulla, A. Z. P. (2001). Strategy for cultural heritage management (CHM) in Africa: A case study. African Archaeological Review, 17, 211–233.
MacDonald, K. (2013). Complex societies, urbanism, and trade in the Western Sahel. In P. Mitchell & P. Lane (Eds.), The Oxford handbook of African archaeology (pp. 829–844). Oxford: Oxford University Press.
Mansour, K., Mutanga, O., Everson, T., & Adam, E. (2012). Discriminating indicator grass species for rangeland degradation assessment using hyperspectral data resampled to AISA eagle resolution. ISPRS Journal of Photogrammetry and Remote Sensing, 70, 56–65.
Manyanga, M. (2007). Resilient landscapes: Socio-environmental dynamics in the Shashi-Limpopo basin, southern Zimbabwe c. AD 800 to the present. Uppsala: Societas Archaeologica Upsaliensis.
Masini, N., & Lasaponara, R. (2007). Investigating the spectral capability of QuickBird data to detect archaeological remains buried under vegetated and not vegetated areas. Journal of Cultural Heritage, 8, 53–60.
Mason, R. J. (1968). Transvaal and Natal Iron Age settlement revealed by aerial photography and excavation. African Studies, 27, 167–180.
Mattingly, D. J., Lahr, M., Armitage, S., Barton, H., Dore, J., Drake, N., Foley, R., Merlo, S., Salem, M., & Stock, J. (2007). Desert migrations: People, environment and culture in the Libyan Sahara. Libyan Studies, 38, 115–156.
Maxwell, A. E., Warner, T. A., & Fang, F. (2018). Implementation of machine-learning classification in remote sensing: An applied review. International Journal of Remote Sensing, 39, 2784–2817. https://doi.org/10.1080/01431161.2018.1433343.
McHugh, M. L. (2012). Interrater reliability: The kappa statistic. Biochemia Medica, 22, 276–282.
McIntosh, S. K. (1993). Archaeological heritage management and site inventory systems in Africa. Journal of Field Archaeology, 20, 500–504.
McIntosh, S. K., & McIntosh, R. J. (1993). Field survey in the tumulus zone of Senegal. African Archaeological Review, 11, 73–107.
Melillos, M., Themistocleous, K., Agapiou, A., Michaelides, S., & Hadjimitsis, D. G. (2018). Detecting underground structures in Cyprus using field spectroscopy. Proc. SPIE 10773, Sixth International Conference on Remote Sensing and Geoinformation of the Environment (RSCy2018), 107730A (6 August 2018). https://doi.org/10.1117/12.2324270
Metternicht, G., Zinck, J. A., Blanco, P. D., & Del Valle, H. F. (2010). Remote sensing of land degradation: Experiences from Latin America and the Caribbean. Journal of Environmental Quality, 39, 42–61.
Meyer, A. (2000). K2 and Mapungubwe. Goodwin Series, 8, 4–13.
Meyer, D., Dimitriadou, E., Hornik, K., Weingessel, A., Leisch, F., Chang, C.-C., Lin, C.-C., & Meyer, M. D. (2017). Package ‘e1071.’
Mitchell, P. (2013). Early farming communities of southern and south-Central Africa. In P. Mitchell & P. Lane (Eds.), The Oxford handbook of African archaeology (pp. 657–670). Oxford: Oxford University Press.
Mothulatshipi, S. M. (2008). Landscape archaeology of the later farming communities of the Shashe Limpopo Basin, eastern Botswana: Land use diversity and human behavior. PhD Dissertation, University of Edinburgh.
Mountrakis, G., Im, J., & Ogole, C. (2011). Support vector machines in remote sensing: A review. ISPRS Journal of Photogrammetry and Remote Sensing, 66, 247–259.
Mureriwa, N., Adam, E., Sahu, A., & Tesfamichael, S. (2016). Examining the spectral separability of Prosopis glandulosa from co-existent species using field spectral measurement and guided regularized random forest. Remote Sensing, 8, 144.
Musyoki, A., Thifhulufhelwi, R., & Murungweni, F. M. (2016). The impact of and responses to flooding in Thulamela Municipality, Limpopo Province South Africa. Jàmbá: Journal of Disaster Risk Studies, 8. https://doi.org/10.4102/jamba.v8i2.166.
Mutanga, O., Adam, E., & Cho, M. A. (2012). High density biomass estimation for wetland vegetation using WorldView-2 imagery and random forest regression algorithm. International Journal of Applied Earth Observation and Geoinformation, 18, 399–406.
Mutanga, O., Adam, E., Adjorlolo, C., & Abdel-Rahman, E. M. (2015). Evaluating the robustness of models developed from field spectral data in predicting African grass foliar nitrogen concentration using WorldView-2 image as an independent test dataset. International Journal of Applied Earth Observation and Geoinformation, 34, 178–187.
Myint, S. W., Gober, P., Brazel, A., Grossman-Clarke, S., & Weng, Q. (2011). Per-pixel vs. object-based classification of urban land cover extraction using high spatial resolution imagery. Remote Sensing of Environment, 115, 1145–1161.
Nawar, S., Buddenbaum, H., Hill, J., & Kozak, J. (2014). Modeling and mapping of soil salinity with reflectance spectroscopy and Landsat data using two quantitative methods (PLSR and MARS). Remote Sensing, 6, 10813–10834.
Nienaber, W. C., Keough, N., Steyn, M., & Meiring, J. H. (2008). Reburial of the Mapungubwe human remains: An overview of process and procedure. The South African Archaeological Bulletin, 63, 164–169. https://doi.org/10.2307/20475012.
Nocita, M., Stevens, A., Toth, G., Panagos, P., van Wesemael, B., & Montanarella, L. (2014). Prediction of soil organic carbon content by diffuse reflectance spectroscopy using a local partial least square regression approach. Soil Biology and Biochemistry, 68, 337–347.
Noi, P. T., & Kappas, M. (2018). Comparison of random forest, k-nearest neighbor, and support vector machine classifiers for land cover classification using Sentinel-2 imagery. Sensors, 18, 18. https://doi.org/10.3390/s18010018.
Ogen, Y., Goldshleger, N., & Ben-Dor, E. (2017). 3D spectral analysis in the VNIR–SWIR spectral region as a tool for soil classification. Geoderma, 302, 100–110.
Oonk, S., Slomp, C. P., Huisman, D. J., & Vriend, S. P. (2009). Geochemical and mineralogical investigation of domestic archaeological soil features at the Tiel-Passewaaij site, the Netherlands. Journal of Geochemical Exploration, 101, 155–165.
Opitz, R., & Herrmann, J. (2018). Recent trends and long-standing problems in archaeological remote sensing. Journal of Computer Applications in Archaeology, 1, 19–41. https://doi.org/10.5334/jcaa.11.
Oumar, Z., & Mutanga, O. (2010). Predicting plant water content in Eucalyptus grandis forest stands in KwaZulu-Natal, South Africa using field spectra resampled to the Sumbandila satellite sensor. International Journal of Applied Earth Observation and Geoinformation, 12, 158–164. https://doi.org/10.1016/j.jag.2010.02.002.
Pal, M., & Mather, P. M. (2003). An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sensing of Environment, 86, 554–565. https://doi.org/10.1016/S0034-4257(03)00132-9.
Pal, M., & Mather, P. M. (2005). Support vector machines for classification in remote sensing. International Journal of Remote Sensing, 26, 1007–1011.
Parcak, S. H. (2007). Satellite remote sensing methods for monitoring archaeological tells in the Middle East. Journal of Field Archaeology, 32, 65–81.
Parcak, S. H. (2015). Archaeological looting in Egypt: A geospatial view (case studies from Saqqara, Lisht, and el Hibeh). Near Eastern Archaeology, 78, 196–203.
Pelletier, C., Valero, S., Inglada, J., Champion, N., & Dedieu, G. (2016). Assessing the robustness of random forests to map land cover with high resolution satellite image time series over large areas. Remote Sensing of Environment, 187, 156–168.
Peter, B. (2001). Vitrified dung in archaeological contexts: An experimental study on the process of its formation in the Mosu and Bobirwa areas. Pula: Botswana Journal of African Studies, 15, 125–143.
Phillipson, D. W. (2005). African archaeology. Cambridge: Cambridge University Press.
Pwiti, G. (2005). Southern Africa and the east African coast. In A. B. Stahl (Ed.), African archaeology: A critical introduction (pp. 378–391). Malden: Blackwell Publishing.
Reeves, D. M. (1936). Aerial photography and archaeology. American Antiquity, 2, 102. https://doi.org/10.2307/275881.
Reid, S. H. (2016). Satellite remote sensing of archaeological vegetation signatures in coastal West Africa. African Archaeological Review, 33, 163–182.
Reid, A., & Segobye, A. (2000). Politics, society and trade on the eastern margins of the Kalahari. Goodwin Series, 8, 58–68.
Rodriguez-Galiano, V. F., & Chica-Rivas, M. (2014). Evaluation of different machine learning methods for land cover mapping of a Mediterranean area using multi-seasonal Landsat images and digital terrain models. International Journal of Digital Earth, 7, 492–509. https://doi.org/10.1080/17538947.2012.748848.
Rodriguez-Galiano, V. F., Ghimire, B., Rogan, J., Chica-Olmo, M., & Rigol-Sanchez, J. P. (2012). An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS Journal of Photogrammetry and Remote Sensing, 67, 93–104.
Rosenfield, G. H., & Fitzpatrick-Lins, K. (1986). A coefficient of agreement as a measure of thematic classification accuracy. Photogrammetric Engineering and Remote Sensing, 52, 223–227.
Sadr, K. (2016). The impact of coder reliability on reconstructing archaeological settlement patterns from satellite imagery: A case study from South Africa. Archaeological Prospection, 23, 45–54.
Sadr, K., & Rodier, X. (2012). Google Earth, GIS and stone-walled structures in southern Gauteng, South Africa. Journal of Archaeological Science, 39, 1034–1042.
Schmidt, P. R., & McIntosh, R. J. (Eds.). (1996). Plundering Africa’s past. Bloomington: Indiana University Press.
Schmidt, K. S., & Skidmore, A. K. (2003). Spectral discrimination of vegetation types in a coastal wetland. Remote Sensing of Environment, 85, 92–108.
Schuetter, J., Goel, P., McCorriston, J., Park, J., Senn, M., & Harrower, M. (2013). Autodetection of ancient Arabian tombs in high-resolution satellite imagery. International Journal of Remote Sensing, 34, 6611–6635. https://doi.org/10.1080/01431161.2013.802054.
Sesnie, S. E., Finegan, B., Gessler, P. E., Thessler, S., Bendana, Z. R., & Smith, A. M. S. (2010). The multispectral separability of Costa Rican rainforest types with support vector machines and Random Forest decision trees. International Journal of Remote Sensing, 31, 2885–2909. https://doi.org/10.1080/01431160903140803.
Shao, Y., & Lunetta, R. S. (2012). Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points. ISPRS Journal of Photogrammetry and Remote Sensing, 70, 78–87.
Sharafi, S., Fouladvand, S., Simpson, I., & Alvarez, J. A. B. (2016). Application of pattern recognition in detection of buried archaeological sites based on analysing environmental variables, Khorramabad Plain, West Iran. Journal of Archaeological Science: Reports, 8, 206–215. https://doi.org/10.1016/j.jasrep.2016.06.024.
Shaw, T., Andah, B., Okpoko, A., & Sinclair, P. (Eds.). (1993). The archaeology of Africa: Food, metals and towns. London: Routledge.
Sibanda, M., Mutanga, O., & Rouget, M. (2016). Discriminating rangeland management practices using simulated HyspIRI, Landsat 8 OLI, Sentinel 2 MSI, and VENμS spectral data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 9, 3957–3969.
Smith, B. (2012). Helping to defend Africa’s heritage: The African Archaeological Conservation Advisory Committee of the PanAfrican Archaeological Association. Azania: Archaeological Research in Africa, 47, 379–380. https://doi.org/10.1080/0067270X.2012.711574.
Sørensen, L. K., & Dalsgaard, S. (2005). Determination of clay and other soil properties by near infrared spectroscopy. Soil Science Society of America Journal, 69, 159–167.
Stahl, A. B. (1994). Innovation, diffusion, and culture contact: The Holocene archaeology of Ghana. Journal of World Prehistory, 8, 51–112.
Stevens, A., Udelhoven, T., Denis, A., Tychon, B., Lioy, R., Hoffmann, L., & Van Wesemael, B. (2010). Measuring soil organic carbon in croplands at regional scale using airborne imaging spectroscopy. Geoderma, 158, 32–45.
Thabeng, O. L., Merlo, S., & Adam, E. (2019). High-resolution remote sensing and advanced classification techniques for the prospection of archaeological sites’ markers: The case of dung deposits in the Shashi-Limpopo Confluence area (southern Africa). Journal of Archaeological Science, 102, 48–60. https://doi.org/10.1016/j.jas.2018.12.003.
Thy, P., Segobye, A. K., & Ming, D. W. (1995). Implications of prehistoric glassy biomass slag from east-central Botswana. Journal of Archaeological Science, 22, 629–637.
Ustuner, M., Sanli, F. B., & Dixon, B. (2015). Application of support vector machines for landuse classification using high-resolution RapidEye images: A sensitivity analysis. European Journal of Remote Sensing, 48, 403–422.
Verhagen, P., & Whitley, T. G. (2012). Integrating archaeological theory and predictive modeling: A live report from the scene. Journal of Archaeological Method and Theory, 19, 49–100. https://doi.org/10.1007/s10816-011-9102-7.
Verrelst, J., Rivera, J. P., Moreno, J., & Camps-Valls, G. (2013). Gaussian processes uncertainty estimates in experimental Sentinel-2 LAI and leaf chlorophyll content retrieval. ISPRS Journal of Photogrammetry and Remote Sensing, 86, 157–167. https://doi.org/10.1016/j.isprsjprs.2013.09.012.
Vogel, J. C., & Calabrese, J. A. (2000). Dating of the Leokwe Hill site and implications for the regional chronology. Goodwin Series, 8, 47–50.
Wei, C.-L., Rowe, G. T., Escobar-Briones, E., Boetius, A., Soltwedel, T., Caley, M. J., Soliman, Y., Huettmann, F., Qu, F., & Yu, Z. (2010). Global patterns and predictions of seafloor biomass using random forests. PLoS One, 5, e15323.
Wilson, C. A., Davidson, D. A., & Cresser, M. S. (2008). Multi-element soil analysis: An assessment of its potential as an aid to archaeological interpretation. Journal of Archaeological Science, 35, 412–424.
Wu, J., Liu, H., Duan, X., Ding, Y., Wu, H., Bai, Y., & Sun, X. (2008). Prediction of DNA-binding residues in proteins from amino acid sequences using a random forest model with a hybrid feature. Bioinformatics, 25, 30–35.
Yu, L., Liang, L., Wang, J., Zhao, Y., Cheng, Q., Hu, L., Liu, S., Yu, L., Wang, X., Zhu, P., Li, X., Xu, Y., Li, C., Fu, W., Li, X., Li, W., Liu, C., Cong, N., Zhang, H., Sun, F., Bi, X., Xin, Q., Li, D., Yan, D., Zhu, Z., Goodchild, M. F., & Gong, P. (2014). Meta-discoveries from a synthesis of satellite-based land-cover mapping research. International Journal of Remote Sensing, 35, 4573–4588. https://doi.org/10.1080/01431161.2014.930206.
Zhu, G., & Blumberg, D. G. (2002). Classification using ASTER data and SVM algorithms: The case study of Beer Sheva, Israel. Remote Sensing of Environment, 80, 233–240.

Acknowledgments

We thank Inos Dhau, Tshekiso Kgosietsile, and Andani Gangashe for their assistance during data collection. We are also grateful to SANParks for allowing access to the Mapungubwe National Park, and to the De Beers Group (through Duncan MacFadyen) for allowing access to the Venetia Nature Reserve and use of its research facility. We are further grateful to Prof. Thomas Huffman for making his data available and for devoting his time to guiding us through the study area. Special thanks go to Lesego Madisha (former archaeologist at SANParks, Mapungubwe) for her kind assistance, and to SANParks Cultural Heritage Manager Crispen Chauke and the Venetia Nature Reserve staff for their help.

Funding

This study was funded by the University of Botswana training department and the University of the Witwatersrand.

Author information

Authors and Affiliations

Archaeology Unit, University of Botswana, Plot 4775 Notwane Rd, Gaborone, Botswana
Olaotse L. Thabeng
School of Geography, Archaeology and Environmental Studies, University of the Witwatersrand, Johannesburg, 2050, South Africa
Olaotse L. Thabeng, Stefania Merlo & Elhadi Adam

Corresponding author

Correspondence to Olaotse L. Thabeng.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Archaeological time period: Tenth to nineteenth century AD

Country and region discussed: Botswana, South Africa, and Zimbabwe, Southern Africa

About this article

Cite this article

Thabeng, O.L., Merlo, S. & Adam, E. From the Bottom Up: Assessing the Spectral Ability of Common Multispectral Sensors to Detect Surface Archaeological Deposits Using Field Spectrometry and Advanced Classifiers in the Shashi-Limpopo Confluence Area. Afr Archaeol Rev 37, 25–49 (2020). https://doi.org/10.1007/s10437-020-09372-z

Published: 11 March 2020
Issue Date: March 2020
DOI: https://doi.org/10.1007/s10437-020-09372-z
