Introduction

It is generally believed that the scale of the data to be used is a critical consideration in an analysis. Levin (1992) argues that scale of observation is a filter for processes. At different scales of observation, different processes are observed. This hypothesis is particularly well supported when measured data, such as geophysical measurements, are used as the primary predictor. Nykanen and Raines (2007) used aeromagnetic data from Finland with a line spacing of 200 m for an analysis of scale in predicting the long, thin gold deposits in a greenstone belt. By selecting increasingly wider line spacing, they demonstrated that the correlation with the deposits decreased in a manner consistent with conventional wisdom of the properties of scale. Thus, as the line spacing increased (resulting in a smaller map scale), the correlation with the gold deposits decreased. In another study, Raines and Mihalasky (2002) used the 1:2,500,000-scale King and Beikman geology of the United States (King and Beikman, 1974) to predict the location of porphyry deposits, and found a good correlation with resource tracts defined by the 1:1,000,000-scale USGS porphyry assessment of the United States (U.S. Geological Survey National Mineral Assessment Team, 1998). It should be pointed out that King and Beikman made a point to differentiate felsic magmatic domains, which are believed to be associated spatially with the felsic porphyry intrusions that typically host porphyry copper deposits. The USGS assessment was based on 1:500,000-scale geology of the individual states and other larger scale information and was published for use at a scale of 1:1,000,000.

The differing results of these two studies indicate that the utility of the scale of map information cannot be separated easily from differences in map information content portrayed at different scales. For geologic map, only an insignificant decrease in correlation with the USGS US porphyry tracts occurs with a large decrease in map scale; whereas, for a decrease in scale for geophysical measurements, correlation decreases with each decrease in scale. This poses the question: what is different about geologic maps that lead to the apparent contradiction in the properties of scale? The answer seems to be that geologic maps are illustrations of geologic relationships rather than simple, scale-dependent measurements. Depending on the type of geologic map, small-scale maps may be effective in capturing the important information elements for a specific analysis, despite the expected inadequacy of scale. This is not to imply that such map could replace the large-scale maps that are clearly required to address more focused and spatially rigorous problems such as where to drill the next exploration hole. In fact, it should be noted that maps of the scales discussed here (1:1,000,000 and smaller) generally are compilations from larger scale maps. Whatever the intent of the compiler, the resultant map is dependant on the high-quality observations recorded at the larger scale so that the pertinent geologic relationships can be preserved at the smaller scale, despite the reduction in spatial accuracy. Clearly, large-scale maps are essential for some tasks; however, the objectives of a national or even global assessment of a mineral resource include defining the regional extent and locations where the resource is likely to occur. For this application, small-scale maps may prove surprisingly effective.

In this analysis, two global geologic maps prepared for different purposes at differing scales were used to evaluate the interplay between map scale and information content in spatial analysis. The geologic maps at 1:10,000,000- and 1:35,000,000-scale were used as predictors of porphyry deposits. The Mutschler, Ludington, and Bookstorm (1999) compilation of global giant copper camps is used as training points for simple weights-of-evidence (WofE) models based on a single evidence layer, a multiclass generalization of the geologic map. The global assessment tracts derived from these two maps using similar WofE techniques are compared in this report to the USGS porphyry tracts of the US to evaluate the spatially modeling effects of differing scales and information content.

The WofE method used in this study (Bonham-Carter, 1994) is an objective data-driven analysis technique that can be used to compare the prediction utility of differing spatial data sets. In this example, two different sets of geologic map units, compiled at different scales, will be used to predict porphyry copper deposit locations. By considering the confidence measures of the contrast from the WofE method, the geologic map units can be generalized into multiple, statistically significance classes as discussed in Raines and Bonham-Carter (2006). This approach creates a multiclass posterior probability map of the spatial occurrence of the training points. This multiclass posterior probability map then can be compared to the previously defined USGS porphyry copper tracts of the United States (USGS National Assessment Team, 1998) to measure the prediction utility of the differing geologic maps compiled at different scales. For the analysis conducted here, the two geologic map compilations were the Exxon Tectonic Map of the World, which is referred to as the Tectonic map in this report and has a map scale of 1:10,000,000 and the Geological Survey of Canada Felsic/Intermediate Igneous Rock Map, which is referred to as the F/I Igneous map in this report and has a map scale of 1:35,000,000 scale. As evident in the descriptions of the maps and their sources below, these maps were generated not only at different scales, but were generated using vastly different conceptual principles. The Tectonic Map focused on representing the distribution of the world’s sedimentary basins, whereas the F/I Igneous map was generated to illustrate the distribution of porphyry-related igneous rocks. Thus, the map’s scale and conceptual framework are confounded in this WofE analysis.

Exxon Tectonic Map (1:10,000,000-scale Tectonic Map)

The Exxon Tectonic Map of the World was compiled in 1985 by the World Mapping Project as a cooperative effort with Exxon affiliates to show the tectonic setting of the world’s sedimentary basins and its oil and gas deposits. Sources of information included published and unpublished maps and reports of Exxon affiliates, as well as public domain data. The project was carried out under the principal direction of J. M. Widmier (Basin Exploration Division), A. R. Creen (Basin Systems Section) and K. T. Biddle (Basin Analysis Section). In 1994, Exxon Production Research Company donated the Tectonic Map of the World to the American Association of Petroleum Geologists Foundation and the map was made publicly available. In 1995, the 1:10,000,000 scale maps were provided to Geologic Data Systems for digital conversion, where the maps were enlarged and digitally compiled at a scale of 1:5,000,000. Polygon, line, and point attribute information was captured for use in a GIS environment, bathymetric contours were updated using the current GEBCO (General Bathymetric Chart of the Ocean) data, and index hypsographic contours were added from the DCW (Digital Chart of the World). The data used in this study were provided by Geologic Data Systems, who hold exclusive rights for the distribution of the digital Tectonic Map of the World.

Felsic/Intermediate Igneous Rock Map (1:35,000,000-scale)

The 1:35,000,000-scale Felsic/Intermediate Igneous Rock Map (F/I Igneous map) prepared at the Geological Survey of Canada and used in this investigation depicts the generalized global distribution of felsic and intermediate intrusive and volcanic rocks, subdivided by age at the era level, against a backdrop of unclassified Precambrian and Phanerozoic age domains (Chorlton, 2004a). The map is the result of an effort to identify magmatic tracts potentially related to porphyry deposits and is a secondary thematic data set derived from the Generalized Geology of the World (GGW) GIS data set and linked rock unit attribute database (Chorlton, 2004b). This data set contains bedrock unit information from geological compilation maps of between 1: 2,000,000 and 1:5,000,000 paper scales, supplemented by other regional thematic map and literature sources. Its attribute database was designed to accommodate the most significant parameters for each geological domain, which include start and end ages, lithologic components in three levels of detail, stratigraphic or lithologic unit and geological province names, and tectonic settings. Emphasis was placed primarily on populating the age ranges and highest levels of lithologic detail, and secondarily on capturing information related to magmatic composition to compliment databases of magma-related mineral deposits (Chorlton, Sinclair, and Laramée, 2002).

The spatial components of the GGW are known to be imprecise. The accuracy of many paper map sources was poor, and intricate spatial detail in source data sets considered relatively accurate was generalized for consistency. To hide small spatial features or clusters of features for most uses, but retain them in easy-to-visualize form for special purposes, advantage was taken of the ability to store more than one record set (unit) per GIS feature. The small features or clusters were represented as single exaggerated polygons, and a default record set representing the enclosing geological domain was appended to the database. One or more extra record sets then were added so that the same feature could be retrieved when the database is queried for the special interest component. These exaggerated small features, therefore will be spatially over-represented in the map, an effect that occurs in manually prepared thematic compilations where exaggeration has been applied.

To create the digital map of the global distribution of felsic or intermediate intrusive and volcanic rocks subdivided by age, new attribute fields were appended to copies of the GGW vector data (polygon coverages) and then populated with values based on the results of sequential database queries to create a flat file data set. All possible record sets were included in these queries, not just defaults. For domains containing both intrusive and extrusive magmatic components of the appropriate composition, extrusive components took precedence in the output because of the order in which queries were applied. Arcs separating identically classified polygons were dropped to create the final, simplified output, the purpose of which was to provide visual context for the world distribution of major intermediate-felsic magma-related deposits, principally porphyry copper deposits.

Model Definitions

The intent of the modeling reported here was to use the geologic maps to make simple multiclass predictive models of the distribution of porphyry deposits. All calculations were done in ArcSMD (Kemp and others, 2001). The 283 training points used are “significant” copper camps of the globe that are defined in Mutschler, Ludington, and Bookstrom (1999). No attempt has been made in these simple models to differentiate porphyry deposits by type. Of the 283 training points, 221 were classified as porphyry deposits, and most of the others are of types usually associated with porphyry deposits. A WofE model (Bonham-Carter, 1994) provides a measure of correlation with the training points, designated as the contrast, and a measure of the confidence in the contrast, designated as the Studentized contrast (contrast divided by its standard deviation). This confidence measure is a test that the reported contrast is not equal to zero. Map units having a significant contrast are summarized in Tables 1 and 2. Each of these listed map units were used in the model and assigned a distinct weight in the conventional weights analysis in the WofE model (Raines and Bonham-Carter, 2006). For the Tectonic Map, a confidence level of approximately 85% was considered significant. For the F/I Igneous Map, most of the units attained a confidence level of 85% except for two igneous rock domains that individually had a weak association with a low confidence level. By combining these two map units, an association with approximately 60% confidence was attained. All other map units in both maps are generalized into one class that is not associated with copper camps. For this global analysis these minimum levels of confidence were arbitrarily considered sufficient in the context that the final response themes will be grouped into two classes for comparison to the US assessment, and thus other groupings with higher levels of confidence were not investigated. Examples from the resulting models are shown in Figure 1.

Table 1 Generalization of the Tectonic map with Regards to Porphyry Deposits
Table 2 Generalization of the F/I Igneous Map with Regards to Porphyry Deposits
Figure 1
figure 1

Comparison of Tectonic-based (Left) with F/I Igneous-based (Right) model for western hemisphere (A and B) and for globe (Tectonic-based, D, and F/I Igneous-based, E), which have same color scheme as A and B. F/I Igneous-based model defines many small areas, not well displayed in these small image. For comparison, C shows USGS porphyry tracts of United States from U. S. Geological Survey National Mineral Assessment Team (1998) with training points as black dots. In C, red is permissive and white is nonpermissive

Comparison of Models

In order to evaluate these models by comparison with the USGS porphyry tracts of the United States (Fig. 1C), it is necessary to reduce the multiclasses to binary, permissive and nonpermissive, classes as used for the USGS porphyry tracts. This generalization of the global models consists of everything with colors warmer than blue in Figure 1, where the prior probability (Bonham-Carter, 1994) is used to define the break between permissive versus nonpermissive areas. There are several ways to compare the Tectonic and F/I Igneous models with the USGS tracts. The maps can be compared directly, as shown in Table 3, where it can be seen that the Tectonic map slightly underestimated the permissive areas in relation to the USGS tracts. The F/I Igneous map slightly underestimated the permissive areas. The agreement between the Tectonic map model and the USGS tracts is slightly less than that reported by the Raines and Mihalasky (2002) for the model derived from the King and Beikman 1:2,500,000-scale geologic map. Whereas, the agreement between the F/I Igneous and USGS tracts is slightly better that reported by Raines and Mihalasky (2002). This increase in agreement with decreasing scale also is reflected in Jaccard’s Similarity Coefficient (shown as Jaccard’s C in Table 3). Jaccard’s Similarity Coefficient disregards the match of the nonpermissive areas, so it is less influenced by the rarity of training sites that inflates the agreement statistic (Bonham-Carter, 1994). Also interesting is that the kappa statistic reported by Raines and Mihalasky (2002) is intermediate between the Tectonic (lowest kappa) and the F/I Igneous (highest kappa). Kappa is significant because it makes an adjustment for the expected correlation, which is particularly relevant when concerned with rare events such as mineralization. These kappa values are considered “fair agreement” by the criteria of Landis and Koch (1977), but in the author’s experience in spatial applications such kappa values are substantial, Thus the anticipated decrease in correlation with decreasing map scale does not occur.

Table 3 Correlation of the Small-Scale Models with the USGS Tracts of the United States

Another way to evaluate the differences between the Tectonic and F/I Igneous map models versus the USGS tracts is to compare the rankings of states of the United States (Fig. 2). The states are ranked by percent permissive area. The coefficient of explanation, R 2, for the Tectonic map versus USGS tracts is considerably lower than that reported by Raines and Mihalasky (2002), but for the F/I Igneous map model the R 2 is the same reported by Raines and Mihalasky (2002). It has been suggested that this ranking test result is simply a consequence of the size of the states. This was tested by using the states as evidence and ranking them by the association with the training points. The ranking defined by states as evidence had no correlation with the rankings from the porphyry models. The King and Beikman model from Raines and Mihalasky (2002) had the highest R 2 of 0.26 and the others had R 2 less than 0.1 with the states model. Thus, this test also fails to show the anticipated decrease in correlation with decreasing map scale.

Figure 2
figure 2

Comparison of states of United States ranked by area of permissive terranes by different models. Generally, ranking of states is highly correlated with that of USGS tracts, two models reported here are highly correlated, but correlation does not decrease with decreasing scale. Bottom-right plot derives from Raines and Mihalasky (2002). Those states with no permissive areas are excluded from these plots because inclusion of such states simply inflated the correlation with no meaningful change. Linear regression lines and coefficient of explanation are shown on each graph

Conclusions

On the basis of the analyses in Raines and Mihalasky (2002), which used the King and Beikman geologic map of 1:2,500,000 scale, and the analyses presented here of the 1:10,000,000-scale Tectonic map and the 1:35,000,000-scale F/I Igneous map models, the utility of a geologic map for predicting porphyry deposits cannot be correlated directly with the map scale. Indeed, in this study, the smallest scale map correlates best by all measures with the USGS tracts. It would seem that the conceptual focus, or theme, of the geologic map is as, or more, critical than the scale. In geophysical maps, as reported by Nykanen and Raines (2007), and probably in other maps portraying measured data, map scale is truly a measure of the size of features that can be represented. Geologic maps are illustrations of geologic relationships and the map maker subjectively selects the information to portray, sometimes exaggerating important features, in order to convey a vision or representation of features. The approach is somewhat analogous to the way a state or national highway map exaggerates the land area occupied by roadways. The conceptual framework of a geologic map is as critical as map scale in determining the utility of a geologic map for some applications. We have demonstrated that geologic maps of small scale can provide useful information, despite the decrease in spatial accuracy, for regional mineral tract definition on national and global scales. The utility of a particular geologic map for a particular application is critically dependent on the alignment of the intention of the map maker with the application, regardless of map scale.