SCALE

 

Objective

This research priority calls attention to the multidisciplinary issues related to spatial as well as spatio-temporal scale. The objective is to clarify and assess what scale and scale effects are, and through recognizing the fundamental existence of scale effects, determine how one can realistically approach and mitigate the scale effects. Primary issues therefore center on gaining a better understanding of how scale affects human perception; how to effectively and efficiently measure and characterize scale; how to use scale information in judging the fitness of data for a particular use; how to automate scale change and simultaneously represent data at multiple scales; how scale and change in scale affect information content, analysis, and conclusions about patterns and processes.

Background

Scale is not a new issue, nor is concern restricted to geographic information scientists. Scale variations have long been known to constrain the detail with which information can be observed, represented, analyzed, and communicated. Changing the scale of data without first understanding the effects of such action can result in the representation of processes or patterns that are different from those intended. For example, research has shown that reducing the resolution of a raster land cover map (going to larger cells) can increase the dominance of the contiguous classes, but decrease the amount of small and scattered classes (like wetlands in some locations) in the representation (Turner et al. 1989). The spatial scaling problem presents one of the major impediments, both conceptually and methodologically, to advancing all sciences that use geographic information. Likewise, temporal scaling, a separate but related issue, is not well understood and thus difficult to formalize. In an information era, a massive amount of geographic data are collected from various sources, often at different scales. Before these data can be integrated for problem solving, fundamental issues must be addressed.

Recent work on the scaling behavior of various phenomena and processes (including research in global change) has shown that many processes do not scale linearly. The implication is that in order to characterize a pattern or process at a scale other than the scale of observation, some knowledge of how that pattern or process changes with scale is needed so that the scaling process can be adjusted accordingly. Attempts to describe scaling behavior by fractals or self-affine models, which mathematically relate complexity and scale, have proven ineffective because the properties of many geographic phenomena are not strictly repeated across multiple spatial or temporal scales. Multifractals have shown some promise for characterizing the scaling behavior of some phenomena, but it is more likely that fractals will offer only a partial model. Alternative models are needed to understand the impacts that changes in scale have on the information content of databases. Examining the sensitivity of process and analytical models to scale will help scientists validate hypotheses, which in turn will improve geographic theory-building.

Despite a longstanding recognition of the implications of scale on geographic inference and decision making, many questions remain unanswered. The transition from analog (i.e., maps) to digital representations of geographic information forces users of those data to formally deal with these conceptual, technical, and analytical questions in new ways. It is easy to demonstrate by isolated example that scale poses constraints and limitations on geographic information, spatial analysis, and models of the real world. The challenge is to articulate the conditions under which scale-imposed constraints are systematic and to develop geographic models that compensate or standardize scale-based variation. Mishandling or misunderstanding scale can bias inference and reasoning and ultimately affect decision-making processes. New types of analyses, for example the Geographical Analysis Machine (GAM) proposed by Openshaw et al. (1987), may offer methods that are less sensitive to scale than traditional quantitative techniques.

The widespread adoption of geographic information systems (GISs) contributes to the scale problem, but it may offer solutions as well. GISs facilitate data integration regardless of scale differences. This is a problem whenever we try to use coarse aggregate data (like statewide or countywide data), and especially when we try to compare those data with less coarse or disaggregate data (like information by census tract or by individual survey). The capability to process and present geographic information "up" and "down" local-, regional-, and global-scale ranges has been advocated as a solution to understanding the global systems of both natural (e.g., global climate change) and societal (e.g., global economy) processes and the relationships between the two. Fundamental scale questions will benefit from coordinated research efforts among geographic information scientists with various interests and domain experts. Information systems of the future can sensitize users to the implications of scale dependence and provide scale management tools once we develop alternative models of scale behavior, an improved qualitative understanding of the effects of scale, novel methods for describing the scale of data, and intelligent automation methods for changing scale.

Ongoing research

Issues of scale affect nearly every GIS application and involve questions of scale cognition, the scale or range of scales at which phenomena can be easily recognized, optimal digital representations, technology and methodology of data observation, generalization, and information communication. These are very different types of questions. Effective research in the area of scale will require interdisciplinary efforts of geographers, spatial and/or geostatisticians, cartographers, remote sensing specialists, domain experts, cognitive scientists, and computer scientists. Research on scale is under way in geography (Hudson 1992), remote sensing (Quattrochi and Goodchild 1997), cartography (Buttenfield and McMaster 1991), spatial statistics (Wong and Amrhein 1996), hydrology (Sivapalan and Kalma 1995), and ecology (Ehleringer and Field 1993) among other areas. Scale research in many institutes, agencies, and in the private sector began in an ad hoc fashion. Motivated both by practical needs as well as theoretical development, recent attention is focused on formalizing the study of scale, on developing theory, and on exploring robust methods for information representation, analysis, and communication across multiple scales.

Research Priorities

It has become clear that global and regional processes have implications for local places and that individual and local decisions collectively have global and regional implications. Therefore, scientific information about global and regional patterns and processes must be understood on a local level and vice versa. As the policy-making and scientific communities come to terms with these relationships, systematic understanding about spatial and temporal variations in scale gain importance. Geographical information plays an ever larger role as we move to an increasingly automated information economy. Our understanding of scale and the management of data at various scales must keep pace. Ultimately data and information must inform and must produce better decisions.

  • Definitions of scale concepts. There has been much confusion and misuse of the term "scale." Lam and Quattrochi (1992) set a basis for clarifying the term. More thorough work will help clarify all connotations of scale in a multidisciplinary context. These connotations are inherently related. A good understanding of the conceptual relationship between these connotations should benefit the geographic information users in conjunction with other standardization efforts in geographic information use.

     

  • Systematized bases for scale-related decision making. Basic research on the effects of scale on information content and loss will yield practical information for the many users of geographical information. We need to ask, for example, if we lose a significant portion of our explanatory power when we represent global population trends by country as compared with representations at the level of primary divisions within countries. In an increasing array of management and policy settings, decisions about the appropriate scales of analysis are made every day. Identification of critical scales and of scale-invariant data sets or modeling procedures can make those decisions explicit and better informed.

     

  • Practical guidance on data integration and use. Often, data at suboptimal or disparate scales are the best available. Using imperfect data that are available is in many cases preferable to using no data at all, but there are implications for the validity of results. For the user community, creation of knowledge about scale provides principles to improve a data set's fitness for use and guidelines by which to discount model results when necessary. For example, we already know that analysis using aggregate data cannot be used to impute the behavior of individuals (called the ecological fallacy). A better understanding of the problem of conflation (the procedure of merging the positions of corresponding features in different data layers) will enable the fusing of data sets, produced at different scales or produced at the same scale, from different sources. Conflation currently poses a serious impediment in the map overlay process, a critical component of GIS. A better understanding of conflation is also necessary for the integration of data sets produced by different agencies. 

     

  • New methods for quantifying and compensating for the effects of scale in statistical and process models. Scientists and land managers apply a variety of analytical tools to answer geographic questions. Many existing methods do not allow the user to adjust or compensate for the effects of the data scale on the analysis results. Methods that are sensitive to scale will allow the inclusion of scale correctives, much like the correctives that compensate for inflation of the significance of statistical relationship in the presence of spatial autocorrelation.

 

  • Intelligent automated generalization methods. GIS tools can be expanded to provide users with methods for intelligently changing the scale of their data. Basic research is still needed to understand how scale changes are perceived, and this in turn can inform interface design. Intelligent generalization will permit the encoding of raw observations in digital form to derive more responsive, application-specific archived representations.

  

  • Improved understanding of cognitive issues of scale. Many scale questions involve human cognition (i.e., how humans perceive the world and information representing it). This issue is explicit especially during human-computer interaction and must be dealt with technically during interface development. It ultimately affects a chain of decisions. Basic research lays a foundation for answering many conceptual and technical questions about the proper use of spatial and temporal scales in geographic information processing.

 Potential projects:

 

  • Develop more consistent definitions of scale and its associated attributes that can be "uniformly" understood or perceived across multiple disciplines. The challenge here is how to make the "standardized" lexicon be adopted and utilized multidisciplinarily.

 

  • Encourage the development of a scale analysis module to be included in major environmental analytical and measurement methods, so that sensitivity analysis, robustness testing, and the effects of scale on the analytical findings can be assessed.

 

  • It is important to point out that many existing scale studies rely heavily on the re-sampling methods to generate multi-scale data for analysis, and as such, the findings on the scale effects are not really due to the scale effects, but rather they are an artifact attributable to the use of different re-sampling methods (Weigel, 1996). The effects of re-sampling methods on the scale studies are an aspect requiring further research.

 

  • More work need to be done to identify and develop efficient spatial/geostatistical techniques for assessing and characterizing the scale effects. This include refinement of existing techniques, such as fractals, spatial autocorrelation, Shannon index, geographical variance, and local variance, as well as exploration of new techniques such as wavelet analysis, local fractals, and multifractals (Hou, 1998). The indices derived from these techniques, if sufficiently discriminatory and information-rich, could be included as part of the metadata.

 

  • Identify the ranges of scale over which an encoded attribute classification scheme is valid. A project of this type will help build a better understanding of the linkages between the scale of a spatial representation and the appropriate corresponding attribute detail. For example, land cover data represented as 30-meter pixels should include more detail (i.e., the number and types of classes, a hierarchical classification scheme to aid the aggregation of classes) in the land cover classification than similar data represented as 1-kilometer pixels. Studies of these relationships in specific settings and for specific data sets (e.g., land cover classification) will be performed with the aim of building theoretical bases for understanding these relationships more generally.

     

  • Identify the optimal scales of analysis for common data sets, applications, and needs, and critical scales at which the content or structure of phenomena change suddenly (e.g., in vegetation or population). This work will involve the application of sensitivity analyses and spatial statistical tools for describing those sensitivities. Ultimately the goal is to allow users of geographic data to determine the appropriate scales prior to data collection or analysis.

 

  • Develop methods for cost-benefit analysis comparing the use of pregeneralized data (e.g., soil polygons) versus automated generalization of raw data (e.g., soil data from collection points). "Cost" could refer to data collection costs and/or to computational cycles; "benefits" can similarly serve as a metaphor for data validity; each will depend on the needs of the particular application. As automated generalization methods become readily available (many are now in commercial GIS packages), users may wish to access data in the rawest form possible as opposed to using data generalized for a specific purpose. Research of this type would have implications for data collection and archival agencies as well as users.

 

  • Develop improved methods for data generalization. A well researched, yet still unsolved, problem associated with spatial generalization is the creation of multiple versions of databases. Research in cartographic generalization has taken several directions including algorithmic design and testing, the design of models and conceptual frameworks, the application of expert systems, and the modeling of cartographic features. Thus far, most of the work in generalization has focused on what is termed "cartographic" generalization, which involves the graphical considerations associated with scale change. A second, less researched, area is in "model" generalization, by which generalization operators (simplification, smoothing, aggregation, agglomeration, and others) are applied to an original digital landscape model (DLM) in order to create secondary representations of the database, called digital cartographic models (DCM). These terms, DLM and DCM, are taken from the European cartographic literature, where a significant amount of work has been completed in this area.

 

  • Develop improved methods for incorporating knowledge about scale into metadata. We need to model descriptive data about data sets (i.e., metadata) to assess the consequences of downloading data at a finer resolution than is needed for a particular GI S application. The question is to determine if the choice of appropriate scale can be made on the basis of metadata alone. With the expansion of global computer networks and the use of those networks for geographic data transmission, efficient modes of communicating data content are needed. We need to determine which modes are most appropriate for representing scale and to evaluate their relative effectiveness. The provision of information about scale aids in the education of users on how to assess appropriate scales for a given application. Scale is clearly a fundamental component of any metadata report.

 

  • Design and develop a multi-scale database. We still lack the database management tools and associated functions needed to effectively and efficiently store multi-scale data, perform multi-scale analysis, and intelligently change scale. This project should be aimed at the development of a software product that would be useful in GIS applications.

 References

 Buttenfield, B. P., and R. B. McMaster (editors), 1991. Map Generalization: Making Rules for Knowledge Representation. New York: Longmont Scientific and Technical.

 Ehleringer, J. R., and C. B. Field (editors), 1993. Scaling Physiological Processes, Leaf to Globe. New York: Academic Press, Inc.

 Hou, R.-R., 1998. A Local-Level Approach in Detecting Scale Effects on Landscape Indices. M. S. Thesis, Louisiana State University.

 Hudson, J., 1992. Scale in space and time. In R. F. Abler, M. G. Markus, and J. M. Olson (editors), Geography's Inner Worlds: Pervasive Themes in Contemporary American Geography. New Brunswick, NJ: Rutgers University Press, pp. 280-300.

 Lam, N., and D. A. Quattrochi, 1992. On the issues of scale, resolution, and fractal analysis in the mapping sciences. The Professional Geographer 44:88-98.

 Openshaw, S., M. Charlatan, C. Wymer, and A. Craft, 1987. A Mark 1 Geographic Analysis Machine for the automated analysis of point data sets. International Journal of Geographical Information Systems, 1(4):335-358.

 Quattrochi, D. A., and M. F. Goodchild (editors), 1997. Scaling in Remote Sensing and GIS. Boca Raton, FL: CRC/Lewis Publishers, Inc.

 Sivapalan, M., and J. D. Kalma, 1995. Scale problems in hydrology: Contributions of the Robertson Workshop. Hydrological Processes 9(3/4):243-250.

 Turner, M. G., R. V. O'Neill, R. H. Gardner, and B. T. Milne, 1989. Effects of changing spatial scale on the analysis of landscape pattern. Landscape Ecology 3:153-162.

 Weigel, S. J., 1996. Scale, Resolution and Resampling: Representation and Analysis of Remotely Sensed Landscapes Across Scale in Geographic Information Systems. Ph.D. Dissertation, Louisiana State University.

 Wong, D., and C. Amrhein (editors), 1996. The Modifiable Areal Unit Problem. Special issue of Geographical Systems 3:2-3.

 

 Back to Research Priorities Revised White Papers