Review article
Basic Mapping Principles for Visualizing Cancer Data Using Geographic Information Systems (GIS)

https://doi.org/10.1016/j.amepre.2005.09.007Get rights and content

Abstract

Maps and other data graphics may play a role in generating ideas and hypotheses at the beginning of a project. They are useful as part of analyses for evaluating model results and then at the end of a project when researchers present their results and conclusions to varied audiences, such as their local research group, decision makers, or a concerned public. Cancer researchers are gaining skill with geographic information system (GIS) mapping as one of their many tools and are broadening the symbolization approaches they use for investigating and illustrating their data. A single map is one of many possible representations of the data, so making multiple maps is often part of a complete mapping effort. Symbol types, color choices, and data classing each affect the information revealed by a map and are best tailored to the specific characteristics of data. Related data can be examined in series with coordinated classing and can also be compared using multivariate symbols that build on the basic rules of symbol design. Informative legend wording and setting suitable map projections are also basic to skilled mapmaking.

Introduction

A geographic information system (GIS) allows epidemiologists and cancer researchers to investigate spatial patterns within their data and understand relationships between cancer and other health, socioeconomic, and environmental variables. High-quality maps also allow researchers to present a compelling case to others who are interested in their work. GIS is an additional tool in the exploration, analysis, and communication of cancer data, and knowledge of the basic principles for representing data can help cancer researchers make the most of GIS and the opportunities for insight it offers.

This article is structured in three sections: mapping methods, mapping multiple variables, and map finishing. Two common symbol types, choropleth mapping and proportional symbols, are featured, and decisions involved in making effective use of these symbols are summarized. Supporting figures present maps of prostate cancer data to correspond with the topic of this special issue of the American Journal of Preventive Medicine. These maps were produced in ArcGIS (ESRI, Redlands CA, version 9) with no further augmentation in illustration software. The data and geography used for these maps are from the National Cancer Institute (NCI) Cancer Mortality Maps and Graphs Website.1 The available years of prostate cancer mortality data are mapped (aggregations for 1950–1994), drawing from the counts, rates per 100,000 person-years (age adjusted using 1970 populations), and upper and lower bounds of 95% conference interval (CI) offered on the site for black and white races. The map area is cropped to produce compact demonstration figures that can be compared in series. The data are freely available through the NCI website to other mapmakers who would like to work with the methods described.

The basic overview of thematic mapping offered in this article has wide application in cancer and epidemiologic mapping. Other tools in GIS are also of use to epidemiologists, such as address geocoding and network analysis. The links between spatial statistics software tools and GIS are also improving.2 The focus of this short article, however, is limited to symbolizing statistical data, which is a common use of GIS. Basic criteria for choosing symbols to map derived values, significance levels, model results, and smoothed rates are the same as for simpler measures such as crude rates. Likewise, multivariate maps that combine or overlay model results with original data or related variables can illuminate relationships between them by combining symbolization approaches.

Cartographers use visual tools, and epidemiologists use statistical tools to investigate their data. This is an oversimplification, to be sure, but it seems to be a core difference in approach between the two fields and each could be enhanced by further use of the other’s methods. The tools cartographers use to improve their visual representations of data can complement epidemiologists’ sophisticated adjustments for potentially spurious rates and small numbers. For example, cartographers adjust class breaks when mapping a given data set. Epidemiologists adjust the data while mapping with a given classing algorithm without adjustment, such as quantiles. Epidemiologists question the data; cartographers question the symbolization. This contrast is exaggerated in the hope that this brief introduction will encourage epidemiologists to expand their insights from data by expanding their approaches to data representation.

Section snippets

Mapping Methods

A basic characteristic of cancer data that guides choice of a map symbol is whether categories or quantities are recorded. Categorical differences in cancer may be case/control or benign/malignant. They may code race differences (Figure 1) as well as many other socioeconomic categories. Quantitative data may be counts, ranks, or derived values such as rates and percentages (Figure 2).

Symbols that show categorical differences well are color hue and symbol shape. Symbols well suited to

Mapping Multiple Variables

Improved representations of data and relationships between cancer variables may be revealed by overlaying symbols for one variable onto those of another; mapping related variables as a series; mapping differences, modeled indices, residuals, or other derived values; and mapping with symbols that combine variables. Examples of these approaches are described in this section.

Map Finishing

Guidelines for mapmakers often do not discuss the seemingly trivial issues of map titles and legend titles or issues of file export. The look and credibility of final maps are also affected by selecting a map projection suited to the area mapped. These are keys to making a map that can be understood, be presented in multiple media, and be distributed to a wide audience.

Conclusion

Maps and other data graphics can play a role in generating ideas and hypotheses at the beginning of a project. They are useful as part of analyses for evaluating model results and then at the end of a project when researchers present their results and conclusions to varied audiences, such as their local research group, decision makers, or a concerned public. Cancer researchers are gaining skill with GIS mapping as one of their many tools and are broadening the symbolization approaches they use

References (26)

  • Cancer mortality maps and graphs

    (1999)
  • L. Wiggins

    Using geographic information systems technology in the collection, analysis, and presentation of cancer registry dataa handbook of basic practices

    (2002)
  • T.A. Slocum et al.

    Thematic cartography and geographic visualization

    (2005)
  • F.P. Boscoe et al.

    Choosing geographic units for choropleth rate maps, with an emphasis on public health applications

    Cartogr Geogr Inf Sci

    (2003)
  • C.A. Brewer et al.

    ColorBrewer in printA catalog of color schemes for maps

    Cartogr Geogr Inf Sci

    (2003)
  • M.A. Harrower et al.

    ColorBrewer.organ online tool for selecting colour schemes for maps

    Br Cartogr Soc

    (2003)
  • C.A. Brewer et al.

    Mapping mortalityevaluating color schemes for choropleth maps

    Ann Assoc Am Geogr

    (1997)
  • C.A. Brewer

    Spectral schemesControversial color use on maps

    Cartogr Geogr Inf Sci

    (1997)
  • J.M. Olson et al.

    An evaluation of color selections to accommodate map users with color-vision impairments

    Ann Assoc Am Geogr

    (1997)
  • C.A. Brewer

    Guidelines for selecting colors for diverging schemes on maps

    Br Cartogr Soc

    (1996)
  • C.A. Brewer et al.

    Evaluation of methods for classifying epidemiological data on choropleth maps in series

    Ann Assoc Am Geogr

    (2002)
  • C.A. Brewer

    Reflections on mapping Census 2000

    Cartogr Geogr Inf Sci

    (2001)
  • M.T. Gastner et al.

    Maps and cartograms of the 2004 U.S. presidential election results

    (2004)
  • Cited by (93)

    • “We only have the one”: Mapping the prevalence of people with high body mass to aid regional emergency management planning in aotearoa New Zealand

      2020, International Journal of Disaster Risk Reduction
      Citation Excerpt :

      Disaster planning requires a good understanding of the geographical dimensions, boundaries, lifelines and important facilities [64] with advances in health data it is also possible to include key human health geography. The promise of GIS mapping include the potential to reach a broad array of audiences, including health planners, policymakers, advocacy groups, and an interested public [65]. Although this movement promotes creative means of analysis and identification of at-risk populations for planners and researchers, such accessibility may pose dilemmas relating to labelling populations living in particular geographic locales [40–42].

    • Spatial distribution of agricultural pesticide use and predicted wetland exposure in the Canadian Prairie Pothole Region

      2020, Science of the Total Environment
      Citation Excerpt :

      The classes were optimized to illustrate the spatial distribution of each pesticide group based on their specific frequency distributions. For example, for fungicides we used quantile distribution since the distribution was left skewed, whereas k-means algorithm was used for insecticides and herbicides, to reduce the variability within classes and maximize the variance among classes (Brewer, 2006). The open source software R, version 3.5 (R Core Team, 2019) was used for mapping and the R package classInt (Bivand, 2018) for the classification.

    View all citing articles on Scopus
    View full text