Review articleBasic Mapping Principles for Visualizing Cancer Data Using Geographic Information Systems (GIS)
Introduction
A geographic information system (GIS) allows epidemiologists and cancer researchers to investigate spatial patterns within their data and understand relationships between cancer and other health, socioeconomic, and environmental variables. High-quality maps also allow researchers to present a compelling case to others who are interested in their work. GIS is an additional tool in the exploration, analysis, and communication of cancer data, and knowledge of the basic principles for representing data can help cancer researchers make the most of GIS and the opportunities for insight it offers.
This article is structured in three sections: mapping methods, mapping multiple variables, and map finishing. Two common symbol types, choropleth mapping and proportional symbols, are featured, and decisions involved in making effective use of these symbols are summarized. Supporting figures present maps of prostate cancer data to correspond with the topic of this special issue of the American Journal of Preventive Medicine. These maps were produced in ArcGIS (ESRI, Redlands CA, version 9) with no further augmentation in illustration software. The data and geography used for these maps are from the National Cancer Institute (NCI) Cancer Mortality Maps and Graphs Website.1 The available years of prostate cancer mortality data are mapped (aggregations for 1950–1994), drawing from the counts, rates per 100,000 person-years (age adjusted using 1970 populations), and upper and lower bounds of 95% conference interval (CI) offered on the site for black and white races. The map area is cropped to produce compact demonstration figures that can be compared in series. The data are freely available through the NCI website to other mapmakers who would like to work with the methods described.
The basic overview of thematic mapping offered in this article has wide application in cancer and epidemiologic mapping. Other tools in GIS are also of use to epidemiologists, such as address geocoding and network analysis. The links between spatial statistics software tools and GIS are also improving.2 The focus of this short article, however, is limited to symbolizing statistical data, which is a common use of GIS. Basic criteria for choosing symbols to map derived values, significance levels, model results, and smoothed rates are the same as for simpler measures such as crude rates. Likewise, multivariate maps that combine or overlay model results with original data or related variables can illuminate relationships between them by combining symbolization approaches.
Cartographers use visual tools, and epidemiologists use statistical tools to investigate their data. This is an oversimplification, to be sure, but it seems to be a core difference in approach between the two fields and each could be enhanced by further use of the other’s methods. The tools cartographers use to improve their visual representations of data can complement epidemiologists’ sophisticated adjustments for potentially spurious rates and small numbers. For example, cartographers adjust class breaks when mapping a given data set. Epidemiologists adjust the data while mapping with a given classing algorithm without adjustment, such as quantiles. Epidemiologists question the data; cartographers question the symbolization. This contrast is exaggerated in the hope that this brief introduction will encourage epidemiologists to expand their insights from data by expanding their approaches to data representation.
Section snippets
Mapping Methods
A basic characteristic of cancer data that guides choice of a map symbol is whether categories or quantities are recorded. Categorical differences in cancer may be case/control or benign/malignant. They may code race differences (Figure 1) as well as many other socioeconomic categories. Quantitative data may be counts, ranks, or derived values such as rates and percentages (Figure 2).
Symbols that show categorical differences well are color hue and symbol shape. Symbols well suited to
Mapping Multiple Variables
Improved representations of data and relationships between cancer variables may be revealed by overlaying symbols for one variable onto those of another; mapping related variables as a series; mapping differences, modeled indices, residuals, or other derived values; and mapping with symbols that combine variables. Examples of these approaches are described in this section.
Map Finishing
Guidelines for mapmakers often do not discuss the seemingly trivial issues of map titles and legend titles or issues of file export. The look and credibility of final maps are also affected by selecting a map projection suited to the area mapped. These are keys to making a map that can be understood, be presented in multiple media, and be distributed to a wide audience.
Conclusion
Maps and other data graphics can play a role in generating ideas and hypotheses at the beginning of a project. They are useful as part of analyses for evaluating model results and then at the end of a project when researchers present their results and conclusions to varied audiences, such as their local research group, decision makers, or a concerned public. Cancer researchers are gaining skill with GIS mapping as one of their many tools and are broadening the symbolization approaches they use
References (26)
Cancer mortality maps and graphs
(1999)Using geographic information systems technology in the collection, analysis, and presentation of cancer registry dataa handbook of basic practices
(2002)- et al.
Thematic cartography and geographic visualization
(2005) - et al.
Choosing geographic units for choropleth rate maps, with an emphasis on public health applications
Cartogr Geogr Inf Sci
(2003) - et al.
ColorBrewer in printA catalog of color schemes for maps
Cartogr Geogr Inf Sci
(2003) - et al.
ColorBrewer.organ online tool for selecting colour schemes for maps
Br Cartogr Soc
(2003) - et al.
Mapping mortalityevaluating color schemes for choropleth maps
Ann Assoc Am Geogr
(1997) Spectral schemesControversial color use on maps
Cartogr Geogr Inf Sci
(1997)- et al.
An evaluation of color selections to accommodate map users with color-vision impairments
Ann Assoc Am Geogr
(1997) Guidelines for selecting colors for diverging schemes on maps
Br Cartogr Soc
(1996)
Evaluation of methods for classifying epidemiological data on choropleth maps in series
Ann Assoc Am Geogr
Reflections on mapping Census 2000
Cartogr Geogr Inf Sci
Maps and cartograms of the 2004 U.S. presidential election results
Cited by (93)
Spatial epidemiology: An empirical framework for syndemics research
2022, Social Science and Medicine“We only have the one”: Mapping the prevalence of people with high body mass to aid regional emergency management planning in aotearoa New Zealand
2020, International Journal of Disaster Risk ReductionCitation Excerpt :Disaster planning requires a good understanding of the geographical dimensions, boundaries, lifelines and important facilities [64] with advances in health data it is also possible to include key human health geography. The promise of GIS mapping include the potential to reach a broad array of audiences, including health planners, policymakers, advocacy groups, and an interested public [65]. Although this movement promotes creative means of analysis and identification of at-risk populations for planners and researchers, such accessibility may pose dilemmas relating to labelling populations living in particular geographic locales [40–42].
Spatial distribution of agricultural pesticide use and predicted wetland exposure in the Canadian Prairie Pothole Region
2020, Science of the Total EnvironmentCitation Excerpt :The classes were optimized to illustrate the spatial distribution of each pesticide group based on their specific frequency distributions. For example, for fungicides we used quantile distribution since the distribution was left skewed, whereas k-means algorithm was used for insecticides and herbicides, to reduce the variability within classes and maximize the variance among classes (Brewer, 2006). The open source software R, version 3.5 (R Core Team, 2019) was used for mapping and the R package classInt (Bivand, 2018) for the classification.
The Volcanic Hazard Maps Database: an initiative of the IAVCEI Commission on Volcanic Hazards and Risk
2023, Journal of Applied Volcanology