Beruflich Dokumente
Kultur Dokumente
Department of Justice
Office of Justice Programs
National Institute of Justice
AUG. 05
Special REPORT
www.ojp.usdoj.gov/nij
U.S. Department of Justice
Office of Justice Programs
810 Seventh Street N.W.
Washington, DC 20531
Alberto R. Gonzales
Attorney General
Regina B. Schofield
Assistant Attorney General
Sarah V. Hart
Director, National Institute of Justice
Mapping Crime:
Understanding Hot Spots
NCJ 209393
Sarah V. Hart
Director
This document is not intended to create, does not create, and may not be relied upon to
create any rights, substantive or procedural, enforceable by law by any party in any matter
civil or criminal.
Findings and conclusions of the research reported here are those of the authors and do
not necessarily reflect the official position or policies of the U.S. Department of Justice. The
products, manufacturers, and organizations discussed in this document are presented for
informational purposes only and do not constitute product approval or endorsement by the
U.S. Department of Justice.
The National Institute of Justice is a component of the Office of Justice Programs, which also
includes the Bureau of Justice Assistance, the Bureau of Justice Statistics, the Office of
Juvenile Justice and Delinquency Prevention, and the Office for Victims of Crime.
Cover Credits
(From left to right)
Map by Jeff Stith and the Wilson, North Carolina, Police Department
Map by Wilpen Gore, Heinz School of Public Policy and Management, Carnegie Mellon
University, Pittsburgh, Pennsylvania
Background map by Jeff Stith and the Wilson, North Carolina, Police Department
About This Report
Much of crime mapping is devoted to
detecting high-crime-density areas known
What did the researchers
as hot spots. Hot spot analysis helps find?
police identify high-crime areas, types of ■ Identifying hot spots requires multiple
crime being committed, and the best way techniques; no single method is suffi
to respond. cient to analyze all types of crime.
This report discusses hot spot analysis ■ Current mapping technologies have sig
techniques and software and identifies nificantly improved the ability of crime
when to use each one. The visual display analysts and researchers to understand
of a crime pattern on a map should be crime patterns and victimization.
consistent with the type of hot spot and
possible police action. For example, when ■ Crime hot spot maps can most effective
hot spots are at specific addresses, a dot ly guide police action when production
map is more appropriate than an area of the maps is guided by crime theories
map, which would be too imprecise. (place, victim, street, or neighborhood).
iii
Contents
About This Report . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Chapter 4. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
v
Chapter 1. Crime Hot Spots: What They Are,
Crime is not spread evenly across maps. scarce. Community policing is particularly
It clumps in some areas and is absent in attentive to high-crime neighborhoods,
others. People use this knowledge in their where residents have great difficulty exert
daily activities. They avoid some places ing social controls. Problem-oriented polic
and seek out others. Their choices of ing pushes police officials to identify
neighborhoods, schools, stores, streets, concentrations of crime or criminal activity,
and recreation are governed partially by determine what causes these concentra
the understanding that their chances of tions, and then implement responses to
being a victim are greater in some of reduce these concentrations. Much of
these places than in others. In some what is called crime analysis is dedicated
places people lock their cars and secure to locating concentrations of crime—hot
belongings. In other places they do not. spots—and much of crime mapping is
Along some streets people walk swiftly devoted to their detection.
and view approaching strangers with sus
picion. Along other streets they casually This chapter discusses how different inter
stroll and welcome the next interesting pretations of hot spots require different
person they might meet, and notice others types of crime maps. The principal theme
making the same choices in the same is that crime hot spot maps can most
areas. effectively guide police action when pro
duction of these maps is guided by theory.
Some might argue that this behavior With the appropriate crime theory, crime
merely shows that people are unreason maps can communicate vital information
ably fearful of some areas but not of oth to police officials and community mem
ers. This may often be true, but the fact bers efficiently and effectively.
that people are not equally fearful of all
places suggests that they understand that Many useful crime theories provide guid
crime is not evenly distributed. People ance for selecting mapping symbols.
might be mistaken about the risks of Which theory is most useful depends on
some places, but they are not mistaken the type of problem being mapped. Maps
that their risk of being a victim of crime is that are not based on theory will provide
not geographically constant. officers with inadequate and even mislead
ing information.
Police use this understanding every day.
Decisions about how to allocate scarce The term hot spot has a number of mean
resources are based partially on where the ings. This chapter begins with a discussion
demands for police are highest and where of what the term means and how the
they are lowest. Officers are told to be meanings relate to the concept of levels of
particularly attentive to some behavior in spatial analysis of crime. Different theories
some areas, but are given no guidance of crime explain crime at different levels,
about other areas where this behavior is so this chapter briefly describes various
1
SPECIAL REPORT / AUG. 05
levels of crime theories and explains how crime or disorder. It also suggests that
they can be depicted on maps. This chap some hot spots may be hotter than others;
ter examines four types of crime theories that is, they vary in how far above average
in greater detail: place (point) theories; they are.
street (line) theories; area (polygon) theo
ries; and repeat victim theories, which can
operate on point, line, or polygon level. Levels of hot spot analysis
These theories describe the levels of hot
spots and how these levels can be depict If hot spots are merely areas with an
ed on maps. This chapter examines why above average amount of crime or disor
crime theory, crime mapping, and police der, why do practitioners and researchers
actions need to be consistent. The end of use the term in such a variety of ways? In
the chapter examines how the map sym fact, with recent developments in crime
bols implied by each theory communicate mapping, one can find hot spots of any
to users of crime maps. size—from hot spot places to hot regions.
Although all of these perspectives on hot
spots have something in common—con-
centrations of crime or disorder separated
What is a hot spot? by areas with far less crime or disorder—
Areas of concentrated crime are often they differ in the area covered by the hot
referred to as hot spots. Researchers and spots. More importantly, the factors that
police use the term in many different ways. give rise to hot spot places are different
Some refer to hot spot addresses (Eck and from the factors that give rise to hot spot
Weisburd, 1995; Sherman, Gartin, and streets, hot spot neighborhoods, or hot
Buerger, 1989), others refer to hot spot spot cities. Further, the actions one takes
blocks (Taylor, Gottfredson, and Brower, to deal with a hot spot place will be differ
1984; Weisburd and Green, 1994), and oth ent from the actions needed to address a
ers examine clusters of blocks (Block and hot spot street, hot spot neighborhood, or
Block, 1995). Like researchers, crime ana hot spot city.
lysts look for concentrations of individual
events that might indicate a series of relat These approaches differ on the level of
ed crimes. They also look at small areas analysis, or the size of the geographic area
that have a great deal of crime or disorder, of crime about which one is concerned.2
even though there may be no common The level at which one examines crime or
offender. Analysts also observe neighbor disorder is dictated by the question one
hoods and neighborhood clusters with high asks, which will determine the usefulness
crime and disorder levels and try to link of the results. Consider two related, but
these to underlying social conditions. very distinct, questions: Where are drugs
being sold? What is the market for drugs?
Though no common definition of the term
hot spot of crime1 exists, the common The precise answer to the first question
understanding is that a hot spot is an area requires identifying specific drug-dealing
that has a greater than average number of locations or street segments (very small
criminal or disorder events, or an area areas) where drug dealers and customers
where people have a higher than average routinely meet. To answer the second
risk of victimization. This suggests the question, the analyst needs to find out
existence of cool spots—places or areas where the customers are coming from,
with less than the average amount of just as he would if he asked the question,
2
MAPPING CRIME: UNDERSTANDING HOT SPOTS
“What is the market for new cars?” The Crime hot spot theories
answer to the first question—specific
locations or street segments—is not par
ticularly useful for answering the second
Place theories
question. Rather, the analyst would be Place theories explain why crime events
interested in larger areas with high con occur at specific locations. They deal with
centrations of drug users. These areas crimes that occur at the lowest level of
might surround the locations and street analysis—specific places. They involve
segments identified when answering the looking at specific incidents and asking
first question, or they may be physically such questions as, “At what places are
separated from the dealing sites (as would burglaries occurring and at what places are
occur when suburban high school and col they not occurring?” Crime phenomena at
lege students drive into cities to find this level occur as points, so the appropri
drugs). The types of police actions that ate units of analysis are addresses, street
might remove drug-dealing locations are corners, and other very small places,
likely to be different from the actions which are typically represented on maps
needed to dry up the market. So identify as dots. Police action, such as warrants,
ing the appropriate level of analysis is criti which specify exact addresses (not blocks
cal to understanding the problem and or neighborhoods), is very precise at this
determining what action to take. level. Similarly, nuisance abatement focus
es on specific locations.
Crime theories are critical for useful crime
mapping because they aid in the interpre
tation of data (Eck, 1998) and provide Street theories
guidance as to what actions are most Street theories deal with crimes that occur
appropriate. Therefore, understanding at a slightly higher level than specific
how crime theories account for hot spots places; that is, over small, stretched areas
is critical. Several theories of crime and such as streets or blocks. A prostitution
disorder concentration (hot spots) exist. stroll is an example. At this level of analy
Some theories disagree, but often the sis analysts ask such questions as, “On
theories do not contradict each other. which streets are prostitutes found and on
Rather, they explain different types of which streets are they not found?” The
crime phenomena that occur at different appropriate units of analysis can be street
geographic levels. segments, paths, and sections of high
ways, which would be represented on
Each level has basic units of analysis—the maps as straight, bent, or curved lines.
things being examined. One can think of Police action is still relatively precise,
units as corresponding to the geographic although not as precise as at the place
areas being depicted on maps: points, level. Concentrated patrolling occurs at
lines, or polygons (Harries, 1999). Some this level, for example, as well as efforts to
theories help explain point concentrations change traffic and street patterns.
of crime. Other theories help explain linear
concentrations of crime or hot spot crime
polygons. However, theories of crime are Neighborhood theories
useful for helping to guide crime and disor Some theories of crime attempt to explain
der mapping only if one selects a theory neighborhood differences.3 At a higher
appropriate for the level of analysis and level than place or street, neighborhood
action.
3
SPECIAL REPORT / AUG. 05
theories deal with large areas. Here ana Exhibit 1 organizes and summarizes the
lysts are interested in such questions as, discussion of hot spot analysis so far and
“What areas are claimed by gangs and introduces what is to come. The first col
what areas are not?” The appropriate units umn describes the geographic concentra
of analysis are quite varied and can include tion at various levels of interest. The
square blocks, communities, and census second column describes the basic pat
tracts, to name a few. Two-dimensional tern formed by hot spots at each level. The
shapes such as ellipses, rectangles, and third column lists the geometric dimen
other polygons are used on maps to repre sion to be used on a crime map to depict
sent crime phenomena at this level. At this each type of hot spot. Place theories sug
level police action is far less precise gest maps with dots, street theories sug
because the areas are typically too large gest maps that emphasize lines, and area
for effective concentrated patrolling theories suggest the use of polygons on
(Sherman, 1997). Nevertheless, depending maps. Repeat victimization theories do not
on neighborhood characteristics, relevant directly correspond to a single dimension
action might include efforts to engage resi or level. They can be depicted on maps by
dents in collective action against crime dots, lines, or polygons. The last three
and disorder. If offenders are mobile columns highlight points discussed next.
throughout an area, rather than concen Examined are four types of hot spots—
trated at a few places, then efforts to places, victims, streets, and areas.
deter them should occur at this level.
4
MAPPING CRIME: UNDERSTANDING HOT SPOTS
Map Geometric
Concentration pattern dimension Theories Likely causes Examples
Place—at Point concentration; Zero; concentration Routine activity Management of Bar fights,
specific a few places with at points theory; place behavior at places convenience
addresses, many crimes and many management store
corners, or other places with few robberies,
places or no crimes. Repeat ATM patron
crime places are often robberies,
concentrated. drug dealing
locations
Among Often confused with Zero, one, or two; Routine activity Victim routines Domestic
victims repeat crime places concentration at theory; and lifestyle violence
(above). Only visible points, lines, and lifestyles choices
on maps if victims are areas
concentrated at places,
on streets, or in areas.
Street—along Linear concentration One; concentration Offender search Offender movement Outside
a street or block along major thorough- along lines theory patterns and target street
face fares; a few blocks concentrations prostitution,
with much crime and street drug
many blocks with little dealing,
crime robberies of
pedestrians
Area—neighbor- Concentration covering Two; concentration Disorganization Low collective Residential
hood areas multiblock areas in areas theory and re- efficacy, social burglary,
lated ecologic fragmentation, gang
theories of concentrations violence
crime; of youth,
opportunity economic disinvest-
theories ments; concentra-
tions of crime
targets
5
SPECIAL REPORT / AUG. 05
allows the depiction of repeat and non- Repeat victimization hot spots
repeat places on the same map and per
mits comparison among repeat places Repeat victimization refers to the multiple
about the number of crimes. Graduated attacks on the same individual, regardless
dots also allow one to find concentra of location. It often is confused with
tions of hot places (e.g., an area that repeat crime places. A repeat place might
contains several repeat assault bars). have a number of different victims.
Because graduated dots can obscure Clearly one can have both repeat victim
nearby features (e.g., a large dot may ization and repeat crime places (Eck,
overlap nearby smaller dots), this tech 2000). For example, a person could fre
nique is best used on large-scale maps. quent a bar where he is assaulted on a
number of different occasions. But if
■ Color gradient dots. Two other repeat victimization is distributed over
approaches are useful on small-scale many locations (as would occur if repeat
maps. One is to use a color gradient— victims are assaulted at different bars,
yellow through red, for example—to but never the same bar twice), it will not
depict the number of crimes at each show up on a map as a hot spot place
location. A yellow dot may be used to (zero dimension). Repeat victimization
represent places with a single crime, a could show up as lines (one dimension) if
light orange dot may represent locations the victims are repeatedly attacked along
with two crimes, and a deeper orange the same thoroughfares, or as a polygon
dot might represent places with three (two dimensions) if victims are repeatedly
crimes. This approach has the advantage attacked in the same neighborhoods.
of the use of graduated symbols but
overcomes the overlap problem. Mapping repeat victimization is more likely
to reveal patterns with vulnerable popula-
■ Repeat addresses. Another method is tions—potential victims who engage in
to select the most serious hot spot similar activities. Consider taxicab rob
addresses. For example, one might want beries and homicides. These crimes are
to find the worst 10 percent of the unlikely to be concentrated in places. One
addresses. This is called repeat address might find attacks on this victim group
mapping (RAM). The addresses would occurring along specific streets where the
be the 10 percent of repeat addresses drivers are particularly vulnerable or where
that have the most crimes. They would offenders have a better chance of escape.
be plotted on a map using dots to repre More likely, however, taxicab robberies
sent hot spots. This method has two dis and homicides will be spread over a neigh
tinct advantages. First, the map is borhood or in a multineighborhood area
clearer because it has less clutter. within a city.
Second, such maps are useful for clearly
specifying police targets. The deficiency Underlying causes. Repeat crime places
of RAM is that it leaves out information with different victims and repeat victimiza
about the other locations. This deficiency tion with different places have different
can be overcome by producing supple causes. Repeat crime places (with differ
mentary maps that show all locations or ent victims) can be attributable to the
by combining RAM with the use of a behavior of place managers, but if the vic
color gradient so that the targeted hot timizations occur at different places, place
spots have a distinct color (Eck, Gersh, managers have less of a role. In those
and Taylor, 2000). cases, one should look at the occupations,
6
MAPPING CRIME: UNDERSTANDING HOT SPOTS
7
SPECIAL REPORT / AUG. 05
Knowledge of offender, victim, and police outmigration. These changes either dis
behavior is critical to separating the under rupt social networks or prevent such
lying crime pattern from reporting and networks from forming. Since these net
recording patterns. works, according to disorganization the
ory, are responsible for most social
Maps for repeat streets. Commonly control in neighborhoods, their absence
available mapping programs make it easy leads to higher levels of deviancy. Other
to identify hot spot places or hot spot factors, such as poverty and racism, also
areas, but do not make linear hot spots have been identified as undermining
easy to identify. Simple dot maps can be social networks.
used to identify hot street segments, and
this may be the most straightforward ■ Social efficacy. Recent evidence from
method. Most clustering algorithms, Chicago points to the role of social effi
unfortunately, will show areas of concen cacy, which is “the willingness of local
tration even when a line is the most residents to intervene for the common
appropriate dimension. If high levels of good.” It depends on “mutual trust and
precision are not required, such area solidarity among neighbors” (Sampson,
maps may be adequate. Raudenbush, and Earls, 1997, page 919).
Neighborhoods that have a great deal of
social efficacy have less crime and disor
Neighborhoods and other area
der than neighborhoods that have low
hot spots
levels. Social efficacy—like disorganiza
More has been written about neighbor tion and social networks—is not a prop
hood concentrations of crime (hot spots) erty of individual people or places, but a
than about any other form of concentra characteristic of groups of people.
tion of crime. In their pathbreaking book
Social Factors in Juvenile Delinquency ■ Broken windows theory. The broken
(1931), Shaw and McKay noted persistent windows theory also is an area theory of
concentrations of deviancy in the 1920s. crime concentration. Wilson and Kelling
They noted that some neighborhoods had (1982) claim that in most well-function-
high levels of juvenile delinquency, year in ing neighborhoods, small transgressions
and year out, decade after decade, regard of social norms (e.g., failure to keep
less of who lived in the areas (Shaw and one’s yard tidy) result in social pressures
McKay, 1969). Since that time, many to bring the offending party into compli
explanations for differences in neighbor ance. Once a place becomes untended,
hood crime levels have surfaced. Most of however, it undermines the willingness
these theories focus on the ability of local and ability of residents to enforce social
residents to control deviancy (Bursik and order. Consequently, residents withdraw
Grasmick, 1993). from enforcing neighborhood norms,
which allows further deviancy to occur.
Underlying causes. Explanations for dif This in turn results in additional with
fering neighborhood crime levels include drawal and fear and the neighborhood
the following: begins to spiral downward. Skogan
(1990) found evidence in support of this
■ Social disorganization theory. This the basic thesis, although others suggest
ory suggests that the natural ability of the evidence is weak (Harcourt, 1998) or
people to control deviancy in their neigh show that the theory is seriously flawed
borhoods is impaired in some areas by (Taylor, 2000).
constant residential turnover and net
8
MAPPING CRIME: UNDERSTANDING HOT SPOTS
9
SPECIAL REPORT / AUG. 05
Knowing which streets have the robberies Limitations of hot spot maps
and which do not is critical for addressing
such a concentration. So showing this Concentrations of victimization sometimes
form of hot spot requires lines—straight, can be shown with maps, but often they
jointed, curved, or intersecting. cannot. If victimization risk is in part geo
graphical, then maps are useful. A city
Ellipse, choropleth, and isoline maps. wide dot map of gas stations with two or
When hot spots cover broader areas and more robberies within the last 6 months
coincide with neighborhoods, they need to shows concentration at two levels. The
be depicted in another way. Ellipse and dots depict concentrations of robbery at
choropleth maps imply that the areas with specific places. Groupings of dots depict
in the designated hot spots share the streets or neighborhoods with concentra
same risk level, so a specific street or loca tions of repeat robbery gas stations. Dot
tion within the area is irrelevant. Isoline maps for this type of victimization makes
maps imply a continuous gradient of risk some sense, but they do not work for all
within a hot spot, so a particular place has forms of victimization concentration. If vic
risks similar to but not the same as an tims are mobile, street or area maps might
adjacent place or street. A gang-related be more useful. However, the use of maps
robbery problem can be an example. If is limited for some forms of victimization
gang members commit robberies through analysis. If the population of potential vic
out specific neighborhoods (i.e., do not tims is spread throughout an area (not
focus on specific streets or around specif concentrating at places, along streets, or
ic sites), but refuse to commit robberies within neighborhoods), the analyst would
outside their territories, and their territorial be better off using an analytical technique
boundaries are streets, then a choropleth other than maps to convey the concentra
map might be useful. One could create a tion. For example, taxicab robberies may
map of the gang areas and shade the be spread quite thinly across a city. The
areas according to the robbery frequency relevant features of the robbery victims
within each. If the likelihood of a gang- might be related to the cab companies,
related robbery diminishes the farther one the drivers’ ages, hours of operation,
goes from the center of gang activity, then installed security within cabs, or a host of
an isoline map depicting gradients of rob factors that cannot be shown on a map.
bery frequency does a better job of show Police officers trying to investigate or pre
ing the problem. vent such robberies would find maps less
useful than bar charts showing the charac
Ellipses may be far less useful. They sug teristics of victims and nonvictims.
gest a firm boundary between crime on
the inside and no crime on the outside, Exhibit 2 links the major points discussed
but they frequently do not follow natural thus far. The first two columns are from
movement patterns of people. Using an exhibit 1. The third column shows where
ellipse to define an area hot spot is like the police action needs to be focused. If
saying, “Look in this general area,” the concentration level, action level, and
because neither its shape nor its boundary form of hot spot depiction are not aligned,
are likely to conform to the nature of the then the map will be useless at best and
underlying problem. Consequently, suggest inappropriate action at worst. A
ellipses provide police officers with far map depicting hot streets or areas does
less information than other ways of depict not help identify places where nuisance
ing area hot spots. abatement would be useful. Alternatively,
a point map is too specific for implement
10
MAPPING CRIME: UNDERSTANDING HOT SPOTS
11
SPECIAL REPORT / AUG. 05
depicting hot spots is connected with use 3. Some disagreement exists over the geographic
ful theories, each of which suggests differ scope of neighborhood theories of crime. Most
researchers refer to areas covering several square
ent types of police action. Recognition of blocks, although Taylor (1997, 1984) makes a strong
these links in mapping practice will lead to case for the relevant area being about the size of a
better use of crime maps. single block. Clearly, the difference between a large
place and a small neighborhood is blurry.
Notes
1. Although one could have hot spots of anything that
References
can be geographically distributed—a hot spot of auto Block, R., and C. Block. 1995. “Space, Place and
mobile dealerships, for example—usage of this term Crime: Hot Spot Areas and Hot Place of Liquor-
is restricted to crime or disorder. So unless other Related Crime.” In J.E. Eck and D. Weisburd (eds.),
wise specified, hot spots means hot spots of crime Crime and Place (vol. 4, pp. 145–184). Monsey, NY:
or disorder. Criminal Justice Press.
2. Level does not indicate superiority or rank in this Block, R., M. Felson, and C.R. Block. 1985. “Crime
instance. A high-level hot spot is not better or worse Victimization Rates for Incumbents of 246
than a low-level hot spot. Rather, higher level hot spots Occupations.” Sociology and Social Research, 69,
can be composed of groups of lower level hot spots. 442–451.
In this sense, level refers to level of aggregation.
Everywhere outside A B
the chance of being a
crime site that is part
of the pattern is zero.
Everywhere inside is
equally likely to be a C
crime site that is part
D
of the pattern.
12
MAPPING CRIME: UNDERSTANDING HOT SPOTS
Brantingham, P.L., and P.J. Brantingham. 1981. “Notes Menard, S. 2000. “The ‘Normality’ of Repeat
on the Geometry of Crime.” In P.J. Brantingham and Victimization From Adolescence Through Early
P. L. Brantingham (eds.), Environmental Criminology Adulthood.” Justice Quarterly, 17(3), 543–574.
(pp. 27–54). Beverly Hills, CA: Sage Publications.
Sampson, R.J., S.W. Raudenbush, and F. Earls. 1997.
Brantingham, P.L., and P.J. Brantingham. 1995. “Neighborhoods and Violent Crime: A Multilevel
“Criminality of Place: Crime Generators and Crime Study of Collective Efficacy.” Science, 227, 918–924.
Attractors.” European Journal of Criminal Justice
Policy and Research, 3, 5–26. Shaw, C.R., and H.D. McKay. 1931. Social Factors in
Juvenile Delinquency (vol. 2, no. 13). Washington,
Bursik, R.J., and H.G. Grasmick. 1993. Neighborhoods DC: U.S. Government Printing Office.
and Crime: The Dimensions of Effective Community
Control. New York, NY: Lexington Books. Shaw, C.R., and H.D. McKay. 1969. Juvenile
Delinquency and Urban Areas (revision of 1942 ed.).
Eck, J. 2000. “Policing and Crime Event Concentration.” Chicago, IL: University of Chicago.
In L.W. Kennedy, R.F. Meier, and V.F. Sacco (eds.),
The Process and Structure of Crime: Criminal Events Sherman, L.W. 1997. “Policing for Crime Prevention.”
and Crime Analysis. New Brunswick, NJ: Transaction In L.W. Sherman, D. Gottfredson, D. MacKenzie, J.
Press. Eck, P. Reuter, and S. Bushway (eds.), Preventing
Crime: What Works, What Doesn’t, What’s Promising
Eck, J.E. 1994. Drug Markets and Drug Places: A (pp. 8-1–8-58). Washington, DC: U.S. Department of
Case-Control Study of the Spatial Structure of Illicit Justice, National Institute of Justice.
Drug Dealing. Unpublished Ph.D. dissertation,
University of Maryland. Sherman, L.W., P.R. Gartin, and M.E. Buerger. 1989.
“Hot Spots of Predatory Crime: Routine Activities
Eck, J.E. 1998. “What Do Those Dots Mean? and the Criminology of Place.” Criminology, 27(1),
Mapping Theories With Data.” In D. Weisburd and 27–55.
T. McEwen (eds.), Crime Mapping and Crime
Prevention (vol. 8, pp. 379–406). Monsey, NY: Skogan, W.G. 1990. Disorder and Decline: Crime
Criminal Justice Press. and the Spiral of Decay in American Neighborhoods.
New York, NY: Free Press.
Eck, J.E., J.S. Gersh, and C. Taylor. 2000. “Finding
Crime Hot Spots Through Repeat Address Mapping.” Spelman, W. 1995a. “Criminal Careers of Public
In V. Goldsmith, P.G. McGuire, J.H. Mollenkopf, and Places.” In J.E. Eck and D. Weisburd (eds.), Crime
T.A. Ross (eds.), Analyzing Crime Patterns: Frontiers and Place (vol. 4, pp. 115–144). Monsey, NY: Criminal
of Practice (pp. 49–64). Thousand Oaks, CA: Sage Justice Press.
Publications.
Spelman, W. 1995b. “Once Bitten, Then What?
Eck, J.E., and D. Weisburd. 1995. “Crime Places in Cross-Sectional and Time-Course Explanations of
Crime Theory.” In J. E. Eck and D. Weisburd (eds.), Repeat Victimization.” British Journal of Criminology,
Crime and Place (vol. 4, pp. 1–33). Monsey, NY: 35, 366–383.
Criminal Justice Press.
Stedman, J., and D.L. Weisel. 1999. “Finding and
Farrell, G., and K. Pease. 1993. Once Bitten, Twice Addressing Repeat Burglaries.” In C.S. Brito and
Bitten: Repeat Victimization and Its Implications for T. Allen (eds.), Problem-Oriented Policing: Crime-
Crime Prevention (vol. 46). London: Her Majesty’s Specific Problems, Critical Issues and Making POP
Stationery Office. Work (vol. 2, pp. 3–28). Washington, DC: Police
Executive Research Forum.
Harcourt, B.E. 1998. “Reflecting on the Subject: A
Critique of the Social Influence Conception of Taylor, R.B. 1997. “Social Order and Disorder of Street
Deterrence, the Broken Windows Theory and Order Blocks and Neighborhoods: Ecology, Microecology
Maintenance Policing New York Style.” University and the Systematic Model of Social Disorganization.”
Michigan Law Review, 97, 291–389. Journal of Research in Crime and Delinquency, 34(1),
113–155.
Harries, K. 1999. Mapping Crime: Principle and
Practice. Washington, DC: U.S. Department of
Justice, National Institute of Justice.
13
SPECIAL REPORT / AUG. 05
Taylor, R.B. 2000. Breaking Away from Broken Uchida (eds.), Drugs and Crime: Evaluating Public
Windows: Baltimore Neighborhoods and the Policy Initiatives. Thousand Oaks, CA: Sage
Nationwide Fight Against Crime, Grime, Fear, and Publications.
Decline. Boulder, CO: Westview Press.
Weisburd, D., and L. Green. 1995. “Policing Drug Hot
Taylor, R.B., S.D. Gottfredson, and S. Brower. 1984. Spots: The Jersey City Drug Market Analysis
“Block Crime and Fear: Defensible Space, Local Experiment.” Justice Quarterly, 12(4), 711–736.
Social Ties, and Territorial Functioning.” Journal of
Research in Crime and Delinquency, 21, 303–31. Wilson, J.Q., and G.L. Kelling. 1982. “Broken
Windows: The Police and Neighborhood Safety.”
Weisburd, D., and L. Green. 1994. “Defining the Atlantic Monthly, March, 29–38.
Street Level Drug Market.” In D.L. MacKenzie and C.
14
Chapter 2. Methods and Techniques for
Understanding Crime Hot Spots
Spencer Chainey, Jill Dando Institute of Crime Science, University College London
This chapter presents a number of meth these points will be aggregated to small
ods and techniques to understand and area geographies such as beats or census
describe patterns of hot spots in crime blocks to demonstrate the applicability of
data. It explains the advantages and disad certain techniques. The methods dis
vantages of certain techniques, focusing cussed and tested on these data are per
on methods that are easy to understand fectly applicable to analysts’ own data.
and practical to apply. Four data sets are
used to test the methods. The results help
evaluate how the methods improve under Preliminary global
standing of crime patterns. This chapter
does not attempt to find the optimal
statistical tests
method. Rather, it presents a procedure A number of simple-to-use global statisti
for applying a number of complementary cal tests can be used to help analysts
methods that can help analysts under understand general patterns in the crime
stand hot spots in the data. data presented here. These tests
include—
The four data sets are from the London
Metropolitan Police Force’s Crime Report ■ Mean center.
Information System for Hackney Borough
Police for the period June 1999 through ■ Standard deviation distance.
August 1999. This chapter contains tests
■ Standard deviation ellipse.
developed and conducted to help analysts
understand hot spots in representative
■ Tests for clustering.
samples of crime data sets of—
15
SPECIAL REPORT / AUG. 05
and the two other crime subsets. The tance, the more dispersed are the crime
mean center for vehicle crime is the far data. These results show that vehicle
thest south of all the crime types, and crime is the most dispersed; robbery is
street robbery is nearly as far north as that the least.
for residential burglary but slightly farther
west. These mean centers can be used to Standard deviation ellipses
generally indicate that residential burglary
and robbery offenses show a greater ten Levels of dispersion also can be presented
dency to occur in the northern part of the using standard deviation ellipses. The size
borough and that vehicle crime affects the and shape of the ellipse help explain the
southern areas of the borough more. degree of dispersion, and its alignment
helps to explain the crime type’s orienta
tion. Exhibit 3 shows standard deviation
Standard deviation distance ellipses for the four crime types. The sub
Measures of standard deviation distance tle differences between the ellipses help
help explain the level and alignment of dis describe the relative differences in disper
persion in the crime data. These statistics sion and alignment of the four crime
are best used as relative measures, com types’ patterns. The ellipse with the small
paring crime types against each other or est area (robbery) is the least dispersed of
the same crime types for different periods the crime types. The position of the rob
of time. Exhibit 2 shows the standard devi bery ellipse farther north of all crime and
ation distances for the four crime types. vehicle crime, but slightly farther south of
The greater the standard deviation dis the residential burglary ellipse, reflects its
16
MAPPING CRIME: UNDERSTANDING HOT SPOTS
All crime
Robbery
Residential burglary
Vehicle crime
17
SPECIAL REPORT / AUG. 05
evidence of clustering. The NNI test com The results also show the differences
pares the actual distribution of crime data between the NNI results for a minimum
against a data set of the same sample bounding rectangle area and the actual
size, with random distribution. It can be catchment area of the crime points. When
applied when the user has access to data the actual catchment area is known or can
in which each point relates to individual easily be calculated, it should be used in
crime events (irrespective of whether the calculation of the NNI. If the area is
some of these events are mapped on top not known, a minimum bounding rectan
of each other at exactly the same loca gle around the crime distribution often is
tion). The NNI method is explained in the used to calculate the area representing
following steps: the crime data’s catchment zone.
■ For each point, calculate the distance to However, minimum bounding rectangle
the nearest neighbor point. areas used for NNI tests often can create
misleading results for describing point dis
■ Sum the nearest neighbor distance for tributions. To test and show evidence of
all points and divide by the number of this, analysts purposely designed a regular
points in the data. This value is the distribution of points and applied three dif
observed average nearest neighbor ferent types of bounding areas: minimum
distance. bounding rectangle, bounding convex hull,
and true boundary area. After applying NNI
■ Create a random distribution of the tests, the bounding convex hull method
same number of crime points covering and true boundary area revealed similar
the same geographic area, and for each results, correctly describing the point
point calculate the distance to each
nearest neighbor point.
■
Exhibit 4. Nearest neighbor analysis results for the
Calculate the sum of the nearest neigh
four crime data sets
bor distances for all these randomly
distributed points and divide by the Crime type and
number of points in the data. This value bounding area* NNI Z-score
is the average random nearest neighbor
All crime
distance.
Bounding rectangle area 0.32 –129.11
■ The NNI is then the ratio of the True boundary area 0.46 –103.14
observed average nearest neighbor dis
Robbery
tance against the average random near
est neighbor distance. Bounding rectangle area 0.59 –19.14
True boundary area 0.80 –9.20
If the result generated from the NNI test is
Residential burglary
1, then the crime data are randomly dis
tributed. If the NNI result is less than 1, Bounding rectangle area 0.57 –27.14
then the crime data show evidence of True boundary area 0.74 –16.46
clustering. An NNI result that is greater Vehicle crime
than 1 reveals evidence of a uniform pat
Bounding rectangle area 0.52 –38.73
tern in crime data.
True boundary area 0.72 –22.16
Exhibit 4 shows the NNI results for the
four crime data sets. All four sets show *Crime distribution is clustered for all areas.
evidence of clustering in their distribution.
18
MAPPING CRIME: UNDERSTANDING HOT SPOTS
data’s distribution to be regular. The result would seem more important to use meth
from the minimum bounding area suggest ods that do not require an intensity value
ed the same distribution to be random. but retain and perform tests on the original
Where the true boundary area is not avail crime event point data.
able, a convex hull that bounds the limits
of the point distribution will return a more Spatial autocorrelation methods include
accurate result for evidence of clustering the following:
than a minimum bounding rectangle.
■ Moran’s I. If analysts have access only
A z-score test statistic can be applied to to crime point data that are aggregate
help place confidence in the NNI result. counts (representing the number of
This test for statistical significance crime events within a certain geographic
describes how different the actual average area, e.g., census blocks), an appropriate
nearest neighbor distance is from the method to apply to test for clustering is
average random nearest neighbor dis the spatial autocorrelation technique,
tance. The significance of the z-score can Moran’s I. (See exhibit 5.) Moran’s I sta
be found in any table of standard normal tistic works by comparing the value at
deviations. The general rule to follow is any one location with the value at all
that the more negative the z-score, the other locations (Levine, 2002; Bailey and
more confidence can be placed in the NNI Gatrell, 1995; Anselin, 1992; Ebdon,
result, bearing in mind that for smaller 1985). Moran’s I requires an intensity
sample sizes the z-score will be less than value for the crime point, which is often
that for larger samples of crime points. represented as the centroid of the geo
graphic boundary area. This point is then
Test for spatial auto correlation. Spatial assigned an intensity value. For crime
autocorrelation techniques test whether applications, this most often is the count
the distributions of point events are relat of crimes within that geographic area.
ed to each other. Positive spatial autocor The Moran’s I result varies between
relation is said to exist where events are –1.0 and +1.0. Where points that are
clustered or where events that are close close together have similar values, the
together have similar values than those Moran’s I result is high. The significance
that are farther apart. of the result can be tested against a the
oretical distribution (one that is normally
Spatial autocorrelation tests, of which distributed) by dividing by its theoretical
Moran’s I is one commonly applied standard deviation. (For more details on
method, have been used previously to test this test, see Levine, 2002.)
for evidence of crime event clustering
(Chakravorty, 1995). Spatial autocorrelation ■ Geary’s C statistic. A second spatial
techniques require an intensity value, be it autocorrelation method is Geary’s C
a weighting linked to the event or a count statistic. (See exhibit 5.) This method is
of crimes where the crime point relates to best applied to describe differences in
the coordinate of an area to which crime small neighborhoods. (Moran’s I gives
events have been aggregated (e.g., the more of a global indicator of spatial rela
centroid of the area). If the original crime tionships; Levine, 2002). Geary’s C sta
event data exist as accurate point georef tistic is a measure of the deviations in
erenced data, aggregating this data to a intensity values of each point with one
common point will lose spatial detail. With another. The values of C typically vary
the increased availability of accurate and between 0 and 2, where values less
precise geocoded records of crime, it than 1 indicate evidence of positive
19
SPECIAL REPORT / AUG. 05
spatial autocorrelation and values centrated together more than any other
greater than 1 indicate evidence of neg crime category.
ative spatial autocorrelation. Similar to
Moran’s I, the Geary coefficient can be The following sections explore various
tested for significance against a theoreti techniques for mapping crime. These sec
cal distribution (one that is normally dis tions mainly use the vehicle crime data set
tributed) by dividing by its theoretical to explore the different applications of
standard deviation. these techniques to help in understanding
hot spots of crime.
The differences in the sensitivities of the
two spatial autocorrelation tests are noted
in the results on the four crime categories. Crime mapping techniques
For example, evidence exists of positive
spatial autocorrelation for robbery at a Point mapping
more global level. However, the Geary
coefficient reveals that at the smaller The most common approach for display
neighborhood level, areas that have a high ing geographic patterns of crime is point
number of robberies are surrounded by mapping (Jefferis, 1999). Point mapping is
areas with a low number of robberies. popular mainly because it is a simple digi
tal version of a familiar and traditional
The preliminary global statistics reveal a method of placing pins representing crime
wealth of knowledge about the crime data events onto a wall map. In a digital appli
even before the mapped distributions are cation, if these individual geographic point
explored in detail. The analysts have gener objects are suitably attributed with infor
ated an understanding of the global pat mation, such as the code describing the
terns in the crime data and shown that type, date, and time of offense, sets of
evidence of clustering exists in all of the points meeting particular conditions can
four crime categories. This clustering be simply and quickly selected. These
does, however, vary at different scales. selections can then be displayed using
The dispersion tests have revealed that suitable symbology representing the
although clustering exists in the data, the crime category displayed. However, trying
crime hot spots for vehicle crime are more to interpret spatial patterns and hot spots
dispersed than any of the other crime cat in the crime point data can be difficult,
egories. These tests, therefore, begin to particularly if the data sets are large.
reveal a picture of what the hot spot maps
may look like. For example, when the rob Exhibit 6 shows the 1,747 events of vehi
bery data are mapped, analysts can expect cle crime. The large volume of points
hot spots of crime to exist and to be con shown on map A makes it difficult to feel
20
MAPPING CRIME: UNDERSTANDING HOT SPOTS
completely confident in identifying the hot demonstrates the difficulty in trying to con
spots of this particular crime. The prelimi sistently identify crime hot spots from
nary global tests revealed that clusters of point data. Also, at certain locations, what
vehicle crime were evident, but that these appears to be a single crime point may be
hot spots were more dispersed than any more than one crime point. These points at
of the other crime types. To demonstrate coincident locations are impossible to iden
the ambiguity in trying to identify the hot tify using the point map in exhibit 6. Point
spots of crime from this particular data maps do have their application for mapping
set, the analysts asked three crime ana individual events of crime, small volumes
lysts who were not familiar with the study of crime, and repeat locations through the
area to draw the location of the top three use of graduating symbol sizes (see exhibit
hot spots of vehicle crime on map B. The 7), but they can become less effective for
three analysts identified two common identifying hot spots of crime, particularly
areas as being hot spots of crime, but the from large data volumes.
areas drawn at these locations differed in
size and shape. Two hot spots identified by
Spatial ellipses
analyst 1 are completely different from
those identified by analysts 2 and 3. The application of spatial ellipses for
attempting to identify crime hot spots
Which analyst is correct? At this stage it is has a long tradition in crime mapping.
difficult to tell because each hot spot The Spatial and Temporal Analysis of
drawn is plausible. Not all of those drawn Crime (STAC) software distributed by
will be completely correct. This example the Illinois Criminal Justice Information
Crime analyst 1
Crime analyst 2
Crime analyst 3
A B
21
SPECIAL REPORT / AUG. 05
Authority is one type of available soft into a single cluster or when the grouping
ware used for spatial ellipses. The Space criteria fails (Levine, 2002).
Analyzer component of STAC works by
creating standard deviational ellipses K-means clustering. The K-means tech
around crime point clusters. Other spatial nique creates a user-defined number (K) of
ellipse techniques include hierarchical clus ellipses by partitioning the crime point
tering and the K-means clustering routine. data into groupings. The routine finds the
best positioning of the K centers and then
Hierarchical clustering. This method uses assigns each point to the center that is
a nearest neighbor analysis technique to nearest. Exhibit 8 shows five ellipses cre
identify groups of a minimum number of ated using the K-means method. The
user-defined points. The nearest neighbor method demonstrates how spatial ellipse
analysis technique used identifies only techniques are useful for grouping crime
those points that are closer than expected point clusters and revealing areas for clos
under spatial randomness. The first set of er inspection. However, the ellipse outputs
ellipses generated through this process is also demonstrate certain weaknesses in
referred to as first-order clusters. Grouping these types of techniques if analysts are
the first-order clusters can then generate trying to accurately identify the location of
second-order ellipses. This process can crime hot spots. Crime hot spots do not
then be repeated until all crime points fall naturally form spatial ellipses. These
6 to 10
4 to 6
22
MAPPING CRIME: UNDERSTANDING HOT SPOTS
methods, therefore, do not represent the method returns more accurate results.
actual spatial distribution of crime and can Both methods are plausible and important
often mislead the analyst to focus on ly present the opportunity to explore the
areas of low crime importance within an nature of crime in these areas in more
ellipse. Also, all of these techniques detail. However, neither method presents
require a thorough understanding of the the immediate opportunity to prioritize the
routines at work because each method main crime hot spots to assist in preven
requires a number of parameters to be tion targeting.
entered. Users with a vague understand
ing of the actual methods are given few Thematic mapping of
rules to enter appropriate parameters, geographic boundaries
thus introducing ambiguity and influencing
variability in the final output. For example, A popular technique for representing any
different crime analysts investigating crime spatial distribution is geographic boundary
hot spots from the same data may pro thematic mapping. These geographic
duce different results because of the dif boundaries usually are defined administra
ferent parameters they have chosen to tive or political areas such as beats, cen
enter into the routine. Grouping events sus blocks, polling districts, wards, or
into elliptical clusters also excludes from borough boundaries. Crime events
any visual result those events that do not mapped as points can be aggregated to
belong to a cluster. these geographic region areas. These
counts of events by their geographic areas
Four of the five ellipses drawn using the can then be thematically mapped to dis
K-means method partially overlap the hot play the spatial pattern of crime across the
spot areas drawn by the three crime ana area of interest (see exhibit 9).
lysts from the point map. One ellipse dif
fers completely from an area identified by When thematic maps of this type are cre
analyst 1. At this stage it is difficult to ated, the user is required to identify the
understand which one is correct or which type of range to represent the distribution
23
SPECIAL REPORT / AUG. 05
of crime (e.g., equal count, equal ranges, crime analyst was asked to identify those
natural break, standard deviation, quantile, areas he believed to be the three main hot
or custom range; for a good review of spots of crime. These are the three hot
these different types of range settings, spot areas drawn on the map. Notice how
see Harries, 1999). This freedom to choose these interpreted hot spot areas differ
a range of any type creates variability in from those drawn directly from the point
any thematic map produced for identifying data and those identified by spatial
crime hot spots. Different range types ellipses. Which map is correct?
structure the groupings of geographic
boundary crime counts into differing Thematic maps such as the one in exhibit
threshold categories. At what range 9 tend to attract the audience to the
threshold can the crime analyst confident largest areas shaded in the top threshold
ly conclude that geographic boundaries in color range. Therefore, the single census
that range and above represent hot spots block A has been selected as a hot spot as
of crime? In other words, when is a hot it stands out boldly. Census block B has
spot a hot spot? not been selected as a hot spot, yet its
area is one-eighth of that of census block
Considering the theory and application A and it has a similar crime count, as it
behind the map the user is trying to pro features in the top threshold range. This
duce and the target audience is important reveals a problem with using geographic
at this stage. If the application is to identi boundary thematic maps to identify hot
fy the hot spots of crime, the range spots of crime. Due to the varying size
thresholds should be structured to focus and shape of the census blocks (and most
on revealing the locations of these high-
volume crime areas. The analysts also
should consider structuring the thematic Exhibit 9. Vehicle crimes mapped by census tract
ranges in a manner that is easy for the
audience to understand. The map should Greater than 15
be the central message and the thematic 10 to 15
ranges should follow in a logical sequence. 5 to 10
If the thematic ranges require explanation
or confuse the audience, the analysts have 1 to 5
lost the opportunity to present their cen 0
tral message: mapping and displaying
crime hot spots. Mapping is an iterative
process. The first map a user produces is
unlikely to be the one printed or included
in a report. Decide on the message and
follow through with this message by test
ing different settings available for the the
matic thresholds.
24
MAPPING CRIME: UNDERSTANDING HOT SPOTS
25
SPECIAL REPORT / AUG. 05
with a number of the hot spots identified kriging, and splining are all designed to
from the point map. In addition, notice use an intensity, population, or ‘z’ value
that the census block A identified as a hot taken from sample locations to estimate
spot in the geographic boundary thematic values for all locations between sample
map (exhibit 9) is not revealed as a hot sites. For example, interpolation tech
spot using this method. The crimes that niques are commonly used to create sur
contributed to the high count in this large faces representing the distribution of
census block are generally spread across rainfall, where the values between rain
the whole geographic region. The quadrat gauges are estimated from a function that
method, therefore, appears to offer a considers the rainfall readings and the dis
more accurate method for identifying hot tribution of sample sites (i.e., rain gauges).
spots of crime, particularly where applica With crime data, sample sites with an
tions for crime prevention targeting are intensity value do not necessarily exist.
required. Neither would it make sense to apply one
of these techniques to estimate the num
However, the quadrat method still restricts ber of crimes that may have occurred
analysts to using a defined geographic between the existing crime point loca
area to represent the size and shape of tions. No crimes have been recorded in
crime hot spots. The restriction of using the areas between crime points, so the
thematically shaded, defined geographic analysts should avoid methods that aim to
grid cells often results in loss of spatial create estimated intensity values in the
detail within each quadrat and across gaps between the points. Instead, sur
quadrat boundaries. This can lead to prob faces that the analysts wish to create that
lems of inaccurate interpretation. The map represent the distribution of crime should
also looks blocky and does not entirely act as visualizations for helping them
invoke interest. A common request to understand crime patterns. Methods that
address this problem is to reduce the grid suit the analysts’ application should there
cell size. All this does is produce a speck fore represent, as a continuous surface,
led map of small, thematically shaded grid the relationships or densities between
cells that fail to draw the user to any firm crime point distributions.
conclusions as to the size, shape, or loca
tion of crime hot spots for resource target Quartic kernel density estimation. The
ing. Also, issues with thematic range most suitable method for visualizing crime
settings still exist with this method, includ data as a continuous surface is quartic ker
ing those that relate to the MAUP. nel density estimation (Chainey et al.,
2002; McGuire and Williamson, 1999). The
quartic kernel density method creates a
Interpolation and continuous
smooth surface of the variation in the den
surface smoothing methods
sity of point events across an area. The
Interpolation. Interpolation is an increas method is explained in the following steps:
ingly popular method for visualizing the
distribution of crime and identifying hot 1. A fine grid is generated over the point
spots. It aggregates points within a speci distribution (see exhibit 11). In most
fied search radius and creates a smooth, cases, the user has the option to specify
continuous surface that represents the the grid cell size.
density or volume of crime events distrib
uted across the area. Common interpola 2.A moving three-dimensional function of
tion techniques, such as inverse distance a specified radius visits each cell and cal
weighting, triangulation with smoothing, culates weights for each point within the
kernel’s radius (see exhibit 12). Points
26
MAPPING CRIME: UNDERSTANDING HOT SPOTS
closer to the center receive a higher the parameter that will lead to most differ
weight and therefore contribute more to ences in output when varied. Guidelines
the cell’s total density value. exist for working out suitable values for
these two parameters. For crime mapping
3.Final grid cell values are calculated by applications, a suitable method to follow
summing the values of all circle surfaces for choosing bandwidth is that suggested
for each location (see exhibit 13). by Williamson et al. (1999), where the
bandwidth relates to the mean nearest
The quartic kernel estimation method neighbor distance for different orders of K.
requires two parameters to be set prior to These nearest neighbor distances for dif
running. These are the grid cell size and ferent orders of K usually are calculated as
bandwidth (search radius). Bandwidth is
Exhibit 11. A fine grid is placed over the area Exhibit 12. A search radius (or bandwidth) is
covered by the crime points then selected
Density
0.18
0.16
0.14 Kernal density estimate
0.12
0.10 Quartic functions
over individual points
0.08
0.06
0.04
0.02
0
0 1 2 3 4 5 6 7 8 9 10111213141516 1718 19 20
Relative location
Note: Each grid cell value is the sum of values of all circle surfaces for each location.
27
SPECIAL REPORT / AUG. 05
part of the nearest neighbor analysis test spatial distribution of the points being ana
for clustering. If this method is adopted, lyzed. The method also has the advantage
the user is still required to choose which K of deriving crime density estimates based
order to apply. This can be regarded as an on calculations performed at all locations
advantage of the technique because users (Levine, 2002), and retains some practical
are prompted to think about the data they flexibility in map design.
are mapping and apply the K order’s mean
nearest neighbor distance as the band The increased application of this type of
width relevant to their crime mapping continuous surface smoothing method is
application. For example, a smaller band due largely to its more common availability
width would be used for an application and visual appeal. Continuous surface hot
that requires output for focused police spot maps allow for easier interpretation
patrolling than for one that requires a more of crime clusters and reflect more accu
strategic view of crime hot spots. rately the location and spatial distribution
of crime hot spots. The quartic kernel den
Users also are required to specify a grid sity estimation method also considers
cell size. This can be regarded as a positive concentrations of crime at all event levels,
feature in kernel estimation because users rather than cluster grouping some and dis
have the flexibility to set sizes relevant to counting others. As their appeal has
the scale at which the output will be increased, however, few questions are
viewed. Large cell sizes will result in more
coarse or blocky-looking maps but are fine
for large-scale output, while smaller cell Exhibit 14. Quartic kernel density estimation
sizes add to the visual appeal of the con surface for vehicle crime using a bandwidth of
tinuous surface produced but may create 220 m (K16)
large file sizes. When unsure of the cell Highest intensity
size, the user can follow the methodology
of Ratcliffe (1999b), who divides the short
er side of the minimum bounding rectan
gle (or the shortest of the two extents
between minimum X/maximum X or mini
mum Y/maximum Y) by 150. Cell size for
the area that the data cover has been set Lowest intensity
to 30 m. The cell values generated from
quartic kernel estimation also are in mean
ingful units for describing crime distribu
tions (e.g., the number of crimes per
square kilometer). Exhibit 14 shows the
quartic kernel density estimation surface
generated from the vehicle crime point
data, where the bandwidth chosen was a
K order of 16 (220 m).
28
MAPPING CRIME: UNDERSTANDING HOT SPOTS
being asked of the outputs generated. The vehicle crime. The method is visually
issue over which thematic range to choose appealing and structures the thematic
to represent the different thematic thresh thresholds to clearly identify areas of high
olds remains a problem. Many agencies est crime concentration. It is simple to
often fail to question this validity or statis apply because it requires only the calcula
tical robustness of the map produced, tion of the grid cell mean; it uses K order
being caught instead in the visual lure of mean nearest neighbor distances to
their sophisticated looking geo-graphic. define bandwidths; it retains flexibility in
Thus, little regard is given to the legend map design by allowing the user to apply
thresholds that are set to help the analyst different K order bandwidths and grid cell
decide when a cluster of crimes can be sizes to the suited application; and it uses
defined as a hot spot. For example, the a consistent methodology to separate the
number of hot spots on a map showing matic thresholds.
the distribution of crime as a kernel densi
ty surface depends on the ranges selected As a statistic, the mean is an easy value
by the map designer to show spatial con for novice map readers to understand.
centrations of these point events. Careful Increments of the mean would be more
selection of range settings is therefore obviously linked to increasing values and
required, but this opens the opportunity their relative significance. This makes the
for crime analysts to create different ker incremental mean threshold approach
nel density estimation hot spot maps of immediately appealing as a method to
crime from the same data. define crime hot spot legend thresholds.
One method that has been suggested to Exhibit 16 shows the results of applying
help standardize the thematic threshold this hot spot threshold approach, based on
settings of kernel density estimation hot quartic kernel density estimation surfaces,
spot maps is the application of incremen to the robbery and residential burglary
tal multiples of the grid cells’ mean data. The incremental mean legend thresh
(Chainey et al., 2002). Calculations for the old approach consistently defines mapped
mean are applied only to grid cells that crime hot spots. In addition, the analysts
have a value of greater than 0 and that are can now observe the global descriptions
within the study area boundary. From this calculated earlier and see that they match
grid cell set, the mean can easily be calcu the crime hot spot mapping output. The
lated within a geographic information sys kernel surface hot spot maps for vehicle
tem (GIS) with grid cell thematic crime, robbery, and residential burglary
thresholds set at— display areas of crime clustering, and
when compared against each other show
■ 0 to mean. relative levels of dispersal. For example,
the hot spot maps of vehicle crime show a
■ Mean to 2 mean. greater degree of dispersion compared to
that of robbery.
■ 2 mean to 3 mean.
■ 3 mean to 4 mean.
Local indicators of spatial
■ 4 mean to 5 mean. association statistics
■ Greater than 5 mean. A more advanced method to help under
stand hot spots of crime is the application
Exhibit 15 shows the application of this of local indicators of spatial association
incremental mean threshold approach for (LISA) statistics (Anselin, 1995; Getis and
29
SPECIAL REPORT / AUG. 05
Ord, 1996). LISA statistics assess the local associations are explored and a signifi
association between data by comparing cance level is related to the confidence in
local averages to global averages. For this the final output. The search distance usual
reason they are useful in adding definition ly is set to three times the grid cell size of
to crime hot spots and placing a spatial the original kernel estimation surface (i.e.,
limit on those areas of highest crime event for data in this chapter, the search dis
concentration (Ratcliffe and McCullagh, tance was set to 90 m). Although levels in
1998). One of the more applied LISA sta significance can often range between 99.9
tistics on crime point events is the Gi* sta percent, 99 percent, and 95 percent, this
tistic (see Ratcliffe and McCullagh, 1998, range has far less effect over eventual hot
for more details). spot areas than parameters set for grid
cell size or bandwidth (Ratcliffe, 1999b).
The Gi* statistic is applied to a grid cell The most common significance level to
output, such as a quartic kernel density apply is 99.9 percent.
estimation map, from which local associa
tions are compared against the global Exhibit 17 shows the result of the Gi*
average. The user typically is required to LISA statistic (mapped as grey areas) for
enter a search distance within which local robbery. This LISA output has been
A B
30
MAPPING CRIME: UNDERSTANDING HOT SPOTS
mapped with the quartic kernel density Therefore, in certain cases, such as hot
estimation surface generated using a spot maps for residential burglary, the
bandwidth of 117 m (K2). The map shows crime hot spots may simply be displaying
the matching between this LISA output locations of high housing density. Rate
and kernel value results above the 3-mean maps can be created to take account of
threshold. This LISA output adds definition the underlying population distribution. For
to the quartic kernel density estimation, residential burglary, maps that consider
indicating the level at which hot spots can the underlying housing distribution can be
be more clearly distinguished from other generated by calculating a rate of burgla
levels of crime concentration. ries per certain number of households
(e.g., burglaries per 1,000 households).
The most reliable housing counts available
Additional elements usually are those sourced from the popula
tion census. The household counts usually
to consider are collected at the census block level,
All the methods described above consider thus allowing the analysts to generate
volume crime patterns and do not consider maps representing the distribution of resi
any underlying population distribution. dential burglary by their rates. However,
Exhibit 16. Quartic kernel density estimation hot spot maps for robbery and residential burglary using the
incremental mean approach
A: Robbery: Bandwidth 207 m (K6)
B: Residential burglary: Bandwidth 161 m (K6)
Greater than 5 mean
4 mean to 5 mean
3 mean to 4 mean
2 mean to 3 mean
Mean to 2 mean
0 to mean
A B
31
SPECIAL REPORT / AUG. 05
although the underlying spatial distribution a rate. In the case of residential burglary,
of housing is considered, the hot spot analysts usually have this with census
maps generated still suffer from the prob tract household counts.
lems described in the section on geo
graphic boundary thematic maps. Suitable denominators for calculating rates
for other types of crime are rarely avail
Also, suitable denominators for calculating able. For robbery, a suitable denominator
rates rarely are available for other crime for calculating rates would be pedestrian
types, such as robbery and vehicle crime. counts for the area; for vehicle crime, a
Analysts often use population counts as suitable denominator would be vehicle
denominators for calculating rates for counts. The analysts would also wish to
these other crime types. This approach, access these counts at a more precise
however, may merely create hot spot map source, such as point sample locations
ping output that misleads by exaggerating from which estimations at unsampled
the crime problem in town centers that locations could be interpolated. This would
have few residents but a concentration of then allow the analysts to be more flexible
crimes such as robbery and vehicle crime. in the area rate calculation by not restrict
Ideally, it is preferable to use denomina ing them to counts aggregated to geo
tors that are directly relevant to the crime graphic boundary areas of varying sizes
type for which the analysts wish to create and shapes.
Exhibit 17. Robbery quartic kernel density estimation surface (117 m (K2) bandwidth) and Gi* LISA statistic
output (grey areas) derived from this robbery surface
Greater than 5 mean
4 mean to 5 mean
3 mean to 4 mean
2 mean to 3 mean
Mean to 2 mean
0 to mean
32
MAPPING CRIME: UNDERSTANDING HOT SPOTS
33
SPECIAL REPORT / AUG. 05
Anselin, L. 1995. “Local Indicators of Spatial Levine, N. 2002. CrimeStat 2.0, A Spatial Statistics
Association—LISA.” Geographical Analysis, 27(2), Program for the Analysis of Crime Incident Locations.
93-115. Houston, TX: Ned Levine & Associates and
Washington: DC: U.S. Department of Justice,
Bailey, T.C., and A.C. Gatrell. 1995. Interactive Spatial National Institute of Justice. http://www.icpsr.umich.
Data Analysis. Harlow, England: Longman Publishers. edu/NACJD/crimestat.html
Chainey, S.P., S. Reid, and N. Stuart. 2002. “When Is McGuire, P.G., and D. Williamson. 1999. “Mapping
a Hotspot a Hotspot? A Procedure for Creating Tools for Management and Accountability.” Paper pre
Statistically Robust Hotspot Maps of Crime.” In sented to the Third International Crime Mapping
Innovations in GIS 9. London: Taylor & Francis. Research Center Conference, Orlando, Florida,
December 11–14, 1999.
Chakravorty, S. 1995. “Identifying Crime Clusters:
The Spatial Principles.” Middle States Geographer, Openshaw, S. 1984. “The Modifiable Areal Unit
28, 53–58. Problem.” Concepts and Techniques in Modern
Geography, 38, 41.
Ebdon, D. 1985. Statistics in Geography. Oxford,
England: Blackwell Publishers. Ratcliffe, J. 1999a. “Spatial Pattern Analysis Machine
Version 1.2 Users Guide.” http://jratcliffe.net/ware
Getis, A., and J.K. Ord. 1996. “Local Spatial /index.htm
Statistics: An Overview.” In P. Longley and
M. Batty (eds.), Spatial Analysis: Modelling in Ratcliffe, J. 1999b. “Hotspot Detective for MapInfo
a GIS Environment. Cambridge, England: Helpfile Version 1.0.” http://jratcliffe.net/ware/
GeoInformation International, 261–277. index.htm
Harries, K. 1999. Mapping Crime: Principle and Ratcliffe, J., and M.J. McCullagh. 1998. “Hotbeds of
Practice. Washington, DC: U.S. Department of Crime and the Search for Spatial Accuracy.” Paper
Justice, National Institute of Justice. presented to the Second Crime Mapping Research
Center Conference: Mapping Out Crime, Arlington,
Jefferis, E. (ed.) 1999. A Multi-Method Exploration Virginia, December 10–12, 1998.
of Crime Hot Spots: A Summary of Findings.
Washington, DC: U.S. Department of Justice, Williamson, D., S. McLafferty, V. Goldsmith, J.
National Institute of Justice, Crime Mapping Mallenkopf, and P. McGuire. 1999. “A Better Method
Research Center. to Smooth Crime Incident Data.” ESRI ArcUser
Magazine, January–March, 1–5. http://www.esri.com/
news/arcuser/0199/crimedata.html
34
Chapter 3. Spatial Analysis Tools for
Identifying Hot Spots
James G. Cameron, Merkle Direct Marketing, and Michael Leitner, Assistant Professor,
Department of Geography and Anthropology, Louisiana State University
35
SPECIAL REPORT / AUG. 05
more precision, but raster data can gener standard classification scheme options.
ally model geographic patterns across con The four most common classification
tinuous space with greater efficiency. methods are natural breaks, quantile,
equal interval, and standard deviation. The
default classification option in ArcView is
Choropleth mapping with
natural breaks. In this approach, class cat
ArcView
egories are identified based on natural
The data for the examples used in this groupings in the data. Arcview uses a sta
chapter consist of burglary incidents from tistical procedure to identify optimal
the city of Boston for 1999. These crime groupings so that values within each
incidents are represented as point loca class are more similar and values
tions and have been aggregated up to the between classes are farther apart.
census tract level to demonstrate different Usually, class breaks are set to corre
types of analyses. spond with relatively large jumps in the
distribution of values. The quantile classi
Choropleth mapping is a common tech fication method assigns an equal number
nique for representing data summarized of areas to each class. Thus, if 160 cen
by statistical or administrative areas and is sus tracts and four classes exist, then
particularly useful for obtaining a general each class grouping contains 40 census
picture of the overall spatial distribution of tracts, with the lowest 40 in the first
crime. Choropleth maps are primarily used group and the highest 40 in the last
in crime mapping applications to show the group. The equal interval approach divides
relative density or amount of crime occur the distribution of values so that the
ring in different areas. This is done by range of values within each class is iden
assigning graduated colors or varying tical. In other words, the difference
shades across the range of value cate between the highest and lowest value is
gories, going from lowest to highest. the same for each class grouping. With
the standard deviation approach, class
As a first step, examine the general statis breaks are defined by standard deviation
tical distribution of the data. This can be al distances from the mean.
done by plotting a scattergram or his
togram of the data and employing basic Exhibit 1 illustrates the distribution of bur
descriptive statistics such as the mean, glary rates (per 100,000 population) for
median, range, and standard deviation to Boston census tracts in 1999. In this
explore its distribution. Knowing the distri example, burglary rates are plotted along
bution of the data will help the analyst fig the horizontal axis and the number of cen
ure out the best classification scheme to sus tracts having burglary rates within
use for creating class groupings with simi each category of values is shown on the
lar values. Ideally, the classification vertical axis. With this data, analysts can
scheme used should minimize the inner experiment with different classification
class variance as much as possible and schemes and see what happens with a
maximize the variance between classes as distribution that is positively skewed (as
much as possible. In other words, the most crime data are).
range of values within each class should
be more similar to one another, while the Exhibit 2 is a map of burglary rates for
difference in values between classes Boston based on natural break classifica
should be as far apart as possible. tions. This method places extreme outliers
in a category of their own and emphasizes
In addition to user-defined ranges for class the differences between census tracts
categories, ArcView provides several with the highest burglary rates and those
36
MAPPING CRIME: UNDERSTANDING HOT SPOTS
with the lowest rates. With this data, tracts are in the top 20 percent for burglary
using natural breaks appears to be a good rates, then the analyst would apply the
method for highlighting hot spot clusters quantile method of classification and
and outliers in the data. select five classes. The disadvantage in
using this approach, especially with posi
Using natural breaks to classify data tends tively skewed data, is that differences
to be useful when mapping data values between classes may be exaggerated
that are not evenly distributed, since it since a few widely ranging adjacent values
places value clusters in the same class. may be grouped together in one class
The disadvantage of using this approach is while an equal number of relatively homo
that it is often difficult to make compar geneous values may be grouped together
isons between maps since the classifica in another class.
tion scheme used is unique to each data
set. Exhibit 4 is a map of burglary rates for
Boston based on equal interval classifica
Exhibit 3 is a map of burglary rates for tions. Using this method, the range of val
Boston based on quantile classifications. ues is the same for each class. This
This method arranges all observations approach is useful when data are normally
from low to high and assigns equal num distributed and the analyst is interested in
bers of observations to each classification emphasizing observations around the
category. This approach is useful when the mean. The disadvantage in using this
data values are fairly evenly distributed or method with positively skewed data is that
when a need exists to highlight a propor most observations will be assigned to the
tion of the observations. For example, if lower value categories and only a few
the objective is to show which census observations will be assigned to the higher
80
60
40
20
0
0.0
50
10 .0
15 .0
20 .0
25 .0
30 .0
35 .0
40 .0
45 .0
50 .0
55 .0
60 .0
65 .0
70 .0
75 .0
80 .0
85 .0
90 .0
95 .0
0
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
00
.0
Rate
37
SPECIAL REPORT / AUG. 05
value categories. However, when the approach is that the map does not show
objective is to emphasize outliers with the actual values in each class, only how
high crime rates or hot spot clusters, this far each class category is from the mean.
could be a useful approach to classifying
and mapping the data. To know which classification scheme to
use, an analyst needs to know how the
Exhibit 5 is a map of burglary rates for data are distributed and the mapping
Boston based on standard deviation classi objective. If the data are unevenly distrib
fications. With this method, each class is uted, with large jumps in values or
defined by its standard deviational dis extreme outliers, and the analyst wants to
tance from the mean. Again, with positive emphasize clusters of observations that
ly skewed distributions, outliers and hot house similar values, use the natural
spot clusters can easily be isolated and breaks classification approach. If the data
identified. The disadvantage in using this are evenly distributed and the analyst
38
MAPPING CRIME: UNDERSTANDING HOT SPOTS
39
SPECIAL REPORT / AUG. 05
40
MAPPING CRIME: UNDERSTANDING HOT SPOTS
Concentrations of crime can sometimes analyst can more clearly identify hot spots
be seen simply by mapping the location of and high-crime clusters.
individual crime incidents. In areas with a
large amount of criminal activity, however, To create a density map with Spatial
identifying which locations have higher Analyst, first calculate a density grid from
concentrations than others may be diffi the ArcView point theme containing loca
cult. To alleviate this problem, Spatial tions of individual crime incidents. Each
Analyst can create density maps, which cell in the grid is assigned a specific size
measure the number of crimes occurring and has a circular search area applied to it,
within a uniform areal unit, such as the both of which are defined by the user. Cell
number of crimes per square mile. This size determines how coarse or smooth
makes it easier to see the overall distribu the patterns will appear. The smaller the
tion of crime across space so that the cell size, the smoother the density surface
41
SPECIAL REPORT / AUG. 05
will be. However, very small cells also value. Kernel density calculations, howev
require considerably more processing er, give more weight to points near the
time and computer storage space. The center of the search area than to those
size of the search radius will determine near the perimeter. This results in a
how generalized the density patterns will smoother distribution of density values
appear. A smaller search radius will show across cells.
more local variation, while a larger search
radius will show broader patterns in the Thus, the first step is to create a density
data. Specifying either simple or kernel surface map as a raster layer using the
density calculations also is possible. Spatial Analyst menu interface in ArcView.
Simple density calculations total the num In this way, each grid cell in the raster
ber of crime incidents that fall within the layer will have a density value assigned to
search area parameter for each cell and it based on the number of crime incidents
then divide this number by the search within the specified search radius of the
area size to get each grid cell’s density cell. The map in exhibit 6 represents the
42
MAPPING CRIME: UNDERSTANDING HOT SPOTS
43
SPECIAL REPORT / AUG. 05
1 0 1 2 Miles
44
MAPPING CRIME: UNDERSTANDING HOT SPOTS
1999. The mean center for burglary inci of dispersion. They represent a first step in
dents is slightly northwest of the mean exploratory spatial data analysis, providing
center for homicides. This indicates that a more rigorous feel for the overall distribu
homicides are slightly more likely to occur tion of crime. In addition to comparing dif
in the southeastern part of Boston, while ferent types of crime, these tools are
burglaries are somewhat more likely to useful for comparing different groups. For
occur in the northwest part of the city. example, they may be used to compare
While the standard deviational ellipses for the spatial distribution of gang and non-
both burglaries and homicides fall essen gang-related homicides. Centrographic
tially along a north-south axis, burglaries statistics also are useful for comparing
show a much wider dispersion along the spatial shifts, which may occur for the
east-west axis than homicides. This indi same crime across different time periods,
cates that homicides tend to occur in a rel such as month-to-month comparisons.
atively circumscribed area compared with
burglaries. Centrographic statistics, which are used to
describe global spatial properties and pat
Centrographic statistics can be very useful terns in the data, are known as first-order
tools for examining general spatial patterns statistics. Second-order statistics describe
of central tendency, spread, and direction local or neighborhood patterns within the
1 0 1 2 Miles
45
SPECIAL REPORT / AUG. 05
46
MAPPING CRIME: UNDERSTANDING HOT SPOTS
than expected under randomness are a clear pattern in the north and western
included at each step. Thus, the criterion parts of the city.
used for clustering points together is the
user-specified lower confidence interval K-means clustering is a partitioning tech
for a random distribution. nique for grouping crime incidents into a
specific number (K) of clusters specified
Exhibit 10 shows a map of burglary hot by the user. The default number of clusters
spots for Boston in 1999 (n=3,851) identi assigned by CrimeStat is five. The routine
fied by the hierarchical clustering routine in tries to find the best center (seed location)
CrimeStat. A one-tailed probability level of for each K number of clusters specified
.05 was selected and each cluster was and then assigns each crime incident to
required to contain a minimum of 10 bur the client seed location.
glary incident locations. CrimeStat
returned 56 first-order clusters and 4 sec- Exhibit 11 shows a map of burglary hot
ond-order clusters. The locations of the spots for Boston in 1999 identified by the
second-order clusters coincide with two of K-means clustering routine. The map
the three hot spot locations identified on shows three relatively concentrated clus
the density surface map in exhibit 6. While ters and two that are more dispersed. The
the first-order clusters tend to be more more concentrated clusters, especially the
scattered, the second-order clusters show one located in the western arm of the city,
1 0 1 2 Miles
47
SPECIAL REPORT / AUG. 05
are closer to the hot spot locations identi technique prone to misuse and the results
fied by other methods. This example high difficult to interpret.
lights one of the disadvantages to using
the K-means clustering procedure. A local Moran procedure approach is
Whether the clusters make any sense will based on the concept of local indicators of
depend on how carefully the user has spatial association (LISA), in which each
selected the criteria to be used in group observation is assigned a score based on
ing crime locations. Choosing too many the extent to which significant clustering
clusters may result in the identification of of similar values around that observation
patterns that do not really exist; choosing exists. In this case, the score assigned to
too few may diminish significant differ each observation is the Moran’s I statistic
ences that actually do exist between for spatial autocorrelation. Locations with
neighborhoods. Thus, while the K-means high Moran’s I scores have intensity values
procedure provides a great deal of user higher than the average value intensity for
control, this same flexibility can make the all other observations, while locations with
low Moran’s I scores have intensity values
1 0 1 2 Miles
48
MAPPING CRIME: UNDERSTANDING HOT SPOTS
lower than the average value intensity for defined either as adjacent locations or
all other observations. The local Moran according to distance-based weights.
procedure thus provides information about Adjacency specifications, in which adja
the degree of value similarity between cent locations are given a weight of 1 and
near neighbors. nonadjacent locations are given a weight
of 0, are useful for defining near neighbor
Two conditions must be met in order to hoods. Distance specifications, in which
calculate Moran’s I. First, each observation weights are assigned that decrease with
must have a value attached to it. This distance between locations, are useful for
means that an intensity variable must be defining spatial interaction across larger
specified in CrimeStat’s primary file menu. areas. The default in CrimeStat is an adja
In this case, the point location is the cen cency specification.
troid of each census tract in Boston, and
the intensity variable is the burglary rate Exhibit 12 maps the distribution of stan
(number of burglaries per 100,000 people). dardized local Moran Z-values for the spa
Second, the neighborhood must be tial autocorrelation of burglary rates across
49
SPECIAL REPORT / AUG. 05
Boston census tracts in 1999. The pattern In addition to identifying clusters with high
shows a concentration of high values concentrations of crime, the local Moran
toward the center of the city (shaded in procedure is the only clustering tool in
darker red). Also, the northwestern part of CrimeStat that also can identify outliers
the city contains an outlier (shaded in dark (locations that are dissimilar to neighboring
er blue). This census tract has a much locations) and clusters with low concentra
higher burglary rate than its neighbors and tions of crime. The disadvantage of using
therefore has a large negative I score, indi the local Moran procedure and mapping
cating dissimilarity (i.e., negative spatial the results is that it requires the data to be
autocorrelation). This example highlights summarized into zones in order to produce
the usefulness of the local Moran statistic the necessary intensity values and then
for identifying locations, which are dissimi linked with comparable zonal boundaries
lar from their neighbors. In fact, it is the in a GIS for mapping.
only statistic available in CrimeStat that
can be used to identify spatial dissimilarity.
GeoDa
The three clustering techniques available in Similar to CrimeStat, GeoDa is a Windows-
CrimeStat have both advantages and dis based application that is practically a rein
advantages. The advantages to using the vention of the original SpaceStat™
hierarchical clustering technique include package and its ArcView extension,
the ability to identify small geographic DynESDA. GeoDa is a freestanding soft
areas that may have higher concentrations ware that does not require a specific GIS
of crime activity. This technique also is use and runs under any Microsoft Windows
ful for identifying the linkages between operating system. The current version can
several small clusters into second-order “only” input (output) Environmental
and higher clusters. The limitations of hier Systems Research Institute (ESRI) shape
archical clustering include a certain arbi files. It can analyze objects characterized
trariness based on what constitutes a by their location in space as either points
meaningful cluster size. Also, the size of (point coordinates) or polygons (polygon
the grouping area is dependent on the boundary coordinates).
sample size. This means that crime distri
butions with many incident locations (e.g., The program can be downloaded free
burglaries) will have smaller grouping from the University of Illinois at Urbana-
areas, while crime distributions with few Champagne, Spatial Analysis Laboratory
incident locations (e.g., homicides) will Web site (http://sal.uiuc.edu/default.php).
have larger grouping areas. This Web site also includes tutorials and
other useful information related to the
The advantages of using the K-means software (Anselin et al., forthcoming;
clustering routine include the ability of the Anselin, 2004a; Anselin, 2004b).
user to specify the number of spatial clus
ters in the data. This feature provides a GeoDa functions are executed through
great deal of control and flexibility for the menu items or directly by clicking toolbar
user and can be used as an exploratory buttons and can be classified into six
tool to identify different sizes and numbers categories:
of crime clusters. Yet, this same flexibility
■ Spatial data manipulation and utilities:
also can result in a certain arbitrariness,
which may make the results difficult to data input, output, and conversion.
interpret meaningfully.
■ Data transformation: variable transfor
mation and creation of new variables.
50
MAPPING CRIME: UNDERSTANDING HOT SPOTS
51
SPECIAL REPORT / AUG. 05
After opening the project, a Windows dia correlation analysis), Regress (spatial
log requests the file name of a shape file regression), and Options (application-
and the Key variable. The Key variable specific options). The toolbar consists of
uniquely identifies each observation. It is six groups of icons, from left to right:
typically an integer value such as a Federal project open and close; spatial weights
Information Processing Standards (FIPS) construction; edit functions; exploratory
code, a census tract number, or a unique data analysis; spatial autocorrelation; and
ID number. Next, a map window is rate smoothing and mapping. Some func
opened, showing the base map for the tions can be executed by either clicking
analyses. Now the complete menu and all on one of the toolbar buttons or by
toolbars are active, as shown in exhibit 14. selecting the matching item in the menu.
The full menu bar contains 12 items. Four For initial exploratory analyses, box plots
are standard Windows menus: File (open (exhibit 15), box plot maps (exhibit 16), and
and close files), View (select which tool percentile maps (exhibit 17) can be used
bars to show), Window (select or to describe the overall distribution of crime
rearrange windows) and Help (not yet and to identify outliers. With the base map
implemented). Specific to GeoDa are Edit shown on the screen in exhibit 14, a box
(manipulate map windows and layers), plot can be drawn by using Explore ➞ Box
Tools (spatial data manipulation), Table Plot and selecting the variable to be
(data table manipulation), Map (choropleth mapped (burglary rates) from the
mapping and map smoothing), Explore Windows dialog. This will display a box
(statistical graphics), Space (spatial auto- plot with the census tract burglary rates in
52
MAPPING CRIME: UNDERSTANDING HOT SPOTS
Boston for 1999 (shown as blue dots) sort standard multiple of 1.5, the box plot iden
ed from the lowest (bottom of the plot) to tifies 12 census tracts with very high val
the highest (top of the plot) value (exhibit ues of burglary rates (mild, upper outliers).
15). The lowest 25 percent of all burglary There are no mild, lower outliers (very low
rates make up the first quartile, followed values of burglary rates).
by the next 25 percent of all rates that
make up the second quartile, and so on. The corresponding box plot map can be
The second and the third quartiles are sep drawn by using Map ➞ Box Map ➞
arated by the median (the red bar in the Hinge=1.5 and selecting the variable to be
middle), which is the middle value of the mapped (burglary rates) from the Windows
sorted burglary rates. dialog. In the box plot map, the 12 census
tracts with very high values of burglary
In this box plot, an observation is classi rates are shown in dark red with a pattern
fied as an outlier when it lies more than a that shows a concentration toward the
given multiple of the interquartile range center of the city, along with an additional
(IQR) (the difference in value between the outlier in the northwestern part of the city.
75 percent and 25 percent observation)
above or below respectively for the 75th Exhibit 17 contains a percentile map of
percentile and 25th percentile. The stan 1999 burglary rates at the census tract
dard multiples used are 1.5 (mild outlier) level in Boston. This is accomplished by
and 3 (extreme outlier) times the IQR. In using Map ➞ Percentile and selecting the
the box plot, the IQR is shown with the variable to be mapped (burglary rates)
dark red band around the median. Using a from the Windows dialog. Two census
tracts qualify as outliers using the upper
99th percentile criterion. Compare the
location of these outliers with those iden
Exhibit 15. Box plot of burglary rates
tified by the standard deviational choro
pleth map in exhibit 5. The additional
census tract in the standard deviational
map is due to a 95-percent cut-off point
for two standard deviations as opposed to
the 99-percent cut-off point used for the
percentile map.
53
SPECIAL REPORT / AUG. 05
the attribute values are randomly spread which neighbors are defined as sharing a
over space. common border. In contrast, queen conti
guity defines neighbors that share com
To calculate the Moran’s I indicator of spa mon borders and/or common corners.
tial autocorrelation, the analyst must first
construct a spatial weights matrix. Spatial Creating a rook-based contiguity matrix is
weights can be defined either by contigui accomplished by either selecting Tools ➞
ty (where neighbors are identified accord Create ➞ Weights from the menu or by
ing to boundary relationships, in which clicking on the matching toolbar button.
1 = adjacent and 0 = nonadjacent) or by This opens a Windows dialog, in which the
distance (where neighbors are identified name of the input file (a polygon shape
according to a distance-based metric file), the name for the weights file, an ID
around centroid locations which decreases variable for the weights file (a Key variable
with distance between locations). In the from the input file that uniquely identifies
examples presented here, spatial weights each observation), and the type of spatial
are calculated based on rook contiguity, in weights matrix (rook contiguity) need to
First quartile
Second quartile
Third quartile
Fourth quartile
Upper outliers
54
MAPPING CRIME: UNDERSTANDING HOT SPOTS
be specified (see exhibit 18). The resulting example presented here, a spatially lagged
spatial weights file is stored with the file variable for 1999 burglary rates in Boston
extension GAL. It is a simple text file that will be created. This requires first loading a
can be edited with any text editor or word base map and opening a corresponding
processor. spatial weights file by selecting Tools ➞
Weights ➞ Open from the menu or by
Once the spatial weights matrix has been clicking on the matching toolbar button.
created, a spatially lagged variable can be Next, clicking on the Table toolbar button
computed. A spatially lagged variable (or and right clicking to select Field
spatial lag) is derived as the spatially Calculation from the menu will open a
weighted average of its neighboring values. Windows dialog. In this Windows dialog,
A spatial lag is an essential part of the com select the Lag Operations tab and enter
putation of spatial autocorrelation tests and the information as shown in exhibit 19. The
the specification of spatial regression mod name for the new spatially lagged variable
els. The spatial lag computation is part of (W_BURGRT) is entered into the left most
the Table functionality in GeoDa. In the text box. The spatial weights file (same as
1–10%
>10–50%
>50–90%
>90–99%
>99%
55
SPECIAL REPORT / AUG. 05
above) is selected for the text box in the 1622.72, (6) 1952.04, (7) 2434.78, and (8)
middle and the variable to be lagged (1999 2636.20. The spatially lagged variable is
burglary rates in Boston) is selected for simply the average of these eight burglary
the right most text box (exhibit 19). This rates, namely 1522.58.
will add the new spatially lagged variable
(W_BURGRT) as a new column to the end Positive spatial autocorrelation would indi
of the Table in GeoDa. cate cases where the original and the spa
tially lagged variable have similar values.
To briefly illustrate how GeoDa calculates This would point to clustering of high val
the spatially lagged variable for the 1999 ues (hot spots), low values (cold spots),
burglary rates in Boston, consider the cen and medium values. On the other hand,
sus tract in the center of the city previous contrasting values between the original
ly identified as one of two outliers using and its spatial lag indicate negative spatial
the upper 99th percentile criterion (exhibit autocorrelation, or the presence of spatial
17). This census tract has a burglary rate of outliers. Locations of negative spatial asso
9578.54, which is the highest rate among ciation may indicate areas of high crime
all census tracts in Boston in 1999. Eight surrounded by low-crime neighbors (similar
rook neighbors with the following burglary to the example above), or low crime sur
rates surround this census tract: (1) rounded by high-crime neighbors.
738.92, (2) 886.77, (3) 937.77, (4) 971.46, (5)
56
MAPPING CRIME: UNDERSTANDING HOT SPOTS
57
SPECIAL REPORT / AUG. 05
Quadrant II Quadrant I
58
MAPPING CRIME: UNDERSTANDING HOT SPOTS
surrounded by neighbors whose values The local Moran map in exhibit 22 shows
are also below average (low-low). The that a significant “local” cluster of high
other two categories imply negative spa burglary rates (census tracts in red) is
tial association: (Quadrant IV) where a present in the central part of the city. The
location with an above-average value is significance of this cluster is 0.01 (census
surrounded by neighbors with below- tracts in dark green) or 0.05 (census tracts
average values (high-low), or (Quadrant II) in light green) as shown in the Moran sig
where a location with a below-average nificance map in exhibit 23. The census
value is surrounded by neighbors with tracts in this “local” cluster would fall into
above-average values (low-high) (see the first quadrant of the Moran scatter plot
exhibit 21). The observations from each (exhibit 21) where locations with an above-
quadrant could easily be selected and average value are surrounded by neigh
visualized using a GIS. The resulting Moran bors whose values are also above average
scatter plot map would provide a visual (high-high). The local Moran map also iden
representation of spatial clustering (hot tifies one larger significant pocket of low
spots and cold spots) and the location of burglary rates in the southwestern and
spatial outliers. Unfortunately, Moran scat two smaller significant pockets in the
ter plot maps cannot be directly compiled western and northern parts of Boston.
within GeoDa. These census tracts with a below-average
value have neighbors whose values are
With larger data sets, the assessment of also below average (third quadrant in
global spatial autocorrelation needs to be exhibit 21). Finally, two individual census
supplemented by local measures of spa tracts that can be identified as spatial out
tial dependence as well. According to liers are located adjacent in the east and
Anselin (1995), local indicators of spatial southeast of the “local” cluster of high
association (LISAs) achieve two objec burglary rates. Both census tracts have
tives: (1) they can be used to identify sig below-average values and neighbors
nificant patterns of spatial association whose values are above average (second
around individual locations, such as hot quadrant in exhibit 21).
spots or spatial outliers; and (2) they can
be used to assess the extent to which the As a Windows-based application, GeoDa is
global pattern of spatial association is much more user friendly than the original
spread uniformly throughout the data or SpaceStat package and its ArcView
whether there are significant types of Extension, DynESDA. It provides some
locations affecting the computation of useful tools for doing exploratory spatial
Moran’s I. data analysis, including dynamically linked
windows and data brushing. As a stand
Measures of local spatial autocorrelation alone program it has a variety of options
can be visualized by LISA local Moran for data manipulation and transformation,
maps and Moran significance maps. Local mapping, exploratory spatial data analysis,
Moran LISA statistics can be computed by spatial weights construction, descriptive
selecting Space ➞ Univariate LISA from statistics, spatial autocorrelation statistics,
the menu or by clicking on the matching OLS regression with spatial diagnostics,
toolbar button. In the first Windows dialog, and spatial regression modeling. To learn
choose the original variable (BURGRT) as more about GeoDa, consider attending
the first variable (Y). In the second dialog, one of Luc Anselin’s ICPSR training cours
select a spatial weights matrix, and in the es or GeoDa workshops. For more infor
third dialog, check the boxes next to “The mation visit Luc Anselin’s homepage
Significance Map” and “The Cluster Map.” (http://agec144.agecon.uiuc.edu/
users/anselin/).
59
SPECIAL REPORT / AUG. 05
Summary References
As seen throughout this demonstration Anselin, Luc. 1995. “Local Indicators of Spatial
Association—LISA” Geographical Analysis 27:93–115.
chapter, each of these packages has its
own particular strengths and weaknesses Anselin, Luc, Ibnu Syabri, and Youngihn Kho. (forth
as well as unique and overlapping analyti coming). “GeoDa: An Introduction to Spatial Data
cal applications. Exhibits 24 through 27 Analysis.” Geographical Analysis.
summarize the strengths, weaknesses,
Anselin, Luc. 2004a. GeoDa 0.9.5-I Release Notes.
and applications for each of the four hot
Spatial Analysis Laboratory and Center for Spatially
spot software packages examined. Integrated Social Sciences (CSISS), Department of
Agriculture and Consumer Economics, University of
Illinois, Urbana-Champagne.
Not significant
High-high
Low-low
Low-high
60
MAPPING CRIME: UNDERSTANDING HOT SPOTS
Exhibit 23. LISA local Moran significance map for burglary rates
Not significant
p = 0.05
p = 0.01
61
SPECIAL REPORT / AUG. 05
Strengths • Useful for visualizing aggregated crime patterns in and across defined boundary areas.
• Useful for obtaining a general picture of the overall spatial distribution of crime across areas.
Weaknesses • Attention often is focused on the relative size of an area so that large areas tend to dominate the map.
• The actual distribution of crime incidents may be difficult to identify since incidents of crime usually are
not evenly distributed throughout a given boundary area.
Applications • Used with vector data that represents geographic features according to predefined political or
administrative boundaries.
• Often used with crime rates to standardize for population.
Strengths • Useful for visualizing geographic patterns of crime across a continuous surface.
• Density surface maps are based on the distribution of individual crime incidents and are therefore able to
show with greater detail how crime is spatially distributed.
Weaknesses • Because the values in the areas between crime incident locations are estimated interpolations, the
validity of the resulting density patterns is highly dependent on the quantity and relative distribution of
the available data points.
• The interpolation process used to create a density surface generalizes and smooths the data so that
extreme high and low values may disappear.
Applications • Used with raster data that represents geographic features as a grid of cells on a continuous surface.
• Individual crime incidents are used to interpolate the relative density of crime across a continuous surface.
62
MAPPING CRIME: UNDERSTANDING HOT SPOTS
Strengths • Useful for doing exploratory spatial data analyses with crime incident locations (point pattern data).
• Provides a number of statistical routines that vary from descriptive centrographic applications to more
sophisticated nearest neighbor and spatial autocorrelation functions.
Weaknesses • While a number of useful exploratory features are provided, no applications are available for modeling
correlates or determinants of crime.
Applications • Used with point data that represents crime incidents as point locations.
• Statistical routines include: spatial distribution or centrographic statistics, distance statistics for nearest
neighbor analyses, hot spot and clustering routines, and kernal density interpolation functions.
Strengths • Useful for doing exploratory and confirmatory spatial data analyses with either points (point coordinates)
or aggregated crime data (area patterns).
• Provides statistical routines with dynamically linked graphing, data brushing, and mapping applications for
doing interactive exploratory data analyses as a stand-alone program.
• Provides confirmatory spatial analyses tools for modeling correlates of crime using a variety of spatial
regression applications.
Weaknesses • Only uses ESRI shapefiles as input and output coverages.
• Most statistical applications are for the analysis of area patterns; the application of point patterns
is limited.
Applications • Mostly used with areal data in which crime incidents are aggregated according to defined boundary areas
(usually standardized as rates).
• Analytical applications provided with GeoDa include (1) applications for the input, manipulation, and
transformation of data; (2) applications for the construction of spatial weights; (3) applications for
exploratory spatial data analysis (ESDA) using descriptive statistics, global and local measures of spatial
autocorrelation, and dynamically linked graphing, data brushing, and mapping capabilities; and (4)
applications for confirmatory spatial data analysis (CSDA) using OLS regression modeling with spatial
diagnostics, spatial regression residual mapping, and a variety of spatial regression modeling options.
63
Chapter 4. Conclusion
Ronald E. Wilson, Inter-University Consortium for Political and Social Research, Mapping and
Analysis for Public Safety Program, National Institute of Justice
Approaching hot spot Gesler and Albert (2000) point out that
with availability of geographic information
analysis systems (GIS) and other spatial data analy
As seen throughout the previous chapters, sis software an analyst might, and often
conducting hot spot analysis depends on does, side-step an important element of
several factors, varying from theory selec analysis. That element in hot spot analysis
tion, to type of crime being analyzed, to is the understanding of the underlying spa
the display of output results. Carrying out tial and social processes contributing to
analysis must have a logical and systemat the presence or absence of criminal activi
ic approach. Analysis cannot proceed arbi ty in an environment. Places have unique
trarily, depending solely on human intuition characteristics that affect the distribution
and visual inspection for identifying hot of criminal activity over space (i.e., spatial
spots. Nor can analysts depend solely on processes) and temporal distributions.
the software algorithms to provide mean Social and spatial processes are nonsta-
ingful output. Such activities may result in tionary—criminal activity is affected by the
a subjectively perceived hot spot that may variation of demographics, the built envi
or may not actually be a cluster of criminal ronment, economics, and other social
activity. aspects that change across space (Haining,
2003). This leads to a premise that has
Visually identifying a hot spot can inappro long been championed in geography—
priately affect input parameters, such as place matters.
the size of the search radius, because, for
example, an analyst might be looking at Place matters because every location has
too many observations at one time. As a a different environment, such as levels of
result, the presence of clusters could be socioeconomic status, laws governing
exaggerated or could remain undetected if space management, influence of informal
too few observations are used. Converse social controls, condition of surroundings,
ly, a statistical approach can only examine and arrangements of the buildings. A num
the observations that are selected without ber of confounders may also contribute to
considering environmental factors, and the clustering of criminal activity. As a
thus requires human interpretation to result, the spatial arrangement of crime
make sense of the results of analysis. incidents will be different from place to
Analysts should use statistical tools in place and will not lend itself to a uniform
conjunction with human understanding approach to hot spot analysis. Haining
of an area to give the analysis a solid foun (2003) and Gesler and Albert (2000) note
dation for stating where hot spots actually that observations change from place to
are occurring. They must scientifically place, which is an indication that the
determine that a hot spot is indeed an underlying spatial and social processes are
actual cluster of events that are not occur different, and thus the method used for
ring at random. carrying out analysis will need to be
65
SPECIAL REPORT / AUG. 05
adjusted to detect those processes (i.e., were trying to prove or disprove the
hot spots). Further, the fact that people hypothesis that violent crime was clus
and their environment are not evenly dis tered in and around those communities.
tributed across space also requires adjust
ment in the analysis approach. Spatial dependence
Criminal activity is not the same in every
Elements to consider place, as chapter 1 points out in the first
sentence. Therefore, to detect the pres
ence of a hot spot, the strength of spatial
Analysis focus
relationships between incidents must be
Gesler and Albert (2000) point out that established. This strength is known as spa
two different goals apply when it comes to tial dependence and is based on Waldo
cluster, or hot spot, analysis. These Tobler’s “First Law of Geography,” where
approaches are general and focused analy by everything is related to everything else,
sis. With general analysis, an examination but closer things are more related. Spatial
is done to discover whether phenomenon dependence must be measured to estab
is clustered within the study area (i.e., an lish a distance relationship limit between
analyst is looking for the presence of hot crime incidents where an incident is relat
spots). For example, in a National Institute ed to a set of nearby incidents. This
of Justice (NIJ) study of homicides in dependency will likely change over the
Philadelphia, Pennsylvania (Zahn et al., study area as the environmental factors
2003), a hot spot analysis was performed change (Haining, 2003). This is known as a
over the entire city to identify places with spatial process, and when it changes
a clustering of homicides. Subsequently, across space it will be nonstationary. An
those hot spots were examined in con analyst will have to determine the thresh
junction with the presence of religious old distance of influence between inci
institutions and what influence they might dents to guide the selection of bandwidth
have on homicide. In this case, the type and size when analyzing for clusters.
authors were trying to identify places with This cannot be measured just by visually
concentrations of homicide and then ask, determining what that threshold distance
“What is it about this place that might be might be because it is too subjective and
causing so many homicides?” thus must be done with a scientific
approach.
With focused analysis, the purpose is to
identify phenomena that are clustered Currently, most hot spot analysis software
around a particular place of interest within only analyzes points in space without fac
a study area. For example, in another toring in environmental variables.
study (Wilson and Everett, 2004), a CrimeStat® and SaTScan™ are two of the
focused analysis was done because the available exceptions whereby a minimal
primary hypothesis was, “Is there more set of environmental factors can be
violent crime activity clustering in, and included in the analysis (Levine, 2002 and
around, public housing communities?” Kulldorff, 2004). Consulting the literature
The authors selected specific locations for theory or empirical evidence for the
(public housing communities) within a spatial dependence of criminal activity can
study area to determine if there were provide a scientific base for selecting
clusters of violent crime at specific input parameters. Previous research will
places, not the entire city. In this case, likely have considered the spatial relation
the authors already knew that there was ships of criminal activity in combination
“something about those places” and with demographic, socioeconomic, and
66
MAPPING CRIME: UNDERSTANDING HOT SPOTS
environmental variables and their distribu ever, further dilutes the significance of the
tion over space. measure because not only are environ
mental or demographic factors not consid
For example, it has been demonstrated ered; the distribution of incidents
with empirical findings (Roncek, Bell, and themselves are not considered. These for
Francik, 1996 and Holtzman, 2004) that mulas simply state that given a number of
one-eighth of a mile is a reasonable dis incidents, clustering would occur based on
tance to measure the diffusion of crime in the amount of area in which they are pres
places that have strong neighborhood ent. Should a bulk of the observations be
structures that are self-contained, such as located in a small portion of the study area
Philadelphia, Pennsylvania, or Chicago, (i.e., they are concentrated and are not
Illinois. Residents can often meet their evenly distributed), then the formula might
needs without leaving these neighbor give a value that is too large and detect all
hoods. However, places like Las Vegas, of those observations as a cluster. The
Nevada, or Fort Lauderdale, Florida, have area with the bulk of the observations
completely different spatial structures could be used for analysis, but this returns
because the neighborhoods are spread out the subjective selecting of parameters for
and require residents to drive everywhere analysis because some delimiting bound
to get anything. One-eighth of a mile for ary must be specified.
measuring the clustering of crime might
not make sense in these neighborhoods
because this distance would likely be too Crime type
small for measuring clustering and would Consideration of crime type plays an
likely not yield significant results. Even if important role as the spatial distribution of
the same crime type is examined in these incidents varies in and between violent
places, the structure of the spatial relation and property crime types. Different types
ships of the observations will be different of crime have different spatial relation
because the environments are different. ships, dependencies, structures, and distri
butions, which are the result of different
Absent of theory or empirical evidence, social and spatial processes over an area.
statistical techniques can be used to find These processes are affected by, and
spatial dependence based on the distribu affect, other social and spatial processes
tion of points, such as variograms or near occurring at nearby places. If all criminal
est neighbor indexes. These formulas activity was evenly distributed and was the
measure the spatial distribution of points result of the same social and spatial
against a set of randomly distributed processes, then hot spot analysis would
points to determine if clustering is by not be needed. If crime is analyzed in this
chance or not. Using formulas such as fashion, a blanket statement is being made
these, however, are only statistical exercis about criminal activity that assumes each
es that assume that an area has no physi type is caused by the same set of factors.
cal or social barriers, the shape of the
study area has no relevancy, spatial and For example, in an NIJ study of public
social processes are stationary (they do housing and violent crime (Wilson and
not change over space), and environmen Everett, 2004) an analysis was first con
tal factors are not considered to have influ ducted with all crime types classified as
ence. If these tools are not available, violent crime. The result was the impres
formulas exist for determining spatial sion that violent crime was clustered
dependency based on the presumed den mostly in public housing communities.
sity of a study area, which is the number However, when broken down into individ
of observations divided by area. This, how ual crime types, the authors found that
67
SPECIAL REPORT / AUG. 05
murder, assault, rape, robbery, and cross-section of time, a major event might
weapons violations were not clustered in have occurred or a crime-prone establish
public housing communities but assault on ment might have been introduced or
females and domestic violence were. The removed that had a sudden or lagged
identification of which crime type was impact on the cumulative amount of
actually clustered in the communities crime. Consequently, the temporal rela
might allow police to address problems tionship does not correspond with the spa
focused on violence against women rather tial relationship.
than trying to provide solutions and
resources for all other violent crime types. To counter this, select time periods that
synchronize the temporal dependency
Exploration of crime type distributions is with the spatial dependencies under
fundamentally important to determine analysis, such as separation of times of
which type of hot spot method should be day, seasons, events, policy implementa
used. Analyzing crime in general, such as tions, or the introduction or removal of
violent, drug, or property crime could yield establishments. A series of hot spot maps
misleading or incorrect results. Breaking may need to be generated instead of just
down crime types from general categories one. An incorrect temporal measurement,
can allow for focused analysis or meaning even if at a location that has a true cluster
ful results. ing of crime, may lead to false negatives
or positives.
Time intervals
Barriers
Time further complicates the process
of hot spot analysis because varying Physical and social barriers between
intervals can affect cluster detection of places must be factored into analysis,
criminal activity. Certain crimes occur at since they will have an effect on the direc
particular times of day, months (seasons), tional significance of spatial relationships.
or over special events. For example, These barriers have a separation effect
assaults may occur more frequently at that can drastically change whether a hot
night in areas with nightclubs. Since spot exists and the shape and size of that
these clubs are not open in the day, hot spot. Many algorithms for determining
crimes occurring during daylight hours hot spots currently do not have the capa
might lack a spatial relationship that is bility to detect a barrier such as a river or a
present during nighttime hours of opera shopping area that separates two places.
tion. This is especially true if the analysis
is trying to link an increase in crime to Natural and manmade physical barriers
the presence of the establishment. can impede spatial relationships and cre
ate the illusion of hot spots where it is
Depending on how many incidents are unlikely that crime incidents are related.
within a given period of time, the accumu For example, rivers, regardless of size,
lation of crime incidents over too long of a provide a severe break in spatial relation
time interval can indicate the presence of ships, as access to each side is limited.
a hot spot when one really does not exist. Kernel density smoothing routines, for
Conversely, too short of a time interval can example, should be used on each side of
obscure a cluster of criminal activity these barriers independently, which
because not enough observations were allows observations to be measured on
captured in relation to the actual time each side independently. Conversely,
interval of the spatial process. During this measuring incidents in relation to each
68
MAPPING CRIME: UNDERSTANDING HOT SPOTS
other could cause a hot spot to be detect points out that distribution and density
ed that crosses a barrier when actually no must be understood because categoriza
relationship is present. This same principal tion can make a difference on how hot
holds true for manmade barriers such as spots look or even if one is present.
major limited access highways or large Arbitrarily selecting categorical ranges may
parks. misrepresent the size and shape of the
hot spot.
Social barriers consist of environments
that make it difficult for an offender to
travel through undetected. Upscale neigh Software
borhoods with high-end shopping, restau Software for hot spot analysis is becoming
rants, or clubs provide an environment in more available in both GIS software and
which would-be criminals might stand out custom software programs. Much has
and draw attention. For example, private been written about using software for hot
security is often present in affluent neigh spot analysis, but little about the develop
borhoods and increased surveillance, such ment of hot spot analysis tools. In particu
as neighborhood watches, might provide a lar, these issues revolve around design of
record of identification. Social barriers will tools for conducting spatial data analysis
likely be less of a consideration than physi that includes environmental and demo
cal barriers, as spatial relationships will be graphic factors. In this respect, any analy
accounted for during analysis. A disruption sis software requires a variety of tools that
will occur between observations on each allows for full and indepth investigations.
side of a neighborhood, for example, that
will likely cause a diminishing amount of There are not enough robust statistical
observations from one side of the barrier tools within, or that interact with, GIS soft
to the other. ware. Software programs for spatial analy
sis, to date, do not contain all of the tools
needed to do a full analysis of data. For
Output display example, many custom software packages
As demonstrated in chapter 2, the results do not have the ability to visually display
of hot spot analysis can be displayed in hot spot analysis results. If any further
several ways. Primarily this is done as a analysis needs to be done, the analyst
continuous surface or as delineated must manipulate the data in a GIS for dis
boundaries depending on the output of play and then import that work back into
the analysis software. Some software pro the statistical analysis software. As a
grams output a grid of continuous surface result, analysts may use several software
values while others output a set of values programs to carry out their research and
within the original unit of analysis, such as analysis. For example, a hot spot analysis
a census block group. Either method of public housing and violent crime
requires categorizing data with an associ (Wilson and Everett, 2004) required the
ated color. use of ARC/INFO®, SPSS, Microsoft®
Excel, and CrimeStat to conduct the analy
When displaying output as a continuous sis. While progress has been made in
surface, the underlying values will often bringing spatial data analysis and GIS soft
have statistical significance. Therefore, it is ware together, such as GeoDa™ (Anselin,
important to understand the ranges of 2004), it is still not to the level that allows
those levels of significance, such as z- the flexibility and interactivity that the
scores, in order to show the significant crime analysis community needs.
breaks of criminal activity. Chapter 2
69
SPECIAL REPORT / AUG. 05
Theory and practice are often filled with other articles that may
not be relevant to the analyst.
Establishing a stronger link between theo
ry and practice will help avoid the arbitrary
What this means for
approaches to hot spot analysis and give
practitioners
an analyst a scientific foundation from
which to work. The literature often encour Practitioners must first and foremost
ages experimentation with analysis results develop strategies for conducting hot spot
until the outcomes make sense, but that analyses that have a scientific foundation.
can be time consuming and confusing Analysts must think about and organize
because nothing exists to substantiate the many elements and options that go
that the analysis approach was appropri into analysis. That is, practitioners must
ate. There should be solid and grounded use a scientific approach to carrying out
reasons for identifying clusters, parameter analysis that is logical, systematic, and
selection, analysis techniques, and output critically examined. This will give strong
display categories. credibility to the statistical output and
interpretation of the results. Further, ana
lysts must provide feedback to researchers
What this means for researchers
on analysis that tested a particular theory
To improve hot spot analysis, researchers or whether the use of empirical evidence
must do two important tasks. The first is worked in their jurisdiction.
to further develop theories to provide sci
entific reasons for depicting spatial rela Practitioners must understand that their
tionships and the strength of the approach to hot spot analysis will be differ
dependencies between criminal incidents, ent every time they conduct analysis. Their
environmental and socio-demographic vari approach will change based on place, pur
ables, and the interpretation of results. The pose of analysis, spatial dependence
second is to conduct more empirical stud between crime and environment, crime
ies that test theories of spatial relation type, time, barriers, and the visual display
ships and crime to guide parameter of results. This will subsequently deter
selection, appropriate time intervals, crime mine which software programs they use
types, and barriers. Researchers, there and how they will use them.
fore, must work to build models that are
flexible and incorporate both composition Full circle
al (demographic) and contextual (ecologi
cal) variables. These models must perform Researchers and practitioners must work
spatial data analysis as well as statistical more closely together. Researchers often
analysis. will make contact with law enforcement
agencies to get data needed to conduct
Researchers must also do more to get research with little or no further contact.
their theories or empirical results into the Although exceptions exist and the problem
hands of practitioners through more out is decreasing as crime analysis progress
reach to crime analysts or policymakers. es, minimum contact is still largely the
Publishing in peer-reviewed journals, such norm. One way to bring these two groups
as Criminology or The Professional together is to develop software tools that
Geographer, will not reach an audience can provide an opportunity for instant
that wants to use theory and empirical evi feedback between the groups. Such timely
dence but has little recourse in doing so. feedback could lead to the development of
These journals are expensive to obtain and software that more closely models ground
70
MAPPING CRIME: UNDERSTANDING HOT SPOTS
truth. More so, researchers and practition Holtzman, H. 2004. Personal communications at the
ers should continue to attend events such National Institute of Justice.
as NIJ’s Crime Mapping Research Kulldorff, M. and Information Management Services
Conference or the Jill Dando Institute of Inc. 2004. SaTScan v4.0: Software for the spatial and
Crime Science Crime Mapping Conference space-time scan statistics. http://www.satscan.org
to maintain the discourse about what
works and what does not. Levine, N. 2002. CrimeStat 2.0, A Spatial Statistics
Program for the Analysis of Crime Incident Locations.
Houston, TX: Ned Levine & Associates and
Washington, DC: U.S. Department of Justice,
References National Institute of Justice.
Anselin, L. 2004. GeoDa version 0.9.5-i, An Roncek, D.W., R. Bell, and J.M.A. Francik. 1981.
Exploratory Spatial Data Analysis (ESDA) software “Housing Projects and Crime: Testing a Proximity
application. http://sal.agecon.uiuc.edu/geoda_main. Hypothesis.” Social Problems 29 (2): 151–166.
php
Wilson, R.E. and R.S. Everett. 2004. Targeting Violent
Gesler, W.M. and D.P. Albert. 2000. “How Spatial Crime in Small Communities: A Spatial Data Analysis.
Analysis Can be Used in Medical Geography.” In D.P. Unpublished research. Washington, DC: U.S.
Albert, W.M. Gesler, and B. Levergood (eds.), Spatial Department of Justice, National Institute of Justice.
Analysis, GIS, and Remote Sensing Applications in
the Heath Sciences. Chelsea, MI: Ann Arbor Press. Zahn, M.S., R.J. Kaminski, R.E. Wilson, and D.M.
Campos. 2003. Religious Institutions and Homicide in
Haining, R. 2003. Spatial Data Analysis: Theory and Philadelphia Neighborhoods. Unpublished research.
Practice. New York: Cambridge University Press. Washington, DC: U.S. Department of Justice,
National Institute of Justice.
71
SPECIAL REPORT / AUG. 05
72
About the National Institute of Justice
NIJ is the research, development, and evaluation agency of the U.S. Department of Justice.
NIJ’s mission is to advance scientific research, development, and evaluation to enhance the
administration of justice and public safety. NIJ’s principal authorities are derived from the
Omnibus Crime Control and Safe Streets Act of 1968, as amended (see 42 U.S.C. §§ 3721–3723).
The NIJ Director is appointed by the President and confirmed by the Senate. The Director estab
lishes the Institute’s objectives, guided by the priorities of the Office of Justice Programs, the
U.S. Department of Justice, and the needs of the field. The Institute actively solicits the views of To find out more about the National
criminal justice and other professionals and researchers to inform its search for the knowledge Institute of Justice, please visit:
and tools to guide policy and practice.
http://www.ojp.usdoj.gov/nij
Strategic Goals
NIJ has seven strategic goals grouped into three categories: or contact:
Dissemination
4. Disseminate relevant knowledge and information to practitioners and policymakers in an
understandable, timely, and concise manner.
5. Act as an honest broker to identify the information, tools, and technologies that respond to
the needs of stakeholders.
Agency management
6. Practice fairness and openness in the research and development process.
7. Ensure professionalism, excellence, accountability, cost-effectiveness, and integrity in the
management and conduct of NIJ activities and programs.
Program Areas
In addressing these strategic challenges, the Institute is involved in the following program areas:
crime control and prevention, including policing; drugs and crime; justice systems and offender
behavior, including corrections; violence and victimization; communications and information
technologies; critical incident response; investigative and forensic sciences, including DNA; less-
than-lethal technologies; officer protection; education and training technologies; testing and
standards; technology assistance to law enforcement and corrections agencies; field testing of
promising programs; and international crime control.
In addition to sponsoring research and development and technology assistance, NIJ evaluates
programs, policies, and technologies. NIJ communicates its research and evaluation findings
through conferences and print and electronic media.
U.S. Department of Justice
Office of Justice Programs PRESORTED STANDARD
National Institute of Justice POSTAGE & FEES PAID
AUG. 05
DOJ/NIJ
*NCJ~209393*
Washington, DC 20531
Official Business
PERMIT NO. G–91
Penalty for Private Use $300
NCJ 209393