The Socio-Economics of Crime: Burglary in Washington, D.C.

Title
The Socio-Economics of Crime: Burglary in Washington, D.C.

Author

Neil Dhawan
American River College, Geography 350: Data Acquisition in GIS; Fall 2003

Abstract

Identifying and quantifying spatial relationships between areas of high burglary density and various demographic characteristics.

Introduction

The root causes of crime have been well theorized and debated. An interesting corollary to this discussion is an examination of the relationship between geography and crime. Rather than attempt to identify what factors lead to crime, we may explore what factors accompany crime. That is, we focus on correlation rather than causation. This study attempts to do just that. In particular, I have attempted to quantify the relationships between various socio-economic conditions and areas with a high density of burglaries. Washington D.C. burglary data from the year 2000 was used in this analysis. By comparing the percentage of areas in close proximity to burglary locations that fall in census tracts with some measurable demographic value to the percentage of the city those tracts comprise, possible spatial relationships were identified. GIS’ ability to model such relationships, particularly in the raster file format, made it an excellent tool in this endeavor.

Background

Each year, the FBI publishes Crime in the United States, a statistical summary of crime offenses in the U.S. It merely provides a reference for crime statistics broken down by political jurisdiction and individual agency and, as such, the FBI warns that it should not be used as a singular tool of crime analysis. Different areas can have vastly different demographics; it is naļve and imprudent to think that crime rates do not need to be standardized in some way. However, the FBI does note that several crime factors are known to affect both the type and amount of crimes committed. These include but are not limited to:

Population density and degree of urbanization.
Economic conditions, including median income, poverty level, and job availability.
Cultural factors and educational, recreational, and religious characteristics.
Family conditions with respect to divorce and family cohesiveness.
Stability of population with respect to residents' mobility, commuting patterns, and transient factors.
Modes of transportation and highway system.

In conjunction with known burglary locations, I have used these and other factors to draw inferences about the relationships between varying demographic attributes and burglary.

Methods

All spatial datasets were obtained from the ESRI website in the shapefile format. These included Washington D.C. census tracts, streets, major roads, water bodies, and 2,531 burglary locations for the year 2000. Because the census tracts shapefile was unprojected and did not have a projection file (*.prj) associated with it, I used ArcToolbox to set the spatial reference to the NAD83 Geographic Coordinate System. To eliminate any accuracy errors associated with data projected “on-the-fly” during an ArcMap session, I projected the census tracts shapefile to the NAD83 State Plane Maryland Projected Coordinate System to match the rest of the data. Since the area of each tract was not listed in the attribute table (an attribute I would need in my analysis), I created a Personal Geodatabase file (*.mdb) within ArcCatalog, and imported the census tracts shapefile in as a feature class to the Personal Geodatabase. This automatically created a field that contained the area of each census tract in square meters. Aerial photos were downloaded from the TerraServer USA website in a JPEG format. Although the photos were projected and came with GIS world coordinates embedded in a World file (*.jgw), I had to set the spatial reference to UTM Zone 18 (NAD83 datum) using ArcCatalog.

Demographic data was downloaded from the U.S. Census Bureau’s American FactFinder website. The socio-economic information I obtained included the following:

average per capita income
total population
population of males 25 years or older
population of males 25 years or older without a high school degree or equivalency
population of adults 30 years or older
population of adults 30 years or older responsible for grandchildren
population of persons with a known income
population of persons with a known income below poverty

This data was summarized by individual census tracts and downloaded as an MS Excel spreadsheet (*.xls). I imported this spreadsheet into MS Access as a database table, and created several new fields based on the existing demographic data and the census tract area field created in the Personal Geodatabase. These new fields included:

population density (persons/ m²)
proportion of males 25 years or older without a high school degree or equivalency
proportion of the total population with a known income who are below poverty
proportion of adults 30 years or older who are responsible for grandchildren

I exported the updated database table as a dBase IV (*.dbf) file, and added it to ArcMap. I then added all the D.C. spatial data, including a dBase IV table of pawnshop addresses in Washington, D.C. I geocoded the pawnshop locations and performed a tabular join between the demographic data table and the census tracts shapefile based on a census tract ID field.

Because the burglary shapefile contained point-specific burglary locations, there was a possibility that relevant spatial relationships would be overlooked. That is, if a burglary is committed near the border between two or more census tracts, I needed a method to ensure that all census tracts in close proximity would be included in the analysis of that particular crime event. This was done by creating a straight-line distance GRID raster of 50 meters from each burglary location.

Burglaries as Points Burglaries as Burglary Proximity Zones

I refer to these areas within 50 meters of a burglary as “Burglary Proximity Zones,” and thus named the raster file “Burglary Proximity Zones.” I then reclassified the Burglary Proximity Zones raster file into a single value: 1. Like all raster files created in this project, the cell resolution was 10m, and therefore each cell represented 100 m².

In order to quantify (at least on some level) the relationships between burglary and certain demographic characteristics I utilized a simple process of topological overlay using the ArcGIS Spatial Analyst extension. Generally, the same basic steps were followed in each analysis:

(1) Creation of a GRID raster file from the census tracts shapefile based on some socio-economic attribute

(2) Reclassification of the raster file into two classes (values 0 and 1) based on some specific value of the attribute in question

(3) Multiplication of the raster file by the Burglary Proximity Zones raster file using the Raster Calculator

(4) Calculation of the percentage of areas within 50 meters of a burglary (i.e. Burglary Proximity Zones) that are also in census tracts with the aforementioned specific attribute value

(5) Calculation of the percentage of the city that is comprised of census tracts with the aforementioned specific attribute value

View Detailed Methods and Results

Results

Burglary Factor	Percent of Burglary Proximity Zones' cumulative area in a Census Tract with Burglary Factor	Percent of Washington, D.C.'s total area comprised of Census Tracts with Burglary Factor
Percentage of People with a Known Income who are Below Poverty >= 18%	54%	37%
Percentage of Males 25 or older who are without a High School Degree >= 27%	54%	36%
Population Density >= 5,000 persons/km²	43%	19%
Percentage of People 30 or older in a household who are responsible for grandchildren >= 7%	43%	28%
Areas within 1 mile of a pawnshop	60%	38%
Areas within 500 feet of a major road	57%	40%

CRIME FACTOR MAPS

Poverty Educational Attainment Population Density

Primary Caregivers Proximity to Pawnshops Proximity to Major Roads

Analysis

As theorized, there were fairly significant associations between burglary locations and certain socio-economic or geographic attributes. All 6 crime factor analyses produced results that followed my general expectations. For example, census tracts with a high population density had more burglaries than a simple 1:1 ratio of area-to-burglaries would predict. Whereas only 19% of Washington, D.C.’s total area has a population density equal to or greater than 5,000 persons/ km², 43% of the total Burglary Proximity Zone (all areas within 50 meters of a burglary) is located here. However, it quickly became clear that my analysis failed to consider several relevant issues. As I evaluated the results, I noticed that many areas of high burglary density consistently failed to show up in the areal extent of the considered socio-economic attributes. A high burglary density is evident in census tract 84, and yet this tract does not have (1) a high population density, (2) a high percentage of grandparents as primary caregivers, (3) a high percentage of adults below poverty, and (4) a high percentage of adult males who don’t have a high school degree. In fact, the total population of this tract is only 1. I did not consider the fact that burglaries are not limited to residences. Commercial burglaries often take place in areas with little or no population at all. As such, any attempt to predict areas at risk of high burglary densities must also include factors other than those relating to social characteristics. For example, proximity to major roads (or any means of “quick getaway” for a burglar, such as a subway) is a burglary factor that is not dependant on social characteristics. Because I did not have the metadata to differentiate between commercial and residential burglaries in my dataset, perhaps I could have limited my focus to census tracts with a certain population level (at the very least to tracts where the population was greater than “0”). Another concern that came forth during my analysis is my method of using Burglary Proximity Zones instead of point features. As mentioned, I chose to create a 50 meter "buffer" around each burglary location to include census tracts and their demographics in my analysis that were near certain burglaries but did not encompass them. However, there is a possibility that such a technique would force me to undervalue areas with an extremely high number of burglaries committed in extremely close proximity. For example, if 10 burglaries were committed within a very short distance from each other, a 50 meter "buffer" raster would essentially appear the same as a 50 meter "buffer" that surrounds only 1 burglary location. Finally, an issue that should be noted is the concept of crime versus the concept of burglary. Although a burglary is always a crime, a crime is not always a burglary. The FBI’s list of known crime factors is a general one, and there is little chance that each factor described is relevant to burglary. Despite this, and as discussed, the factors which I did consider were all shown to have a degree spatial correlation to burglary.

Conclusions

Though the goals of this project were modest, I feel satisfied knowing that, at least overall, the results met my expectations. I think most students have had the experience of plugging number after number into a calculator, only to see an answer on the calculator screen that is quite obviously wrong ("the speed of the bicyclist is... 2,623,872 mph?"). Ultimately, I hope to use my findings to create a predictive model for burglary. This would require further analysis, such identifying factors that affect commercial burglary locations. Additionally, I need to find an accurate method of incorporating surroundings into my model. That is, census tracts are not islands. The image conjured up by the proverbial "wrong side of the tracks" is probably not an accurate one, as it seems more likely that most areas are usually affected by what surrounds them.

References

Crime Factors. Retrieved November 6, 2003 from the Federal Bureau of Investigation Web Site: http://www.fbi.gov/ucr/cius_02/html/web/crimefactors.html

Uniform Crime Reports. Retrieved November 6, 2003 from the Federal Bureau of Investigation Web site: http://www.fbi.gov/ucr/ucr.htm

links
non-tables page || index page || || original