Project Proposal

The Prime Crime Area Spatio-Temporal Analysis

Published

Feb. 27, 2021

DOI

Contents

Executive Summary

Back to top

Demographic, socio-economic and crime rate data of the Greater London Region, retrieved from the London Datastore, are used in this project. In this project, 3 key analysis will be performed:

  1. Exploratory Data Analysis
  2. Clustering analysis
  3. Regression Modeling

Motivation

Back to top

With the limited police resources and possible adverse impact when crime occurs, analytics on crime has been done as far back as in the 1800s (Hunt, 2019). Crime occurrence was found to have spatial patterns, and thus predictive analytics should be possible. However, mixed results were obtained in the research to determine whether predictive policing results to lower crime rates (Meijer & Wessels, 2019). Thus, it is more beneficial to use the analytics to determine areas with higher risk of crime and to discover the underlying factors to the increased risk.

Traditionally, crime analysis is done manually or through a spreadsheet program (RAND Corporation, 2013). This project would give the users an easier way to do the analysis using a web application.

Forecasting Crime for Law Enforcement, extracted from Predictive Policing, RAND Corporation, 2013

Project Objectives

Back to top

This project aims to deliver an interactive user web application interface, whereby users are able to apply actionable insights based on the 3 key analysis

Datasets

Back to top

A borough includes wards, which is the primary unit of English electoral geography for civil parishes and district councils. There are a total of 32 boroughs in Greater London, excluding the City of London.

1. MPS Ward Level Crime (historic: Apr 2010 onward)

2. MPS Ward Level Crime (most recent 24 months)

3. Land Area & Population - Ward

4. Income of Taxpayers

5. Economic Activity Rate, Employment Rate and Unemployment Rate by Ethnic Group & Nationality, Borough

6. Geographical Map of London (LOAC) SHP data

7. Local Authority District Names and Codes

Proposed Scope and Methodology

Back to top

1. Data cleaning and preparation

2. Choose the right R package to visualize:

3. Data visualization and Analysis

4. Building of Artifact - Web Application

The timeframe for this project is illustrated in the Gantt Chart below

Project Timeframe

Storyboard & Visualization Features

Back to top

Board 1 - Exploratory Data Analysis

Exploratory Data Analysis

Board 2 - Clustering Analysis

Clustering Analysis

Board 3 - Regression Analysis

Regression Analysis


Software Tools

Back to top

RStudio

R-Packages

Back to top

Data Cleaning:

EDA:

Clustering Analysis:

Regression Analysis:

Web Application:

References

Back to top

Data Sources

Footnotes