Task: Crime Forecasting#
Overview#
The task involves leveraging data science techniques to participate in a real-time crime forecasting challenge inspired by the National Institute of Justice’s Real-Time Crime Forecasting Challenge. Your goal is to analyze and forecast place-based crime patterns using provided data and additional relevant sources.
Objective#
Develop an algorithm or model that accurately predicts crime hotspots in a specified jurisdiction using the provided Portland Police Bureau (PPB) Calls-for-Service (CFS) dataset. Predictions will focus on specific crime categories over various timeframes.
Task Deliverables#
Data Analysis and Preprocessing
Load and explore the CFS dataset to identify patterns, trends, and features.
Handle any missing data or inconsistencies in the dataset.
Model Development
Develop a machine learning or statistical model for forecasting crime locations.
Focus on All Calls-for-Service
Forecast crime hotspots for time periods of:
Two weeks
Evaluation Metrics
Use the Prediction Accuracy Index (PAI) to measure how well your model predicts hotspots.
Use the Prediction Efficiency Index (PEI) to evaluate the efficiency of your predictions.
Use the March-May, 2017 Calls-for-Service Data as the test set to evaluate your solution’s PAI and PEI.
Visualization
Provide visualizations of the forecasted crime hotspots.
Clearly indicate predicted high-risk areas.
Report
Submit a concise report detailing:
Data exploration and preprocessing steps.
Model development and chosen features.
Evaluation of your model’s performance.
Discussion of challenges faced and potential improvements.
Include your visualizations in the report.
Submission
Submit your code as a GitHub repository.
Share the GitHub repository link with me.
Data Access#
Download the provided data set from the challenge webpage. Additional data sources may be used to enrich your analysis, but ensure they are appropriately cited.
Submission Format#
Share your GitHub repository containing:
Python code used for analysis and forecasting.
Forecast results for each timeframe in a .csv format.
Visualizations in .png or .jpg format.
A report in PDF format.
Your resume
One page statement to descript your … for the GA positions.
Evaluation Criteria#
Prediction Accuracy and Efficiency: How accurate and efficient your forecasts are based on PAI and PEI metrics.
Innovation: Use of novel techniques or approaches in your analysis and forecasting.
Clarity of Visualizations and Report: How well your findings are presented and explained.
Deadline#
Submit your completed task by 05/12/2025.