40
340 N 12 th St, Suite 402 Philadelphia, PA 19107 215.925.2600 [email protected] www.hunchlab.com 2.0 - Getting Started

HunchLab 2.0 Getting Started

  • Upload
    azavea

  • View
    131

  • Download
    1

Embed Size (px)

DESCRIPTION

This presentation covers the requirements to get started with HunchLab 2.0's predictive policing system. It starts discussing technical requirements (security, authentication) and then proceeds to discuss guidelines for configuring meaningful predictive models of crime. The presentation concludes with information about related geographic and temporal datasets that are useful in forecasting crime with recommendations on how to prioritize data sets to use in HunchLab.

Citation preview

Page 1: HunchLab 2.0 Getting Started

340 N 12th St, Suite 402 Philadelphia, PA 19107

215.925.2600 [email protected]

www.hunchlab.com

2.0 - Getting Started

Page 2: HunchLab 2.0 Getting Started

Agenda •  Technical Overview

–  SaaS

–  Authentication

–  End-user Requirements

•  Setup

–  Required Data

–  Uploading Crime Data

–  Defining Crime Models

•  Additional Data Sets

Page 3: HunchLab 2.0 Getting Started

Places

People

Patterns } Prioritization

Page 4: HunchLab 2.0 Getting Started

Places

People

Patterns } Prioritization

Page 5: HunchLab 2.0 Getting Started
Page 6: HunchLab 2.0 Getting Started

SaaS Architecture

Page 7: HunchLab 2.0 Getting Started

Software as a Service Model

•  Subscription –  Bug fixes –  Updates –  Hosting / backups / etc. –  2nd tier support –  Training

•  Amazon Web Services infrastructure –  High availability –  Elastic resources

•  User load •  Model building processes

Page 8: HunchLab 2.0 Getting Started

AWS Infrastructure & Security

•  AWS data centers –  Data residency

•  US or EU

–  Physical security •  AWS employees with permission / 2 factor auth

–  Logical access •  Azavea employees with permission / 2 factor auth

–  Redundant network / power –  Continuous penetration testing –  3rd party evaluations

•  Best-of-breed services

Page 9: HunchLab 2.0 Getting Started
Page 10: HunchLab 2.0 Getting Started

Authentication

Page 11: HunchLab 2.0 Getting Started

Authentication

•  Options –  Standalone

•  HunchLab managed credentials

–  Integrated •  Active Directory / LDAP compatible •  Requires SaaS application to contact internal servers

•  Security Considerations –  CJIS requires 2 factor authentication –  HunchLab can provide this in standalone mode

Page 12: HunchLab 2.0 Getting Started

Authentication

Page 13: HunchLab 2.0 Getting Started

End-user Requirements

Page 14: HunchLab 2.0 Getting Started

Client Requirements / Browsers

•  Core requirements –  Modern browser –  Network connectivity

•  TLS 1.1+

–  HTML5 app •  Geolocation API (GPS for Sidekick)

•  Browsers –  Desktop

•  Internet Explorer: last 2 releases •  Firefox: last 2 rapid releases and extended support release •  Chrome: last 2 rapid releases

–  Mobile •  Safari 7 for iOS •  Chrome current rapid release for Android

Page 15: HunchLab 2.0 Getting Started

Client Requirements / Browsers

Page 16: HunchLab 2.0 Getting Started

Client Requirements / Browsers

•  TLS version support –  http://en.wikipedia.org/wiki/Transport_Layer_Security#Web_browsers

Page 17: HunchLab 2.0 Getting Started

Client Requirements / Browsers

•  Testing –  http://test.hunchlab.com

Page 18: HunchLab 2.0 Getting Started

Required Data

Page 19: HunchLab 2.0 Getting Started

Required Data

•  Boundaries –  ShapeFile format –  Uploaded in application –  Types

•  Jurisdiction boundary (required) •  Organizational layers (divisions, districts, etc.)

•  Event data (crimes, calls for service) –  CSV format –  Uploaded via API

Page 20: HunchLab 2.0 Getting Started

Required Data

•  Event data (crimes, calls for service) –  CSV format

•  First row is headers with names as below

–  Columns •  datasource (string) - identifies data source

–  example: rms

•  id (string) - unique identifier for event within data source –  example: 1

•  class (string) - class(es) for event separated by pipe –  example: agg|1|23

•  pointx (numeric) – longitude –  example: -105.0255345

•  pointy (numeric) – latitude –  example: 39.7287494

•  address (string) - street address –  example: 340 N 12th Street

Page 21: HunchLab 2.0 Getting Started

Required Data

•  Event data (crimes, calls for service) –  Columns (continued)

•  datetimefrom (ISO8601 datetime) - start time –  example: 2012-01-01T13:00:00Z

•  datetimeto (ISO8601 datetime) - end time –  example: 2012-01-01T13:00:00Z

•  report_time (ISO8601 datetime) - report time –  example: 2012-01-01T13:00:00Z

•  last_updated (ISO8601 datetime) - record update time –  example: 2012-01-01T13:00:00Z

Page 22: HunchLab 2.0 Getting Started

Required Data

•  Event data (crimes, calls for service) –  Upload via API

•  Allows automation of upload process •  Workflow

–  Query your database for recent changes –  Transform into CSV format –  POST CSV to HunchLab URL –  Check for import to complete

–  Example scripts •  https://github.com/azavea/azavea-hunchlab-examples

Page 23: HunchLab 2.0 Getting Started

Crime Models

Page 24: HunchLab 2.0 Getting Started
Page 25: HunchLab 2.0 Getting Started

Crime Models

•  Generate predictions –  Automatically built on a regular basis

•  Represents one or more crime classes •  Choices to make:

–  Crime classes –  Color –  Severity weight –  Patrol Efficacy

Page 26: HunchLab 2.0 Getting Started
Page 27: HunchLab 2.0 Getting Started

Crime Models

•  Which crimes to model? –  Start with serious events

•  Part 1s, etc.

–  Add ‘problem’ crime types for your department

•  How many models? –  Aim for up to 10 models

•  Single crime type vs. combination? –  Does the event happen often enough on its own?

•  Example: Homicides as part of Violence

–  Is the strategy the same as related crime types? •  Example: Homicides vs. Aggravated Assaults

Page 28: HunchLab 2.0 Getting Started
Page 29: HunchLab 2.0 Getting Started

Lincoln Example

# Assaults x

$87,238

# Burglary x

$13,096

# MVT x

$9,079

Sum to Predicted Cost of Crime

# Rape x

$217,866

# Robbery x

$67,277

Page 30: HunchLab 2.0 Getting Started

Crime Models

•  Severity weights –  How important is it to prevent these crimes? –  RAND cost of crime

•  http://www.rand.org/content/dam/rand/pubs/occasional_papers/2010/RAND_OP279.pdf

–  NIH publications •  http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2835847/table/T5/

Page 31: HunchLab 2.0 Getting Started

Crime Models

Page 32: HunchLab 2.0 Getting Started

Crime Models

•  Patrol Efficacy –  What proportion of these events are preventable via patrol

activities? •  Example: rape (stranger vs known assailant)

–  How effective is patrol against the preventable events? •  Example: street crimes vs indoor crimes

–  Expressed as percent (0-100%) –  Examples:

•  Robbery: 50% •  Residential Burglary: 20% •  Rape: 5%

Page 33: HunchLab 2.0 Getting Started

Crime Models

1.  Define set of models via crime classes 2.  Assign severity weights 3.  Assign patrol efficacy values 4.  Assign colors

•  Overall Goal –  Craft a set of models that generate predictions for real

opportunities for your officers to prevent crime.

Page 34: HunchLab 2.0 Getting Started
Page 35: HunchLab 2.0 Getting Started

Optional Data

Page 36: HunchLab 2.0 Getting Started

Optional Data

•  Geographic POIs –  Points, lines, polygons (Shapefile) –  Examples

•  Schools •  Transit stops •  Parks •  Bars

•  Temporal feeds –  Schedules (CSV) –  Examples

•  School calendar •  Sporting events

Page 37: HunchLab 2.0 Getting Started

Choosing Data Sets

•  Usefulness vs. Complexity –  How strong do you believe the correlation is?

•  Example: bars vs hospitals

–  How big is the data set? •  Example: schools vs bus stops

–  How often does the data change? •  Example: hospitals vs bars

•  Availability –  Start with what you have

•  Police stations, fire stations, public housing

–  Layer in data from other city departments •  Schools, bus stops, liquor licenses

–  Fill in gaps (once things are going)

Page 38: HunchLab 2.0 Getting Started

Choosing Data Sets

•  Risk Terrain Modeling –  Literature reviews

•  http://www.rutgerscps.org/pubs.htm

–  Factors in 5 or more reviews: •  Drug Activity •  Bars •  Nightclubs •  Schools •  Transportation Hubs

Page 39: HunchLab 2.0 Getting Started

Agenda •  Technical Overview

–  SaaS

–  Authentication

–  End-user Requirements

•  Setup

–  Required Data

–  Uploading Crime Data

–  Defining Crime Models

•  Additional Data Sets

Page 40: HunchLab 2.0 Getting Started

340 N 12th St, Suite 402 Philadelphia, PA 19107

215.925.2600 [email protected]

www.hunchlab.com

Amelia Longo Business Development Associate [email protected] 215.701.7715

Jeremy Heffner HunchLab Product Manager [email protected] 215.701.7712