32
David Herzog Missouri School of Journalism and NICAR

A crash course in data for information graphics

Embed Size (px)

DESCRIPTION

An overview of using government data for information graphics, from locating the data to visualizing it with Web 2.0 tools and desktop software.

Citation preview

Page 1: A crash course in data for information graphics

David HerzogMissouri School of Journalism and NICAR

Page 2: A crash course in data for information graphics

Locating the data

Obtaining the data

Evaluating the data

Working with the data

Visualizing the data

Page 3: A crash course in data for information graphics

“Database state of mind”

Data has to exist. Where? Online Offline

Page 4: A crash course in data for information graphics

Government websites Data.gov U.S. Census Bureau FDIC Missouri Data Portal Missouri Accountability Portal

Page 5: A crash course in data for information graphics

U.S. agency FOIA pages Drug Enforcement Administration

NGO sites Right-to-Know Network OpenMissouri.org NICAR database library ALA state agency databases wiki

Page 6: A crash course in data for information graphics

Commercial services Socrata Infochimps Geocommons Foreclosure Radar Oil Price Information Service Search Systems Junar

Page 7: A crash course in data for information graphics

Academic data catalogs ICPSR

Forms Forms.gov Web forms▪ Columbia parade permits

Page 8: A crash course in data for information graphics

Records retention schedules

Reports State auditor U.S. Government Accountability Office U.S. Inspectors General

Page 9: A crash course in data for information graphics

Google advanced search Look for data files Look for key words Look only on government sites

Page 10: A crash course in data for information graphics
Page 11: A crash course in data for information graphics

Data entry In the field At the office

Printouts/reports

Inspection forms

Page 12: A crash course in data for information graphics

Download it

Write or request a scraper with ScraperWiki

Convert a PDF with CometDocs Zamzar

Just ask for it

Make an open-records request

Page 13: A crash course in data for information graphics

U.S. Freedom of Information Act Passed in 1966 Amended in 1996 to include electronic

records

State open-records statutes Missouri Sunshine Law

Page 14: A crash course in data for information graphics

Get the roadmap! Record layout File layout Data dictionary Code sheet

Metadata Data about the data

Page 15: A crash course in data for information graphics

Look at it immediately when you get it It is what you asked for/expected? How many rows/records of data? Is the file format OK?

Page 16: A crash course in data for information graphics

Does it look too good to be true?Beware of missing informationWho collected the information?How? What are their methods?Why?What is their agenda?Who supports them financially or

otherwise?

Page 17: A crash course in data for information graphics

Notepad++ for PCsTextMate for Mac

Page 18: A crash course in data for information graphics
Page 19: A crash course in data for information graphics
Page 20: A crash course in data for information graphics
Page 21: A crash course in data for information graphics

Always keep original file

Never overwrite data columns

Tools Spreadsheets Database managers Google Refine Programming languages

Page 22: A crash course in data for information graphics

Raw numbers, without context, rarely are interesting.

Ask: Compared to what?

Page 23: A crash course in data for information graphics

Raw (amount) change New-Original

Percent change Change/Original

Per capita rates Per person Per x people

Page 24: A crash course in data for information graphics

Percent of total Individual/Total

Ratio Apples/oranges

Averages Mean Median

Page 25: A crash course in data for information graphics

Be curious!Cut out small slicesSpreadsheets for simple math and

comparisonsSpreadsheets for pivot tablesDatabase managers for more robust

analysisAlways ask: Is this correct?

Page 26: A crash course in data for information graphics

Online software platforms

Desktop software

Page 27: A crash course in data for information graphics
Page 28: A crash course in data for information graphics
Page 29: A crash course in data for information graphics
Page 30: A crash course in data for information graphics
Page 31: A crash course in data for information graphics
Page 32: A crash course in data for information graphics

Contact David Herzog at

[email protected] Twitter: @davidherzog