24
Data wrangling with Tableau and Excel October 11 2016 JRNL 520H

Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Data wrangling with Tableau and Excel

October 11 2016JRNL 520H

Page 2: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What is data wrangling?Data wrangling is the process of preparing raw data for use in a data analysis or visualization software.

Page 3: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What are the causes of dirty data?● Data entry error

Page 4: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What are the causes of dirty data?● Data entry error● Incompatible tables

Page 5: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What are the causes of dirty data?● Data entry error● Incompatible tables● Incompatible table format

Page 6: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What should we look out for when cleaning data?● Table formating

Page 7: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What should we look out for when cleaning data?● Table formating● Variable type

Page 8: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What should we look out for when cleaning data?● Table formating● Variable type● Invalid character values

Page 9: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What should we look out for when cleaning data?● Table formating● Variable type● Invalid character values● Invalid numeric values

Page 10: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What should we look out for when cleaning data?● Table formating● Variable type● Invalid character values● Invalid numeric values● Grouping data

Page 11: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

What should we look out for when cleaning data?● Table formating● Variable type● Invalid character values● Invalid numeric values● Grouping data● Missing values

Page 12: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Ideal format of data in Tableau1. Start your data in cell A1. Remove all introductory information and footnotes.2. Have the first row be the column headers/variable names3. Have every subsequent row be one observation. No cross-tabulation!

Page 13: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Ideal format of data in TableauBefore After

Page 14: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Ideal format of data in TableauBefore After

Page 15: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Data InterpreterTableau’s Data Interpreter feature draws out sub-tables and removes some of that extraneous information to help prepare your data source for analysis. Note: the data interpreter only works with Microsoft Excel files, not CSV or other file types.

Page 16: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Data InterpreterTableau’s Data Interpreter feature draws out sub-tables and removes some of that extraneous information to help prepare your data source for analysis. Note: the data interpreter only works with Microsoft Excel files, not CSV or other file types.

Complete Tableau exercise

Page 17: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

JoinsA JOIN is a means for combining columns from one or more tables by using values common to each. There are four main join types: inner, left, right and full outer.

Page 18: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Joins

Page 19: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Joins

Page 20: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Joins

Page 21: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Joins

Complete Tableau exercise

Page 22: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Wrangling in ExcelSometimes the data interpreter in Tableau isn’t able to detect all of the errors in the dataset. In cases like this, you will need to manually clean the data in Excel.

Complete Tableau exercise

Page 23: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

PivotTabular format

Columnar format

Page 24: Excel Data wrangling with Tableau andtmm/courses/journ16/slides/week5-wrangle.pdf · Data Interpreter Tableau’s Data Interpreter feature draws out sub-tables and removes some of

Pivot

Complete Tableau exercise

Tabular format

Columnar format