34
Displaying Data Data: Categorical and Numerical Dot Plots Stem and Leaf Plots Back-to-Back Stem and Leaf Plots Grouped Frequency Tables Histograms and Bar Graphs Circle Graphs (Pie Charts)

Displaying Data Data: Categorical and Numerical Dot Plots Stem and Leaf Plots Back-to-Back Stem and Leaf Plots Grouped Frequency Tables Histograms

Embed Size (px)

DESCRIPTION

Data Data are observations (such as measurements, genders, survey responses) that have been collected. Statisticians often collect data from small portions of a large group in order to determine information about the group. This information is then used to make conjectures about the entire group. Describing data frequently involves reading information from graphical displays, tables, lists, and so on.

Citation preview

Page 1: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Displaying Data

Data: Categorical and Numerical Dot Plots Stem and Leaf Plots Back-to-Back Stem and Leaf Plots Grouped Frequency Tables Histograms and Bar Graphs Circle Graphs (Pie Charts)

Page 2: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Graphs are used to try to tell a story.

Page 3: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

DataData are observations (such as measurements, genders, survey responses) that have been collected. Statisticians often collect data from small portions of a large group in order to determine information about the group. This information is then used to make conjectures about the entire group.

Describing data frequently involves reading information from graphical displays, tables, lists, and so on.

Page 4: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Data: Categorical and NumericalCategorical data are data that represent characteristics of objects or individuals in groups (or categories), such as black or white, inside or outside, male or female.

Numerical data are data collected on numerical variables. For example, in grade school, students may ask whether there is a difference in the distance that girls and boys can jump. The distance jumped is a numerical variable and the collected data is numerical.

Page 5: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

A dot plot, or line plot, provides a quick and simple way of organizing numerical data. They are typically used when there is only one group of data with fewer than 50 values.

Dot Plots

Page 6: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Suppose the 30 students in Abel’s class received the following test scores:

Dot Plots

Page 7: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

A dot plot for the class scores consists of a horizontal number line on which each score is denoted by a dot, or an x, above the corresponding number-line value. The number of x’s above each score indicates how many times each score occurred.

Dot Plots

Page 8: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

The score 52 is an outlier Scores 97 and 98

form a cluster

A gap occurs between scores 88

and 97.

Dot plotsTwo students

scored 72.

Four students scored 82.

Page 9: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Dot PlotsIf a dot plot is constructed on grid paper, then shading in the squares with x’s and adding a vertical axis depicting the scale allows the formation of a bar graph.

Page 10: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Stem and Leaf PlotsThe stem and leaf plot is similar to the dot plot, but the number line is usually vertical, and digits are used rather than x’s.

9 | 7 represents 97.

Page 11: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Stem and Leaf PlotsIn an ordered stem and leaf plot, the data are in order from least to greatest on a given row.

Page 12: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Advantages of stem-and-leaf plots: They are easily created by hand. Do not become unmanageable when volume of

data is large. No data values are lost.

Disadvantage of stem-and-leaf plots: We lose information – we may know a data value

exists, but we cannot tell which one it is.

Stem and Leaf Plots

Page 13: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

How to construct a stem-and-leaf plot:

1. Find the high and low values of the data.

2. Decide on the stems.

3. List the stems in a column from least to greatest.

4. Use each piece of data to create leaves to the right of the stems on the appropriate rows.

Stem and Leaf Plots

Page 14: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

5. If the plot is to be ordered, list the leaves in order from least to greatest.

6. Add a legend identifying the values represented by the stems and leaves.

7. Add a title explaining what the graph is about.

Stem and Leaf Plots

Page 15: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Back-to-back stem-and-leaf plots can be used to compare two sets of related data. In this plot, there is one stem and two sets of leaves, one to the left and one to the right of the stem.

Back-to-Back Stem-and-Leaf Plots

Page 16: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

ExampleGroup the presidents into two groups, George Washington to Rutherford B. Hayes and James Garfield to Ronald Reagan.

Page 17: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example (continued)

a. Create a back-to-back stem and leaf plot of the two groups and see if there appears to be a difference in ages at death between the two groups.

b. Which group of presidents seems to have lived longer?

Page 18: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example (continued)

Because the ages at death vary from 46 to 93, the stems vary from 4 to 9. The first 19 presidents are listed on the left and the remaining 19 on the right.

Page 19: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example (continued)

The early presidents seem, on average, to have lived longer because the ages at the high end, especially in the 70s through 90s, come more often from the early presidents.

The ages at the lower end come more often from the later presidents.

For the stems in the 50s and 60s, the numbers of leaves are about equal.

Page 20: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Stem and Leaf PlotsA stem and leaf plot shows how wide a range of values the data cover, where the values are concentrated, whether the data have any symmetry, where gaps in the data are, and whether any data points are decidedly different from the rest of the data.

Page 21: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Frequency TablesA frequency distribution table shows how many times data occurs in a range.

The data for the ages of the presidents at death are summarized in the table.

Page 22: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Each class interval has the same size. The size of each interval can be computed by

subtracting the lower endpoint from the higher and adding 1, e.g., 49 – 40 +1 = 10.

We know how many data values occur within a particular interval but we do not know the particular data values themselves.

Frequency TablesCharacteristics of Frequency Tables

Page 23: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

As the interval size increases, information is lost. Classes (intervals) should not overlap.

Frequency TablesCharacteristics of Frequency Tables

Page 24: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Histograms and Bar GraphsA histogram is made up of adjoining rectangles, or bars.

The bars are all the same width. The scale on the vertical axis must be uniform.

Page 25: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Uni

form

sca

leHistograms and Bar Graphs

The death ages are shown on the horizontal axis and the numbers along the vertical axis give the scale for the frequency.

Frequencies are shown by the heights of vertical bars each having same width.

Page 26: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Bar GraphsA bar graph typically has spaces between the bars and is used to depict categorical data.

The bars representing Tom, Dick, Mary, Joy, and Jane could beplaced in any order.

Page 27: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Histograms and Bar GraphsA distinguishing feature between histograms and bar graphs is that there is no ordering that has to be done among the bars of the bar graph, whereas there is an order for a histogram.

Page 28: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

A double bar graph can be used to make comparisons in data.

Double-Bar Graphs

Page 29: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Circle Graphs (Pie Charts)A circle graph, or pie chart, is used to represent categorical data. It consists of a circular region partitioned into disjoint sections, with each section representing a part or percentage of the whole.

A circle graph shows how parts are related to the whole.

Page 30: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example

Construct a circle graph for the information in the table, which is based on information taken from a U.S. Bureau of the Census Report (2006).

Page 31: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example (continued)

The entire circle represents the total 299 million people.

The measure of the central angle (an angle whose vertex is at the center of the circle) of each sector of the graph is proportional to the fraction or percentage of the population the section represents.

Page 32: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example (continued)

For example, the measure of the angle for thesector for the under-5 group is or approximately 7% of the circle.

Because the entire circle is 360°, of 360°, or about 24°, should represent the under-5 group.

Page 33: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example (continued)

The table shows the number of degrees for each age group.

Page 34: Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms

Example (continued)