13
College of Engineering Engineering Statistics Department of Dam & Water Resources Lecturer: Goran Adil & Chenar Pictorial Description of Data Chapter 2-2 22 Chapter 2, Part Two: Pictorial (Graphical Description of Data Learning outcomes when you complete this chapter, you should be able to do the following: 1- Understanding graph and charts 2- How to make graph or charts Graphical Presentation A graph is a method of presenting statistical data in visual form. The main purpose of any chart is to give a quick, easy-to-read-and-interpret pictorial representation of data which is more difficult to obtain from a table or a complete listing of the data. The type of chart or graphical presentation used and the format of its construction is incidental to its main purpose. A well-designed graphical presentation can effectively communicate the data's message in a language readily understood by almost everyone. they can be used to describe either a sample or a population; quantitative or qualitative data sets. Some basic rules for the construction of a statistical chart are listed below: (a) Every graph must have a clear and concise title which gives enough identification of the graph. (b) Each scale must have a scale caption indicating the units used. (c) The zero point should be indicated on the co-ordinate scale. If, however, lack of space makes it inconvenient to use the zero point line, a scale break may be inserted to indicate its omission. (d) Each item presented in the graph must be clearly labelled and legible even in black and white reprint.

2.2 pict of data

Embed Size (px)

Citation preview

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 22

Chapter 2, Part Two: Pictorial (Graphical Description of Data

Learning outcomes

when you complete this chapter, you should be able to do the following:

1- Understanding graph and charts

2- How to make graph or charts

Graphical Presentation

A graph is a method of presenting statistical data in visual form. The main purpose of any

chart is to give a quick, easy-to-read-and-interpret pictorial representation of data which is

more difficult to obtain from a table or a complete listing of the data.

The type of chart or graphical presentation used and the format of its construction is incidental to its

main purpose. A well-designed graphical presentation can effectively communicate the data's

message in a language readily understood by almost everyone. they can be used to describe

either a sample or a population; quantitative or qualitative data sets.

Some basic rules for the construction of a statistical chart are listed below:

(a) Every graph must have a clear and concise title which gives enough identification of the

graph.

(b) Each scale must have a scale caption indicating the units used.

(c) The zero point should be indicated on the co-ordinate scale. If, however, lack of space makes it

inconvenient to use the zero point line, a scale break may be inserted to indicate its omission.

(d) Each item presented in the graph must be clearly labelled and legible even in black and white

reprint.

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 23

Stem-and Leaf Plot

Represents data by separating each value into two parts: the stem (such as the leftmost digit) and the

leaf (such as the rightmost digit).

Example 2.4

Concrete cube test. The following 28-day compressive strengths, in newtons per

square millimetre, were obtained from test results on concrete cubes in England. The

Results are shown in Table 2.4.

Table 2.4: Concrete Cube Test Results

43.3 45.7 45.8 47.3 47.3 47.7 47.7 48.1 48.5 48.5 48.8

49.1 49.3 49.4 49.5 49.5 49.6 50.2 50.5 51.0 51.0 51.1

52.0 52.2 52.5 52.7 52.8 52.9 53.1 53.1 53.4 53.4 53.8

Steam (Tens) Leaves (unit)

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 24

HINT

• Data that has at least two digits better exemplifies the value of a stem-leaf plot.

• Cover up the actual data and ask students to read the data from the stem-leaf plot.

• Data should be put in increasing order.

• A stem-leaf plot builds ‘side-ways’ histogram.

• The most common mistake students make when developing stem-leaf plots is leaving out a

‘stem’ if there is no data in that class.

The other mistake is, if there is no data in a class, putting a ‘0’ next to the stem - which, of course,

would indicate a data value with a right digit of ‘0’. Students confuse the use of ‘0’ indicating a

frequency in a frequency table with a ‘0’ in a stem-leaf plot indicating an actual data value.

DOT PLOTS

• Dots represent an actual data value.

• Dots representing the same value are stacked.

• Similar to a histogram in that the distribution of the data is shown

Example 2.5

The following series is the minimum monthly flow (m3 S-l ) in each of the 20 years 1957 to 1976 at

a certain river in United Kingdom.

Table 2.5: minimum monthly flow ( m3 S-l )

21 36 4 16 21 21 23 11 46 10

25 12 9 16 10 6 11 12 17 3

0 5 10 15

20 25

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 25

Bar Charts (Bar Graph)

A-Line Diagram or Bar Chart Line diagram or bar chart

The occurrences of a discrete variable can be classified on a line diagram or bar chart. In this type

of graph, the horizontal axis gives the values of the discrete variable and the occurrences are

represented by the heights of vertical lines. The horizontal spread of these lines and their relative

heights indicate the variability and other characteristics of the data.

How to make a bar graph: 1. Use the data from the table to choose the right scale.

2. Draw and label the scale on the vertical axis. (Vertical means "up and down.")

3. Draw and label the horizontal axis. (Horizontal means "across.")

4. List the name of each item.

5. Draw vertical bars to represent each number.

6. Title the graph

Example 2.6

Number of flood occurrences per year from 1939 to 1972 at the gauging station of Calamazza on

the Magra River, between Pisa and Genoa in north western Italy.

Table 2.6flood occurrences in one year

Number of floods

in a year

Frequency (x)

0 0

1 2

2 6

3 7

4 9

5 4

6 1

7 4

8 1

9 0

Total 34

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 26

Histogram, Polygon and Ogive.

Example 2.7

The Annual maximum daily mean discharges (m3/s) for the River Thames

at Kingston over the period 1884 to 1991 are presented in Table 2.14

Table 2.7: Frequency Table (Annual maximum flows in m3/s)

Class limit (Annual

maximum flows in

m3/s)

Class Mid-

Point

Lower Class

Boundary

Upper Class

Boundary Frequency

Relative

Frequency

00 – 99 1

100 – 199 8

200 – 299 42

300 – 399 36

400 – 499 11

500 – 599 7

600 – 699 1

700 – 799 2

Total 108 100.01

0

1

2

3

4

5

6

7

8

9

10

0 1 2 3 4 5 6 7 8 9

FL

OO

D O

CC

UR

RE

NC

ES

P

ER

Y

EA

R

NUMBER OF FLOODS

Flood Occurrences Per Year

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 27

Histograms

a) Frequency Histogram

Histograms look similar to bar charts, but they are used to present numerical or continuous

data rather than categorical data. In bar charts the “x-axis” lists the various categories under

consideration, whereas in histograms the “x-axis” is a numerical scale.

The bars on the horizontal scale are labeled with one of the following:

(1) Horizontal Scale for Histogram: Use class boundaries or class midpoints.

(2) Vertical Scale for Histogram: Use the class frequencies.

Guide to construct histogram

♣ It is often more practical to use class midpoint values instead of the class boundaries.

♣ A common rule of thumb is that the vertical height of the histogram should be about three-

fourths of the total width.

♣ Both axes should be clearly labeled

1

8

42

36

117

1 2

0

5

10

15

20

25

30

35

40

45

99.5 199.5 299.5 399.5 499.5 599.5 699.5 799.5

-0.5 99.5 199.5 299.5 399.5 499.5 599.5 699.5

Fre

qu

ency

Daily Mean Discharges (m3/s)

Annual Maximum Daily mean Discharges for the River Thames

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 28

b) Relative Frequency Histogram

Has the same shape and horizontal scale as a histogram, but the vertical scale is marked with

relative frequencies instead of actual frequencies.

Polygon

Frequency polygon

Uses line segments connected to points directly above class midpoint values

0.93%%

7.41%%

38.89%%

33.33%%

10.19%%6.48%%

0.93%% 1.85%%

0%

5%

10%

15%

20%

25%

30%

35%

40%

45%

99.5 199.5 299.5 399.5 499.5 599.5 699.5 799.5

-0.5 99.5 199.5 299.5 399.5 499.5 599.5 699.5

Rel

ati

ve

Fre

qu

ency

(%

)

Daily Mean Discharges (m3/s)

Annual Maximum Daily mean Discharges for the River Thames

0

5

10

15

20

25

30

35

40

45

0 100 200 300 400 500 600 700 800

Fre

qu

ency

Daily Mean Discharges (m3/s)

Annual Maximum Daily mean Discharges for the River Thames

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 29

Relative Frequency polygon

Ogive

A line graph that depicts cumulative frequencies

0%

5%

10%

15%

20%

25%

30%

35%

40%

45%

0 100 200 300 400 500 600 700 800

Rel

ati

ve

Fre

qu

ency

Daily Mean Discharges (m3/s)

Annual Maximum Daily mean Discharges for the River Thames

0%

20%

40%

60%

80%

100%

120%

0 100 200 300 400 500 600 700 800

Cu

mu

lati

ve

Rel

ati

ve

Fre

qu

ency

Daily Mean Discharges (m3/s)

Annual Maximum Daily mean Discharges for the River Thames

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 30

Pie Charts

A pie chart is a graph that depicts qualitative data as slices of a circle, in which the size of each slice

is proportional to the frequency count for the category.

Guidelines for constructing Pie chart

1- Choose a small number (5-6) of categories for the variable because too many make the pie

chart difficult to interpret.

2- Whenever possible, construct the pie chart so that percentages are in either ascending or

descending order

Example 2.8

Table 2.8 below shows failed earth dams in terms of dam type. Present the data in form of a pie chart.

Table 2.8: Subdivision of Failed Earth Dams in Terms of Dam Type.

Earth dam type Case number Percentage (%)

Homogeneous earthfill 38 62.0

Zoned earthfill 9 15.0

Earthfill with core wall 12 20.0

Faced earthfill 2 3.0

Total 61 100.0

360

Component partangle

Whole quantity

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 31

Earth dam type 62%

Homogeneous earthfill

15%

Zoned earthfill20%

Earthfill with core wall3%

FAILED EARTH DAMS IN TERMS OF DAM TYPE.

Earth dam type

Homogeneous earthfill

Zoned earthfill

Earthfill with core wall

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 32

Tutorial on Frequency Distribution and Pictorial Description of Data

For the following examples,

1- Prepare Frequency Distribution Table for Tutorial 1 to 8, Then

2- Whenever Relevant, present the data in form of

(a) Bar Charts/ multiple/ component bar charts

(b) Histograms and relative frequency Histogram

(c) Frequency Polygon and Relative Frequency Polygon

(d) Ogive (Whenever Relevant)

(e) Stem-and-leaf plot

(f) Dot Plot

(g) Pie Charts

Tutorial 2.1

The following data in Table 2.5 are the annual maximum flows in m3/s in the Colorado River at

Black Canyon for the 52-year period from 1878 to 1929. Prepare Frequency Distribution Table

Table 2.9: The annual maximum flows in m3/s in the Colorado River at Black Canyon for

the 52-year period from 1878 to 1929:

1980 1130 3120 2120 1700 2550 8500 3260 3960 2270

1700 1570 2830 2120 2410 2550 1980 2120 2410 2410

1420 1980 2690 3260 1840 2410 1840 3120 3290 3170

1980 4960 2120 2550 4250 1980 4670 1700 2410 4550

2690 2270 5660 5950 3400 3120 2070 1470 2410 3310

3230 3090

[Adapted from E. J. Gumbel (1954), “Statistical theory of extreme values and

some practical applications,” National Bureau of Standards, Applied

Mathematics Series 33, U.S. Govt. Printing Office, Washington, DC.]

Tutorial 2.2

For the following data shown in Table 2.6, Prepare Frequency Distribution.

Table 2.10: EPA Mileage Rating on 30 Cars Miles Per Gallon

36.3 30.1 40.5 36.2 42.1 38.5 37.5 40.0 35.6 38.8

38.4 37.1 37.0 41.0 36.3 38.6 35.9 40.2 32.9 44.9

39.8 39.9 38.1 34.8 33.9 36.7 37.0 37.1 39.0 40.5

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 33

Tutorial 2.3

Concrete cube test. From 28-day concrete cube tests made in England in 1990, the following

results of maximum load at failure in kilonewtons and compressive strength in newtons per square

millimeter were obtained:

Maximum load: 950, 972, 981, 895, 908, 995, 646, 987, 940, 937, 846, 947, 827,

961, 935, 956.

Compressive strength: 42.25, 43.25, 43.50, 39.25, 40.25, 44.25, 28.75, 44.25, 41.75,

41.75, 38.00, 42.50, 36.75, 42.75, 42.00, 33.50.

Prepare two separate Frequency Distribution Table for maximum load and compressive

strength.

Tutorial 2.4

Annual flows of the Derwent at Yorkshire Bridge, England, for the period 1938–1967 are shown in

Table 2.11 in millimeters of equivalent runoff over the catchment area above the site. Prepare

Frequency Distribution Table

Table 2.11: Annual flows in millimeters of equivalent runoff over the catchment area

946 1074 867 1058 838 837 1133 815 1138 869

910 927 1193 1386 737 1113 1187 955 891 1302

868 969 742 955 665 1143 947 763 1288 1029.

Tutorial 2.5

Water quality. Water quality measurements are taken daily on the River Ouse at Clapham,

England. The concentrations of chlorides and phosphates in solution, given below in milligrams per

liter, are determined over a 30-day period.

Chloride: 64.0, 66.0, 64.0, 62.0, 65.0, 64.0, 64.0, 65.0, 65.0, 67.0, 67.0, 74.0, 69.0, 68.0, 68.0, 69.0,

63.0, 68.0, 66.0, 66.0, 65.0, 64.0, 63.0, 66.0, 55.0, 69.0, 65.0, 61.0, 62.0, 62.0.

Phosphate: 1.31, 1.39, 1.59, 1.68, 1.89, 1.98, 1.97, 1.99, 1.98, 2.15, 2.12, 1.90 1.92, 2.00, 1.90,

1.74, 1.81, 1.86, 1.86, 1.65, 1.58, 1.74, 1.89, 1.94, 2.07, 1.58, 1.93, 1.72, 1.73, 1.82.

Prepare two separate Frequency Distribution Table for maximum load and compressive

strength.

Tutorial 2.6

Hurricane damage is of great concern to civil engineers. The frequencies of hurricanes affecting the

east coast of the United States each year during a period of 69 years are given as follows by H. C. S.

Thom (1966), Some Methods of Climatological Analysis, World Meteorological Organisation,

Geneva. Prepare Frequency Distribution Table

College of Engineering Engineering Statistics

Department of Dam & Water Resources Lecturer: Goran Adil & Chenar

Pictorial Description of Data Chapter 2-2 34

Table 2.12 Number of Hurricanes versus Frequency

Number of

hurricanes

0 1 2 3 4 5 6 7 8 9

Frequency 1 6 10 16 19 5 7 3 1 1

Tutorial 2.7

Distribution of concrete densities. Let us reconsider the distribution of the

concrete densities given in table 2.9. Prepare Frequency Distribution Table.

Table 2.13: Results of Concrete Sample Density

2411 2415 2425 2427 2427 2428 2429 2433 2435 2435

2436 2436 2436 2436 2437 2437 2441 2441 2444 2445

2445 2446 2447 2447 2448 2448 2449 2450 2454 2454

2455 2456 2456 2457 2458 2469 2471 2472 2473 2488

Tutorial 2.8

Annual maximum flow of Tevere (Tiber) River observed at Ripetta, a gauging station in Rome,

central Italy, from 1921 to 1974 are shown in Table below. Prepare Frequency Distribution Table.

Table 2.14 Annual Maximum Flow m3/sec