Upload
jessica-phelps
View
220
Download
1
Tags:
Embed Size (px)
Citation preview
Introduction to Probability and Statistics
Consultation time:
Ms. Chong
OUTLINEOUTLINE
1-1 Introduction1-2 Descriptive and Inferential
Statistics1-3 Variables and Types of Data1-4 Data Collection and Sampling
Techniques1-5 Computers and Calculators
OBJECTIVESOBJECTIVESDemonstrate knowledge of all statistical
terms.
Differentiate between the two branches of statistics.
Identify types of data.
Identify the measurement level for each variable.
Identify the four basic sampling techniques.
Explain the importance of computers and calculators in statistics.
1-1 Introduction1-1 Introduction
What is statistics?
Number ?
Value ?
Data ?
Statistics consists of conducting studies to collect, organize, summarize, analyze, and draw conclusions from data.
Data are the values (measurements or observations) that the variables can assume.
Variables whose values are determined by chance are called random variables.
1-2 Descriptive and Inferential 1-2 Descriptive and Inferential StatisticsStatistics
Descriptive statistics consists of the collection, organization, summarization, and the presentation of data.
Inferential statistics consists of performing estimations and making decision about the whole population based on the information gained from limited samples.
Involve the use of
“statistical terms” for the
data descriptions.
“Describe” the data that
have been collected.
Descriptive Descriptive StatisticsStatistics
Examples of Descriptive Statistics
Six out of ten students in UCSI are local students.
Average score for Biology test is 75.8.
The average annual salary for staffs in Company A is RM 2500.
Draw conclusions and make decisions
Make predictions
Inferential Inferential StatisticsStatistics
Examples of Inferential Statistics
The population in Malaysia will increase by 5% in the coming five years.
It was estimated that the number of cars sold in next month is 50.
Making predictions and estimations
Relief constipati
on problem.
Patients with
constipation
problem.
Eat diet high in fiber.
Examples of Inferential Statistics
Diet high in fiber can help to reduce constipation problem.
Making Conclusion
A population consists of all subjects (human or otherwise) that are being studied.
A sample is a group of subjects selected from a population.
Population
Sample
• Generally, researchers generalize their findings on population based on the sample selected.
• The sample must be representative enough to reflect the whole population.
• The sample is considered representative if:
The sample size was large enough. The sample was selected randomly (no bias).
1-3 Variables and Types of Data1-3 Variables and Types of Data
Data
Qualitative Quantitative
Discrete Continuous
Qualitative VariablesQualitative Variables
Qualitative variables are variables that can be placed into distinct categories, according to the attribute or characteristic.
It also can be known as non-numerical variables.
Example: Gender
Nationality
Marital status
Quantitative VariablesQuantitative Variables
Quantitative variables are numerical and can be ordered or ranked.
It also can be known as numerical variables.
Example: Height
Age
Blood Pressure
Discrete VariablesDiscrete Variables
Discrete variables assume values that can be counted.
Example: Number of student, pairs of shoes, price, etc.
Continuous VariablesContinuous Variables
Continuous variables can assume all values between any two specific values.
They are obtained by measuring instead of counting.
Example: Weight, height, volume, pressure, etc.
Level of MeasurementLevel of Measurement
Classify data according to how they are categorized, counted, or measured.
There are four level of measurement.
Nominal
Interval
Ratio
Ordinal
Norminal level of measurementNorminal level of measurement
Classify data into mutually exclusive categories in
which no order or ranking can be imposed on the data.
Example: Course studied (Medicine/Food Science, etc.)
Eye colour (brown/blue, etc.)
Gender (Male/Female)
Ordinal level of measurementOrdinal level of measurement
Classify data into categories that can be ranked;
however, no precise differences exist between
the ranks.
Example: Grade (A, B, C, D)
Performance (poor, fair, good, excellent)
Judging (first place, second place, etc.)
PerformancePerformance
Poor Fair Good Excellent
Is the difference between poor and fair same as the difference between good excellent?
Are the differences measurable?
Interval level of measurement
Ranks data, and precise differences between the
units of measure do exist.
There is no true / meaningful zero.
Example: IQ (109, 110, etc.)
Temperature (- 5oC, 0oC, 8oC, 10oC, etc.)
Ratio level of measurement
Possesses all characteristics of interval measurement.
A true zero do exists.
Additionally, true ratios exist for the same variable.
Example: Salary (RM0, RM1000, RM3000, etc.)
Weight (0kg, 35kg, 70kg, etc.)
1-4 Data collection and 1-4 Data collection and Sampling Techniques Sampling Techniques
Data can be collected via different methods.
Survey is one of the most commonly used methods.
Surveys can be done through telephone, mail questionnaire, personal interviews, etc.
Sampling MethodsSampling Methods
There are four basic sampling methods are used by statisticians to obtain samples that are unbiased.
Random SamplingRandom Sampling
The samples are selected by chance methods or random numbers.
Systematic SamplingSystematic Sampling
1 2 3 4…10…15…20…25.…30…35
Cluster SamplingCluster Sampling
The population is divided into many clusters.
Samples are selected by using some intact groups that are representative of the population.
POPULATION
A C
D
B
E
A C
E
Samples
Stratified SamplingStratified Sampling
Population is divided into many groups (strata) according to some characteristics.
The subjects within groups are randomly selected as samples for study.
POPULATION
A
C
B
D
A
B
C
D
50 samples
50 samples
50 samples
50 samples
1-5 Computers and Calculators1-5 Computers and Calculators
Computers and calculators make numerical computation easier.
There are many statistical packages available such as Microsoft Excel, MINITAB, SPSS, Unscrambler, etc.
Data must still be understood and interpreted.