40
Business Research Methods Business Research- Data Analysis http://www.facebook.com/ mr.fortyseven

Business Research Data Analysis Ppt 111111020701 Phpapp02

Embed Size (px)

Citation preview

Page 1: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Business Research MethodsBusiness Research- Data Analysis

Page 2: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Topics to be coveredData Analysis : Editing, Coding,

Classification, Tabulation, Analysis and Interpretation

Statistical Analysis of Business Research: Bivariate Analysis (Chi-square)

Multivariate Analysis – Factor Analysis, Discriminant Analysis, Cluster Analysis, Conjoint Analysis

ANOVA – One-way & Two-way classification

Page 3: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Difference between Data and InformationAny raw facts or figures is known

as data.When the data is processed by

doing statistical analysis and some conclusion can be drawn from it, it is known as information.

Page 4: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Steps in Processing of DataQuestionnaire

checking

Editing

Coding

Tabulation

Data Cleaning

Statistically adjusting the data

Selecting a Data Analysis Strategy

Page 5: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Questionnaire checking – The initial step in questionnaire checking involves a check of all questionnaires for completeness and interviewing quality. A questionnaire returned from the field may be unacceptable for several reasons:

1. Part of the questionnaire may be incomplete.2. The pattern of responses may indicate that

the respondent did not understand or follow the instructions.

3. The responses show little variance.4. The questionnaire is answered by someone

who does not qualify for participation.5. The returned questionnaire is physically

incomplete, one or more pages are missing.

Page 6: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Editing – Review of the questionnaires with the objective of increasing accuracy and precision. It consists of screening questionnaires to identify illegible, incomplete, inconsistent or ambiguous responses. This can be done in two stages:

a) Field Editing – Objective of field editing is to make sure that proper procedure is followed in selecting the respondent, interview them and record their responses. The main problems faced in field editing are:

1. Inappropriate Respondents – Instead of house owners, tenant is interviewed.

2. Incomplete interviews, 3. Improper understanding, 4. Lack of consistency, 5. Legibility, 6, Fictitious interview – Questionnaires are filled by interviewer himself without conducting the interview.

b) Office Editing – It is more thorough than field editing. Problems of consistency, rapport with respondents are some of the issues which get highlighted during office editing.

Page 7: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Example of Inconsistency:A respondent indicated that he doesn’t drink coffee, but

when questioned about his favorite brand, he replied ‘BRU’.

Treatment of Unsatisfactory ResponsesReturning to the field – Questionnaires with

unsatisfactory responses may be returned to the field, where the interviewers recontact the respondents.

Assigning missing value – Editor may assign missing values to unsatisfactory responses. This approach may be desirable if 1) the number of respondents with unsatisfactory responses is small, 2) the proportion of unsatisfactory responses for each of these respondents is small, or 3) the variables with unsatisfactory responses are not the key variables.

Discarding unsatisfactory respondents – This is possible only when proportion of unsatisfactory respondents is small or the sample size is large.

Page 8: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Coding – Coding refers to those activities which helps in transforming edited questionnaires into a form that is ready for analysis. Coding speeds up the tabulation while editing eliminates errors. Coding involves assigning numbers or other symbols to answers so that the responses can be grouped into limited number of classes or categories. The code includes an indication of the column and data record it will occupy. For eg. Sex of respondents may be coded as 1for males and 2 for females.Questions Answers Codes

1. Do you own a vehicle?

Yes 1

No 22. What is your occupation?

Salaried S

Business BRetired R

Page 9: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Tabulation – Refers to counting the number of cases that fall into various categories. The results are summarized in the form of statistical tables. The raw data is divided into groups and sub-groups. The counting and placing of data in a particular group and sub-group are done. The tabulation involves:

1. Sorting and counting.2. Summarising of data.Tabulation may be of two types:3. Simple tabulation – In simple tabulation, a single

variable is counted.4. Cross tabulation – Includes two or more

variables, which are treated simultaneously.Tabulation can be done entirely by hand, or by

machine, or by both hand and machine.

Page 10: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Sorting and counting of data: Sorting can be done as follows:

Format of a Blank table Table No. TITLE – Number of children per familyHead Note – Unit of measurement

Income (Rs) Tally Marks Frequencies1000 IIII 41500 II 22000 III 3

Sub-Heading

Caption

Body

Foot note

Total

Sub heading indicates the

row title or the row headings.

Caption indicates what each column is

meant for.Body of the

table gives full information of the frequency.

Page 11: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Kinds of Tabulation1. Simple or one-way tabulation – The

multiple choice questions which allow only one answer may use on-way tabulation or univariate. The questions are predetermined and consist of counting the number of responses falling into a particular category and calculate the percentage.

Example Table 14.1: Study of number of children in a

familyNo. of children

Family Percentage

0 10 51 30 152 70 35

Page 12: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

2. Cross Tabulation or Two-way Tabulation – This is known as Bivariate Tabulation.The data may include two or more variables.

Eg. Popularity of a health drink among families having different incomes.

Table 14.3: Use of Health DrinkIncome per month

No. of children per family (0)

1 2 No. of families

1000 10 5 8 231001-2000

5 0 8 13

2001-3000

20 10 12 42

Page 13: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Data cleaning – Includes consistency checks and treatment of missing responses. Although preliminary consistency checks have been made during editing, the checks at this stage are more thorough and extensive, because they are made by computer.

Consistency checks – Identify data that are out of range, logically inconsistent or have extreme values. For eg. A respondent may indicate that she charges long distance calls to a calling card, although she does not have one.

Page 14: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Treatment of missing responses – Missing responses represent values of a variable that are unknown, either because respondents provided ambiguous answers or their answers were not properly recorded.

1. Substitute a Neutral Value – A neutral value, typically the mean to the variable, is substituted for the missing responses.

2. Substitute an Imputed Response – The respondent’s pattern of responses to other questions are used to impute or calculate a suitable response to the missing questions.

3. Casewise Deletion – Cases or respondents with any missing responses are discarded from the analysis.

4. Pairwise deletion – Instead of discarding all cases with any missing values, the researcher uses only the cases or respondents with complete responses for each calculation. As a result, different calculations in an analysis may be based on different sample sizes.

Page 15: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Statistically Adjusting the Data – If any correction needs to be done for the statistical analysis, the data is adjusted accordingly.

Selecting a Data Analysis Strategy – The selection of a data analysis strategy should be based on the earlier steps of the marketing research process, known characteristics of the data, properties of statistical techniques and the background and philosophy of the researcher.

Page 16: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Univariate TechniquesUnivariate techniques are appropriate when there is a single

measurement of each element in the sample, or there are several measurements of each element but each variable is analyzed in isolation.

Multivariate Techniques – Suitable for analyzing data when there are two or more measurements of each element and the variables are analyzed simultaneously. Concerned with the simultaneous relationships among two or more phenomena.

Multivariate techniques differ from univariate techniques in that they shift the focus away from the levels (averages) and distributions (variances) of the phenomena, concentrating instead upon the degree of relationships among these phenomena.

Page 17: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Bivariate Technique – Chi-squareMultivariate Techniques

Dependence Techniques are appropriate when one or more variables can be identified as dependent variables and the remaining as independent variables.

In interdependence techniques, the variables are not classified as dependent or independent, rather the whole set of interdependent relationships is examined.

Dependence Techniques

Interdependence Techniques

•Discriminant Analysis•Conjoint Analysis

•Factor Analysis•Cluster Analysis

Page 18: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Chi-square TestThe chi-square test is represented by the symbol χ2 and

owes its origin to greek letter chi. This test was first used by Karl Pearson and is one of the most widely used test today.

Through the test, we are able to determine the extent of difference between the theory or expected value and the observed or the actual value.

χ2 = ∑ (O-E)2 EWhere, O = Observed frequencies E = Expected frequenciesIt is particularly useful in tests involving nominal data. The chi

distribution is positive but an asymmetrical distribution. It has only one parameter, namely degrees of freedom. Degrees of

freedom refers to the number of classes to which a value can be assigned freely without exceeding the limitation placed.

Page 19: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Applications of chi-square:Chi-square as a test of independence – We

can establish if two or more attributes are associated or independent. Eg. A doctor may be interested in knowing if the new BCG vaccine is effective in controlling the target diseases or not. If there is any relationship between divorce rates and working wives.

We start with the null hypothesis that there is no association between the attributes specified .

Acceptance criteria – Calculated value is less than the table value, null hypothesis is accepted.

Rejection Criteria – Calculated value is greater than the table value, null hypothesis is rejected.

Page 20: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

As a test of goodness of fit – It can be used to determine how well a theoretical distribution (poisson or normal) fits on the observed data or how appropriately a theoretical distribution fits empirical distribution.

As a test of homogeneity – Used to find out if two or more randomly selected independent samples have been drawn from the same population or not.

Page 21: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Conditions for chi-square TestThe total number of items should be large

enough to guarantee a modem of similarity between the theoretically correct distribution and the sampling distribution being studied.

The number of items in each group should be reasonable.

The observation used in the study should be collected on the random basis so that there is no element of bias.

Data given should be in original and absolute form and not in relative form like proportions, percentages etc.

Page 22: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Merits of chi-square testIt can be used without making any

assumption about the form of parent distribution or its parameters, since it is based on observed frequencies and not on parameters like mean and standard deviation.

It is a distribution free test and hence can be used in any type of population distribution.

Additive property of chi-square test that allows the researcher to add the results of independent but related samples.

It is easy to calculate and interpret.

Page 23: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Demerits of chi-squareIt has some limitations

associated with it.It is used for testing hypothesis

but is not useful for estimation.It is not as reliable as parametric

tests.

Page 24: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Discriminant AnalysisIn this analysis two or more groups are compared.

Discriminant Analysis is a technique for analyzing data when the dependent variable is categorical and the independent variables are interval in nature. For eg. The dependent variable may be the choice of a brand of personal computer and the independent variables may be ratings of attributes of PCs on a seven-point Likert scale.

Eg. Where discriminant analysis is used:1. Those who buy our brand and those who buy

competitor’s brand.2. Heavy user, medium user and light user of the

product.

Page 25: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

The value of dependent variable is calculated by using the data of independent variable.

Z = b1x1+b2x2+b3x3+…………Where, Z = Discriminant score b1 = Discriminant weight for variable x1 = Independent variableThe objectives of discriminant analysis:Development of discriminant functions, or linear

combinations of independent variables, which will best discriminate between the categories of the dependent variable.

Examination of whether significant differences exist among the groups, in terms of independent variable.

Determination of which independent variables contribute to most of the intergroup differences.

Classification of cases to one of the groups based on the values of the independent variable.

Evaluation of the accuracy of classification.

Page 26: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Example of Discriminant VariableA study of 294 consumers was undertaken to determine the

correlates of rebate proneness, or the characteristics of consumers who respond favorably to rebate promotions.The independent variables were four factors related to household shopping attitudes and behaviors, and selected demographic characteristics (sex, age and income). The dependent variable was the respondents degree of rebate proneness, of which three levels were identified. Respondents who reported no rebate-triggered purchases during the past 12 months were classified as nonusers, those who reported one or two such purchases as light users and those with more than two purchases, frequent users of rebates.Two findngs emerged: Rebate sensitive customers associate less effort with fulfilling the requirements of the rebate purchase, And consumers who are aware of the regular prices of products are more likely to respond to rebate offers.

Page 27: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

When the dependent variable has two categories, the technique is known as two-group discriminant analysis. When three or more categories are involved, the technique is referred to as multiple discriminant analysis.

Example of Discriminant analysis:In terms of demographic characteristics,

how do consumers who exhibit store loyalty differ from those who do not?

Do heavy, medium, and light users of soft drinks differ in terms of their consumption of frozen foods?

Page 28: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Conjoint AnalysisConjoint Analysis attempts to determine the relative

importance consumers attach to salient attributes and the utilities they attach to the levels of attributes.

This information is derived from consumer’s evaluation of brands, or brand profiles composed of these attributes and their levels. The respondents are presented with stimuli that consist of combinations of attribute levels. They are asked to evaluate these attributes in terms of their desirability.

In a situation where the company would like to know the most desirable attributes or their combination for a new product or service, the use of conjoint analysis is most appropriate.

Page 29: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

ExampleAn airline would like to know, which is the most

desirable combination of attributes to a frequent traveller: a) Punctuality, b) Air fare, c)Quality of food served on the flight, and d) Hospitality and empathy shown.

A comparison between the utility of a price level of Rs. 400 versus Rs 500, a delivery period of 1 week versus 2 weeks or an after-sales response of 24 hours versus 48 hours.

Eg. Of Conjoint Analysis for a laptop:Weight (3kg or 5 kg), Battery life (2 hrs or 4 hrs), Brand

name ( Lenovo or Dell)Rank order the combination of the characteristics, i.e.

3kg, 2hrs, Lenovo 5kg, 4 hrs, Dell

Page 30: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Three steps in Conjoint Analysis Identification of relevant products or service attributes.Collection of data.Estimation of worth for the attribute chosen.

Conjoint Analysis has been used in marketing for a variety of purposes:

Determining the relative importance of attributes in the consumer choice process.

Estimating market share of brands that differ n attribute levels.

Determining the composition of the most preferred brand.Segmenting the market based on similarity of preferences

for attribute levels.

Page 31: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Factor AnalysisFactor Analysis is a general name

denoting a class of procedures primarily used for data reduction and summarization.

In marketing research, there may be a large number of variables, most of which are correlated and which must be reduced to a manageable level.

Relationships among sets of many interrelated variables are examined and represented in terms of a few underlying factors.

Page 32: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Example of Factor AnalysisRespondents in a survey were asked to rate the

importance of 15 bank attributes. A five point scale ranging from not important to very important was employed. These data were analyzed via principal component analysis.

A four factor solution resulted, with the factors being labeled as traditional services, convenience, visibility and competence. Traditional services included interest rates on loans, easy to read monthly statements and obtainability of loans. Convenience was comprised of convenient branch location, convenient ATM locations, speed of service, and convenient banking hours. The visibility factor included recommendations from friends and relatives and attractiveness of the physical structure. Competence consisted of employee competence and availability of extra services.

Page 33: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Two most commonly employed factor analysis procedures:

Principal Component Analysis – When the objective is to summarise information from a large set of variables into fewer factors, principal component factor analysis is used.

Common factor analysis – If the researcher wants to analyse the components of the main factor, common factor analysis is used.

Eg. Common Factor – Inconvenience inside a car.The components may be:

1. Leg Room2. Seat arrangement3. Entering the rear seat4. Door locking mechanism.

Page 34: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Principal Component Factor AnalysisCustomer feedback about a two wheeler

manufactured by a company.Identified six variables:1. Fuel efficiency2. Durability of life3. Comfort4. Spare parts availability5. Breakdown frequency6. PriceGrouping of 1,2,4,5 into Factor 1(Technical Factor)6 into Factor 2 ( Price Factor)3 into Factor 3 (Personal Factor)

Page 35: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Factor analysis is used in the following circumstances:

To identify underlying dimensions or factors that explain the correlations among a set of variables.

To identify a new, smaller set of uncorrelated variables to replace the original set of correlated variables.

To identify a smaller set of salient variables from a larger set for use in subsequent multivariate analysis.

Page 36: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Applications of Factor AnalysisIt can be used in market segmentation for

identifying the underlying variables on which to group the customers.

In product research, factor analysis can be employed to determine the brand attributes that influence consumer choice.

In advertising studies, to understand the media consumption habits of the target market.

In pricing studies, to identify the characteristics of price sensitive consumers.

Page 37: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Cluster AnalysisCluster Analysis is a class of techniques

used to classify objects or cases into relatively homogeneous groups called clusters. Objects in each cluster tend to be similar to each other and dissimilar to objects in the other clusters.

Cluster Analysis is used:To classify persons or objects into small

number of clusters or group.To identify specific customer customer

segment for the company’s brand.

Page 38: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Applications of Cluster AnalysisSegmenting the market: For eg. Consumers

may be clustered on the basis of benefits sought from the purchase of a product.

Understanding buyer behaviorsIdentifying new product opportunities – By

clustering brands and products, competitive sets within the markets can be determined. A firm can examine its current offerings compared to those of its competitors to identify potential new product opportunities.

Selecting test marketsReducing data- Data reduction tool to develop

clusters or subgroups of data.

Page 39: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Example of Cluster AnalysisIn a study examining decision making

pattern among international vacationers, 200 respondents were clustered into six clusters. Namely respondents preferring vacations for psychological purpose, educational purpose, social, relaxational , physiological and aesthetic purpose. The behavior of vacationers was studied and appropriate promotional strategies were designed for each of the clusters.

Page 40: Business Research Data Analysis Ppt 111111020701 Phpapp02

http://www.facebook.com/mr.fortyseven

Questions What is conjoint analysis? (3 Marks) What are editing and coding? 3 Marks Distinguish data from information. 3 Marks What are the measures of central tendency? 3 Marks What is cross tabulation? Give an example. 3 Marks Explain the problems during editing of data. 7 Marks What is editing? What is its importance? 7 Marks What is cluster analysis? What are the steps involved

in it? Explain. 7 Marks What is tabulation? What are the types of tabulation?

7 Marks Explain the problems during editing of data. 7 Marks Write short notes on the following:i) Conjoint Analysis, ii) Cluster Analysis, iii) Discriminant

Analysis, iv) Chi-square Test. 10 Marks