You are on page 1of 28

Data analysis

establishment of appropirate category


Suppose the researcher is analysing the inconvenience the car owner is facing with his present model Inconvenience to enter the backseat Inconvenience due to insufficient legroom Inconvenience with respect to interior Inconvenience in door locking and dicky opening Each is assingned a different number for the purpose of codification

Mutually exclusive
The answers given by the respondents should be placed under only one category Eg., occupational of an individual may be answered as 1) professional 2) sales 3) executive 4) manager etc.

Example
questions Do you own a vehicle What is your occupation answer Yes NO Salaried Business Retired Technical consultant Codes 1 2 S B R T C

Tabulation
Tabulation refers to counting the number of cases that fall into various categories it involves Sorting and counting Summarizing of data

Types of tabulation
Simple tabulation Single variable is counted Cross tabulation It includes two or more variables which are treated simultaneously

Sorting and counting of data


Income 1000 1500 2000 2500 tally mark IIII IIII III IIII IIII II IIII IIII IIII I frequencies 5 8 12 16

Table method
The tabulation may include table number,title, head note, stub, caption, sub entries, body of the table, footnote, and source

Table method
Table TITLE-No of children per family Head Note- Unit of measurement total Sub heading caption
body Foot note

Kinds of tabulation
Simple or one way tabulation The multiple choice questions which allow only one answer may use one way tabulation there may be two types of univariate tabulation Questions with only response Multiple response to question

Questions with only one response


No of children 0 Family 10 Percentage 5

1
2 3 4 More than 4

30
70 60 20 10 200

15
35 30 10 6 100

Questions with multiple response


Sometimes respondents may give more than one answer to a given question in this case there will be an overlap and response when tabulated need not add to100 percent Example What do you dislike about the car which you own at present

Choice of an automobile
parameter No of respondents
10
15 15

engine
Body Mileage

Interior
Colour Maintenance frequency

06
18 16 20

inconvenience

There is duplication because respondents may be dissatisfied with mileage given by vehicle and also may dislike interior of the car and also may dislike interior of the car

Suppose we are tabulating the cause of inconvenience felt by the car owner, it can be classsified as follows Cramped problem Rear seat problem Difficulty to raising the window Difficulty in locking the door Now tabulation of each of factor would help to identify the real problem for dislike

Cross tabulation or two way tabulation


This is known as bivariate tabulation the data may include 2 or more variable Example Popularity of health drink among families having different income suppose 500 families are met and data collected as follows

Use of health drink


Income no of children per family per month
0 <1000 1001-2000 2001-3000 3001-4000 4001-5000 >5000 5 10 20 12 6 6 59 1 0 5 10 3 2 1 21 2 8 8 12 6 6 4 44 3 9 10 14 7 5 5 50 4 11 13 20 13 10 7 74 5 15 18 22 20 15 10 100 More than 5 25 27 32 30 20 18 152 73 91 130 91 64 51 500

No of famili es

Above table shows that consumption of health drink not only depends on income but also on the number of children per family

Summarising the data


Before making summarizing the data should be classified into 1)relevant data 2) irrelevant data During the field study the researcher has collected lots of data which he may think would be use

Summarising the data includesclassification of data Frequency distribution Use of appropirate statistical tool

Classification of data
Number of groups Number of groups should be sufficient to record all possible data classification should not be too narrow

example
If a researcher is conducting a survey on why the current car owner dislikes the car the car owner may indicate the following Difficulty in seeking entry to the back seat Interior space Cramped leg room Mileage Rattling of the engine Dicky space

All the above data can be classified into 2 or 3 categories Discomfort Expense Pride Safety Design of the car

WIDTH OF THE CLASS INTERVAL Class interval should be uniform and should be of equal width This will give consistency in the data distribution EXCLUSIVE CATEGORIES Classification made should be done in such a way that the response can be placed in only one category

example
Problem of leg room is the answer by the respondent this should be placed under discomfort or design but not both EXHAUSTIVE CATEGOIES This should be made to include all responses including dont know answers sometimes this will influence the ultimate answer to the research problem AVOID EXTREMES Avoid open ended class interval

Usage of statistical tools


FREQUENCY DISTRIBUTION Frequency distribution simply reports the number of responses that each question received frequency distribution organizes data into class or groups it shows the number of data that falls into particular class

example
income 4000-6999 7000-9999 10000-12999 No of people 100 122 140

Central value or tendency plays a very important role in research The researcher may be interested in knowing the average sales/shop, average consumption per month

You might also like