You are on page 1of 8

EPIDEMIOLOGY

STATISTICS
Group Members:
Thaishri Raja Ram 2061152002
Pavithradevi Kumaran 2061152016
Darshini Selvanathan 2061152004
Selvrubhini Ramish 2061152006
1
UNIVARIABLE DATA ANALYSIS
In univariable analysis, association of one explanatory variable at a
time with the outcome without worrying about other variables or
confounders (unconditional association) is tested.
This is important to shortlist variables for multivariable analysis,
especially if there are a large number of explanatory variables. It also
excludes the variables from further analysis that do not show any
significant association with the outcome on their own as they are not
likely to be associated with the outcome after adjusting for other
variables.
Results of univariable logistic regression analysis include statistics and
P-values, parameter estimates and standard errors, and odds ratios and
their confidence limits.
P-value is the probability of obtaining the value of test statistic at
least as extreme as obtained, given that the null hypothesis is true.
Low P-value - little probability of null hypothesis being true.
However, P-value does not give you any idea of the magnitude and 2
variability of association which is obtained by examining parameter
MULTIVARIATE DATA
ANALYSIS
Multivariate statistical analysis refers to multiple advanced techniques for
examining relationships among multiple variables at the same time.
Researchers use multivariate procedures in studies that involve more
than one dependent variable (also known as the outcome or
phenomenon of interest), more than one independent variable (also known
as a predictor) or both. This type of analysis is desirable because
researchers often hypothesize that a given outcome of interest is effected
or influenced by more than one thing.
There are many statistical techniques for conducting multivariate analysis,
and the most appropriate technique for a given study varies with the type
of study and the key research questions. Four of the most common
multivariate techniques:
i. Multiple regression analysis
ii. Factor analysis
iii. Path analysis
iv. Multiple analysis of variance or MANOVA
3
ODD RATIO
An odds ratio (OR) is a measure of association between an exposure and an
outcome. The OR represents the odds that an outcome will occur given a
particular exposure, compared to the odds of the outcome occurring in the
absence of that exposure. Odds ratios are most commonly used in case-control
studies, however they can also be used in cross-sectional and cohort study
designs as well.
Odds ratios are used to compare the relative odds of the occurrence of the
outcome of interest (e.g. disease or disorder), given exposure to the variable of
interest (e.g. health characteristic, aspect of medical history).
The odds ratio can also be used to determine whether a particular exposure is a
risk factor for a particular outcome, and to compare the magnitude of various
risk factors for that outcome.
OR=1 Exposure does not affect odds of outcome
OR>1 Exposure associated with higher odds of outcome
OR<1 Exposure associated with lower odds of outcome
An odds ratio of 1 indicates no association, whereas values >1 and <1
respectively, indicate positive and negative association of the tested variable
with the outcome.
Example: From the article the exposure was insecticide treated bed nets and the4
outcome will be positive or negative clinical malaria
ADJUSTED OR
Odds ratio or Crude Odds ratio are obtained when you are considering the effect
of only one predictor variable (example equation consists of only one
independent variable. However when include more variables in the analysis
(confounder variables for the said relationship) we get adjusted Odds Ratio,
which takes into account the effect due to all the additional variables included in
the analysis.
Unadjusted OR is a simple ratio of probabilities of outcome in two groups P1,P2
and it can be derived from the result of logistic regression (as opposed to
counting a simple ratio calculated by hand from a 2x22x2 table). However, in
logistic regression you can include other, confounding variables so to control
their influence on dependent variable and if we do so, we get OR that is adjusted
for the influence of confounders. Therefore, we adjust by controlling additional
variables in logistic regression model.

OR =

5
CONFIDENCE INTERVAL
(CI)
The 95% confidence interval (CI) is used to estimate the precision of the
OR. A large CI indicates a low level of precision of the OR, whereas a
small CI indicates a higher precision of the OR. It is important to note
however, that unlike the p value, the 95% CI does not report a measures
statistical significance. In practice, the 95% CI is often used as a proxy for
the presence of statistical significance if it does not overlap the null value
(e.g. OR=1). Nevertheless, it would be inappropriate to interpret an OR
with 95% CI that spans the null value as indicating evidence for lack of
association between the exposure and outcome.

6
P -VALUE
A p-value helps us determine the significance of our results. Hypothesis
tests are used to test the validity of a claim that is made about a
population. This claim thats on trial, in essence, is called the null
hypothesis.
The alternative hypothesisis the one we would believe if the null
hypothesis is concluded to be untrue. The evidence in the trial is our
data and the statistics that go along with it. All hypothesis tests
ultimately use a p-value to weigh the strength of the evidence (what the
data are telling you about the population). The p-value is a number
between 0 and 1 and interpreted in the following way:
A small p-value (typically 0.05) indicates strong evidence against
the null hypothesis, so you reject the null hypothesis.
A large p-value (> 0.05) indicates weak evidence against the null
hypothesis, so you fail to reject the null hypothesis.
p-values very close to the cutoff (0.05) are considered to be marginal
(could go either way). Always report the p-value so your readers can
draw their own conclusions. 7
References
1.
http://www.dummies.com/education/math/statistics/what-a-p-value-
tells-you-about-statistical-data/

2. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2938757/

3.
http://classroom.synonym.com/multivariate-statistical-analysis-2
448.html

4.
http://sydney.edu.au/vetscience/biostat/macros/logistic_tut_mode
l2.shtml

5.
http://stats.stackexchange.com/questions/130993/adjusted-odds-r
8

You might also like