Professional Documents
Culture Documents
What is science?
The systemic study of the structure and
behavior of the physical world, involving
experimentation and measurement and
the development of theories to describe
the results of these activities.
Cambridge International Dictionary of English
What is an experiment?
A test done in order to learn something or
to discover whether something works or is
true (plausible).
Experiments are done to prove the
hypothesis is false.
Only falsification is possible with certainty;
proving something can never be done with
certainty
Galileo Galilei
(1564-1642)
Father of modern physics
Father of modern
observational astronomy
Father of modern science
Pioneered the use of
quantitative experiments
whose results could be
analyzed with
mathematical precision
How many of you accept that the sun goes round the earth?
How many of you accept that the earth goes round the sun?
Types of Experiments
Experiments are done to test a hypothesis;
a prediction.
Observational
Experimental
Quasi-experimental
Experimental Design
Whether or not a studys findings are
useful or not depends crucially on design.
No matter how ingenious or important an
idea for an experiment might be, if the
study is badly designed, its worthless.
An excellent procedure for determining
cause and effect.
Does night cause day or does day cause night?
Reliable
Potentially replicable by yourself or anyone else
Important
Subjective
Score or Measurement
Types of Measurement
Nominal
Ordinal
Interval
Ratio
Experimental Design
This is important so that the results can be
interpreted and other causes can be
excluded from the most likely conclusion.
The results have to be valid, reliable and
generalizable (important)
Simplest Experiment
Measurement
Baseline
Treatment
Measurement
Treatment
Control
Measurement
Treatment
Experimental
Measurement
frequency
Population distribution
treatment
control
2cm
height
100 cm
Randomization
Post-test/control group
Independent variable and dependent variable
Treatment
Control
Measurement
Treatment
Experimental
Measurement
Random allocation
Measurement
Measurement
Treatment
Control
Measurement
Treatment
Experimental
Measurement
Between-group Design
Advantages
Simplicity
Less chance of practice or fatigue
Useful when it is impossible for an individual
to participate in all experimental conditions
Disadvantages
Expense in terms of time, effort and number
Repeated-Measure Design
Treatment
Measurement
No Treatment
Measurement
No Treatment
Measurement
Treatment
Measurement
Repeated-Measures Design
Advantages
Economy
Sensitivity
Disadvantages
Carry-over effect
The need for the condition to be reversible
Repeated-Measure Design
Treatment
Measurement
No Treatment
Measurement
No Treatment
Measurement
Treatment
Measurement
Random allocation
Measurement
Measurement
Measurement
Measurement
Treatment
Control
Treatment
Level A
Treatment
Level B
Treatment
Level C
Measurement
Measurement
Measurement
Measurement
Example
Three conditions A, B,
and C.
Avoid systemic order
effect
6 possible
combinations of order
Order of conditions
or trials
Group 1
Group 2
Group 3
Multi-Factorial Design
Two or three independent variables in the
same study
Not recommended to use more than 2 as
the analysis gets quite complicated and
conclusion have to be qualified.
Gender and treatment
Day and performance
Correlations
How changes in one variable alters
another variable?
Can have more than one variable and
determine the contribution or weight or %
that each variable has on the measured
value.
Multi-factorial, multi-dimensions, e.g.
physics
Correlations are not cause and effect
Statistical Analysis
STATISTICS
Descriptive
Inferential
Graphs/tables
Numerical summary
Calculation of probability
Tests of significance
Measurement
Types of measurements
Nominal
Ordinal
Interval
Ratio
Descriptive
Statistics
Summarize the data
Spread of data
Graphs
Tables
Mean
Mode
Median
Line graphs
Histograms
Pie charts
The closer the mean is to all the data points the more representative is
the mean of the data
mean
5
4
mean
3
2
1
0
0
Distribution of data
Means are the
same
Distribution of the
scores is different
Range
Distribution measures
Range
Quartile
Mean deviation
Sum of squared deviation
Mean of the sum of squared deviation
variance
Standard deviation
Standard error of the mean (sem)
1 sd
68%
2 sd
96%
1.96 sd 95%
Population
Parameters
(mu, mean)
(sigma, standard
deviation)
Sample
Parameters
X (mean)
Sd (standard
deviation
estimate
X = 10
X = 11
X=9
X = 12
X = 11
X = 10
X = 10
The question
what is the probability that the difference
between the means is due to chance and
not due to my manipulation?
Test Statistic
A test statistic (t, F, z) is a statistic with a known
distribution.
e.g. Age and death
Sample size
Data distribution
Homogeneity of variance
Independent measurements or repeated
measurements
Comparison between 2 groups
Comparison between 3 or more groups
Type of data
Parametric or Non-parametric
Parametric Tests
1. Arithmetic means
Interval or ratio data
3. Normal distribution
1. Tests
Plot histogram
Kolmogorov-Smirnov test
Shapiro-Wilk test
Parametric tests
Students t-test
Independent t-test
Dependent t-test
Students t-test
Only two groups of data
Sample size < 30
Ratio difference between the means /
estimated s.e. of the difference between
those two sample means.
Independent t-test
unpaired
Two groups
Different participants, i.e. each participant
contributes only one data point
Normal distribution
Levenes test not significant (p = 0.492)
Presenting the data:
m = 24.2, se = 1.49; m = 20.00, se = 1.30.
t(18) = 2.12, p < 0.05, r = 0.45.
One-Way ANOVA
Two-Way ANOVA
Repeated measures ANOVA
Mixed ANOVA
One-Way ANOVA
3 or more groups
Different participants in each group
One-way for 1 independent variable
Groups are independent, i.e. data in one group
does not influence the data in another group
Homogeneity of variance
If Levenes test is significant
Use non-parametric tests or transform the data
df
Mean
square
Sig
Between
groups
450.664
90.133
269.733
0.000
Within
groups
38.094
114
0.334
Total
488.758
Presentation of results
F (5, 114) = 269.73, p < 0.05
Post-hoc tests
9 REGWQ or HSD
Equal sample size and homogeneity of variance
9 Bonferroni
Conservative, effected by group variance
9 Gabriels Procedure
Slight differences in sample size
9 Hochbergs GT2
Big differences in sample size
9 Games-Howell Procedure
If the variances are not equal
Assumption of sphericity
Equality of variance of the differences between
treatment levels
Mauchlys test
Non-Parametric Test
Assumption-free tests
Less strict about the distribution of the data
being analyzed.
Thus raw scores are used
Data is ranked
Lowest score rank 1, next lowest rank 2, etc
Basic tests
Mann-Whitney test
= independent t-test
Kruskal-Wallis test
= one-way ANOVA
Friedmans ANOVA
= one-way repeated measures ANOVA
Mann-Whitney Test
= independent t-test
Two conditions and different participants in each
condition
Kruskal-Wallis Test
= one-way ANOVA
More than 2 conditions and different participants have been used in
each condition.
Claude Bernard
(1813-1878)
Those whose minds are
bound and cramped. They
make poor observations,
because they choose
among the results of their
experiments, observation,
and reading only what suits
their object, neglecting
whatever is unrelated to it
and carefully setting aside
everything which might
tend toward the idea they
wish to combat.
Ignaz Semmelweis
(1818-1865)
Savior of mothers
10-35% mortality
Death of colleague Jacob
Kolletschka (1847)
Introduced lime washing
of hands
12.24% to 2.38%.
The Etiology, Concept
and Prophylaxis of
Childhood Fever (1861).
Semmelweis Reflex
Dismissing or rejecting out of hand
information, automatically, without thought,
inspection, or experiment.
References
Andy Field and Graham Hole. How to Design
and Report Experiments. Sage Publications.
London. 2006.
Martin Bland. An Introduction to Medical
Statistics. 2nd ed. Oxford Medical Publications.
1996.
Norman T. J. Bailey. Statistical Methods in
Biology. 3rd Ed. Cambridge University Press.
1995.
Beth Dawson and Robert G. Trapp. Basic and
Clinical Biostatistics. Lange Medical
Books/McGraw-Hill. 3rd ed. 2001.