Consider the following senario:
Three groups of children are selected: 3 yr olds, 8 yr olds, & 14 yr olds. Each group is read a Dr. Seuss story. Within each group, individuals are randomly assigned to one of three reading conditions. Reading sessions for group 1 only last 5 mins; 15 mins for group 2; and 30 mins for group 3. After two weeks the kids' reading ability is measured (with age appropriate standardized books).
Reading session duration | ||||
5 mins | 15 mins | 30 mins | Age |
3 yrs | 8 yrs | 14 yrs |
So we have not two samples, but 3 X 3 samples (that's 9). How do we analyze the data?
So far our Analysis of Variance (ANOVA) models have had only 1 factor (that's the "one-way" part). Often researchers aren't interested only in a single factor, but how multiple factors (independent variables) work together. The way the factors work together are called interaction effects.
In analysis of variance, a factor is an independent variable. A study that invloves only one independent variable is called a single-factor design. A study with more than one independent variable is called a factorial design.
The individual treatment conditions that make up a factor are called levels of the factor.
For the most part we will focus on a 2-Factor between groups ANOVA, although there are many other designs that use the same basic underlying concepts.
* One should always consider the interaction effects before trying to interpret the main effects. Sometimes the interaction effects are the "cause" of the apparent main effects.
Partioning the variance of 2 factor between groups ANOVA
A = main effect of A
B = main effect of B
AB = interaction of A and B
There are lots (eight) of different potential outcomes:
1) No effects at all
2) A only
3) B only
4) AB only
5) A & B
6) A & AB
7) B & AB
8) A & B & AB
Usually best way to look at your results are to look at a matrix that contains the cell means (the means of all of the separate conditions) and the marginal means (the means across the rows and down the columns).
Factor A | |||
B | B1 | ||
B2 | |||
A1 | A2 |
Okay, now let's look at some of these possible patterns of results.
Let's look at a more complex design to see a
more
comlicated interaction. In this example there were two factors: anxiety
(high med low) and test dificulty (hard medium easy). The dependent
variable is test performance.
Let’s add another
variable, test difficulty.
Each of these different designs has
advantages and
disadvantages.
Design
Advantages
Disadvantages
Two-level,
single factor
It is efficient
for determining if a variable has any effect
One cannot infer
shape of functions
Results are easy
to interpret and analyze
Interpolation
and extrapolation are dangerous
It is adequate
for some theory testing
Complex theories
are difficult to test
It is useful for applied comparisons
Multilevel
experiment, single factor
One can infer shape of functions
It requires more
participants or time
Range of
independent variable is less critical
Counterbalancing
is more ponderous
Statistics are
more difficult
Factorial
experiment
One can
investigate interactions
Experiments
become large as more factors are added
Adding factors
decreases variability, thus increasing statistical sensitivity
Statistics are
more difficult to assess
It increases
generalizability without decreasing precision
Higher-order
interactions are sometimes difficult to interpret
Converging-series
experiments
They offer more
flexibility than large factorial experiments
Interactions are
difficult to assess
They have
built-in replications
Between-experiment
comparisons are also between-subjects, with associated difficulties
One must analyze
prior experiment before doing the next
Partitioning of variance
Consider the data from a 2 X 3 between groups experimental design. There
are 6 separate conditions, each condition has 5 subjects in it (so N =
30).
As was the case with the 1-way between groups ANOVA, we need to compute
the
sums, means, and Sums of Squares for each condition (this information is
in blue) and for the overall data (grand in green). However, we've also
got to compute the sums, means, and SS for each of the factors (A
collapsed across B, in red & B collapsed across A, in purple).
Recall that our first stage of partitioning the variance is basically the
same as we did for 1-way between groups ANOVA.
Now for the new Sums of Squares:
SSbetween A = SSA =
SSinteractionAxB = SSAxB = SSbetween
treatments - SSA - SSB = 220 - 120 - 20 = 80
dftotal = N - 1 = 30 - 1 = 29
dfbetween treatments = total number of conditions - 1 = 6 -
1 = 5
dfwithin = N - total number of conditions = 30 - 6 = 24
dfA = KA - 1 = 2 - 1 = 1
dfB = KB - 1 = 3 - 1 = 2
dfAxB = dfbetween treatments - dfA -
dfB = 5 - 1 - 2 = 2
For all of our F's (we'll have three for this example) we will use the
MSwithin as our denominator.
MSwithin = SSwithin/dfwithin = 120/24 =
5.0
MSA = SSA/dfA = 120/1 = 120.0
MSB = SSB/dfB = 20/2 = 10.0
MSAxB = SSAxB/dfAxB = 80/2 = 40.0
FA = MSA/MSwithin = 120/5 = 24.0
FB = MSB/MSwithin = 10/5 = 2.0
FAxB = MSAxB/MSwithin = 40/5 = 8.0
Note: For each one you may need to look up a separate Fcritical
from the table because you may have different degrees of freedom for each
F. Assuming an alpha level of 0.05
Fcrit for Main effect of B = F(2,24) = 3.40, 2.0 < 3.40 so we
fail to reject the H0 for the main effect of B (so there are
not differences between B1, B2, and B3).
Fcrit for Interaction of AxB = F(2,24) = 3.40, 8.0 > 3.40 so we
reject the H0 for the interaction of A and B (so not all of the
6 conditions are equal).
Within each of these columns we specify what level each data point is
in.
Note that all of our raw scores from the dependent variable go into the
same column.
Below is what your output should look like. The most relevant results are
circled in red (I did this, SPSS doesn't circle any results for you)
Below is a set of data for you to practice with. The numbers are the raw
numbers (so that means there are 5 subjects in each condition, 20 subjects
total). I suggest that you try
it by hand first, then use SPSS to check your answers.
Click here to see the resulting ANOVA
table
B1 B2 B3
A1
5
3
3
8
69
9
13
6
83
8
3
3
3
A2
0
2
0
0
30
0
0
5
00
3
7
5
5
B1 B2 B3
A1
5
3
3
8
6
T = 25
SS = 18
mean = 5.0
9
9
13
6
8
T =
45
SS =
26
mean = 9.03
8
3
3
3
T =
20
SS =
20
mean = 4.0Total A1 = 90
mean A1 = 6.0
nA1 =
15
A2
0
2
0
0
3
T = 5
SS
=
8
mean = 1.00
0
0
5
0
T = 5
SS
=
20
mean = 1.00
3
7
5
5
T =
20
SS =
28
mean = 4.0Total A2 = 30
mean A2 = 2.0
nA2 =
15Total B1 = 30
mean B1 =
3.0
nB1 = 10Total B2 = 50
mean B2 = 5.0
nB2
= 10Total B3 = 40
mean B3 = 4.0
nB3
=
10
G = 120
Grand mean = 120/30 = 4.0
SStotal = S(X - grand
mean)2 = 340 (note: X refers to each of the thirty raw
scores)
Now we need the degrees of freedom
SSbetween treatments = Sn(condition
mean - grand mean)2 = 5(5-4)2 +
5(9-4)2 +
... + 5(4-4)2 = 220
SSwithin treatments = SSSeach
condition = 18 + 26 + 20 + 8 + 20 + 28 = 120
= 15(6-4)2 + 15(2-4)2 = 60 +
60 = 120
(note: you could use
SSbetween B = SSB = instead)
=
10(3-4)2 + 10(5-4)2 + 10(4-4)2 = 10 +
10 + 0 = 20
Now we need to compute our Mean Squares
Now we're finally ready for the three F-ratios
Fcrit for Main effect of A = F(1,24) = 4.26, 24.0 > 4.26 so
we reject the H0 for the main effect of A (so there is a
difference between A1 and A2).
Since this is a completely between groups Factorial design, we set it up
like we did the independent samples t-test. The difference is that instead
of having only one column for our Independent variable, here we have two
columns, one for factor A and one for Factor B.
The analysis for a completely between groups factorial ANOVA
is found in the General Linear Model, Univariate submenu.
Now you need to specify what independent variables you want
to enter into the factorial ANOVA analysis, and what dependent variable
should be used.
Factor A A1 A2 Factor
B B1
20
11
18
16
4
2
5
8B2
8
1
1
5
15
20
17
16
If you have any questions, please feel free to contact me at
jccutti@mail.ilstu.edu.