Psychology 240: Statistics 1 Lectures: Chapter 5

Psychology 240 Lectures
Chapter 5
Statistics 1

Illinois State University
J. Cooper Cutting
Fall 1998, Section 04

Your textbook:

Gravetter, F. J., Wallnau, L. B. (1996). Statistics for the Behavioral Sciences:
A First Course for Students of Psychology and Education, 4th Edition. New York: West Publishing.

Chapter 5: Z-Scores: Location of scores and standardized distributions

Descriptive statistics, like the mean and standard deviation, describe distributions by summarizing the center (central tendency) and spread (variability). While this isn't evey detail about a distribution, it does give us a pretty good picture of what the distribution looks like.

for most bell-shaped curves (e.g., symmetric and unimodal), the mean should be at the mid-point and the standard deviation should be somewhere half-way between the mean and the most extreme values.

Our goal is to be able to find our raw scores within the distribution, and to be able to describe where it falls.

A good point of reference is the mean (since it is usually easy to find). So a natural choice for describing the location of a data point would be the deviation score (x - m) or (x - ).

If we are only concerned about a single distribution, then this seems to be pretty easy to do. But, if we want to compare two scores from two distributions, then the situation gets much harder.

Consider the following situation. You take the ACT test and the SAT test. You get a 26 on the ACT and a 620 on the SAT. The college that you apply to only needs one score. Which do you want to send them (that is, which score is better, 26 or 620?).

It is hard to do a direct comparison here because the two distributions have different properties: different means, and different variabilities.

How might we go about it? 1) look at the distribution graphs, locate the scores and compare -- still hard to tell 2) think about cumulative percentiles and percentile ranks -- this will work 3) try and take the deviations and standard deviations into account

The comparison that we just did is what z-scores are all about.

So to be able to make a comparison, one approach would be to transform both distributions into a standardized distribution.

standardized distribution

In other words, we need to convert the two distributions into a form that we can make a comparison. For example, we can transform these data into z-scores. That is what we'll do is convert every score in the distribution into a standardized score, making the overall distribution standardized.

standard score

A z-score specifies the precise location of each X value within a distribution. The sign of the z-score (+ or -) signifies whether the score is above the mean or below the mean. The numerical value of the z-score specifies the distance from the mean by counting the number of standard deviations between X and m.

For z-scores the mean of the distribution is always 0 and the standard deviation is always 1.

So what this means is that a z-score of 1, means that the data point is exactly 1 standard deviation away from the mean. If it is a positive 1, it means that the score is 1 standard deviation above the mean, if it is a negative 1, then it means that the score is 1 standard deviation below the mean.

So how do we do this transformation?

population

sample

Z = deviation = standard deviation
=

Now let's return to our ACT and SAT example. Notice what we did there, we subtracted the the distribution means from the scores, and then we divided by their standard deviations. In otherwords what we did was transform them into Z-scores. And then we made the comparisons based on those Z-scores.

We can transform any & all observations or values from a distribution to a z-score if we know either the m & s, or the & s.

We can also transform a z-score back into a raw score if we know the mean and standard deviation information of the original distribution. Let's look at the algebra.

Z = (X - m) --> (Z)( s) = (X - m) --> X = (Z)( s) + m s

So suppose that you know somebody else who said that they go 2 SD above the mean on the SAT. How would we go about figuring out their score?

2 SD above = Z of 2.0

we know that the mean of SAT = 500, and the SD = 100, so we just plug in the numbers

Properties of the z-score distribution.

Shape - the shape of the z-score distribution will be exactly the same as the original distribution of raw scores. Every score stays in the exact same position relative to every other score in the distribution.

Mean - when raw scores are transformed into z-scores, the mean will always = 0.

	Z =   (X - m)	
	         s

m = 100; s = 10 (100 - 100) / 10 = 0
m = 200; s = 10 (200 - 200) / 10 = 0
m = 100; s = 20 (100 - 100) / 20 = 0

The standard deviation - when any distribution of raw scores is transformed into z-scores the standard deviation will always = 1.

	Z =   (X - m)	
	         s

m = 100; s = 10 (110 - 100) / 10 = 1
m = 200; s = 10 (210 - 200) / 10 = 1
m = 100; s = 20 (120 - 100) / 20 = 1

The transformation procedure really is just a way of relabeling the axis of the distribution. So imagine that you leave the curve alone, but just draw new labels on the X-axis -- centering it on 0 and making each SD interval equal to 1.

EXAMPLE: Heights and weights of the men in stats 240 sec 04 (who responded)

person height weight
1 66 203
2 71 174
3 74 223
4 69 175
5 70 144
6 74 219
7 73 184
8 69 237
9 69 204
10 75 237
sum 710 2000

height² weight²
4356 41209
5041 30276
5476 49729
4761 30625
4900 20736
5476 47961
5329 33856
4761 56169
4761 41616
5625 56169
50,486 408,346

height
m = 710 / 10 = 71.0 SS = 50486 - (710)2 / 10 = 76.0
s = 2.8 weight
m = 2000 / 10 = 200.0 SS = 408346 - (2000)2 / 10 = 8346.0
s = 28.9

Z = (X - m) s Z1 = (66 - 71)/2.8 = -1.8
Z2 = (71 - 71)/2.8 = 0
Z3 = (74 - 71)/2.8 = 1.1
Z4 = (69 - 71)/2.8 = -0.7
Z5 = (70 - 71)/2.8 = -0.4
Z6 = (74 - 71)/2.8 = 1.1
Z7 = (73 - 71)/2.8 = 0.7
Z8 = (69 - 71)/2.8 = -0.7
Z9 = (69 - 71)/2.8 = -0.7
Z10 = (75 - 71)/2.8 = 1.4
Z = (X - m) s Z1 = (203 - 200)/28.9 = 0.1
Z2 = (174 - 200)/28.9 = -0.9
Z3 = (223 - 200)/28.9 = 0.8
Z4 = (175 - 200)/28.9 = -0.9
Z5 = (144 - 200)/28.9 = -1.9
Z6 = (219 - 200)/28.9 = 0.7
Z7 = (184 - 200)/28.9 = -0.6
Z8 = (237 - 200)/28.9 = 1.3
Z9 = (204 - 200)/28.9 = 0.1
Z10 = (237 - 200)/28.9 = 1.3

notice that the sums of the z-scores = 0 so the mean of the z-scores = 0 the standard deviations = 1 (a little off here due to round off)

So now we can compare for each person where they are in the two distributions and how their weights and heights compare to one another ("too tall for my weight" or "just right", etc.)

Person #4:

on the other hand:

Person # 8:

US male height mean is around 5'9" (69 inches), stdev ??? US male weight mean is around ???, stdev ???

So if we wanted to know how our mean corresponds with the US or even the world-wide population of males, how would we go about it?

Well, the numbers that we have are descriptive statistics, to go from samples to populations we'll need to start thinking about inferential statistics.

We'll start getting there next time, in chapter 6, when we begin our discussion of probabilities. Remember that what we'll be doing is using our sample statistics to make estimates of population parameters. These estimates/relationships are described in terms of probabilities.

Go to Chapter 4: Variability
Go to Chapter 6: Probability

Return to Psych 240 syllabus page
Return to Psych 345 syllabus page
Return to Statistics Lectures page

Return to Illinois State University Home Page
Return to Illinois State University Psychology Home Page

height m = 710 / 10 = 71.0 SS = 50486 - (710)2 / 10 = 76.0 s = 2.8	weight m = 2000 / 10 = 200.0 SS = 408346 - (2000)2 / 10 = 8346.0 s = 28.9

Z = (X - m) s Z1 = (66 - 71)/2.8 = -1.8 Z2 = (71 - 71)/2.8 = 0 Z3 = (74 - 71)/2.8 = 1.1 Z4 = (69 - 71)/2.8 = -0.7 Z5 = (70 - 71)/2.8 = -0.4 Z6 = (74 - 71)/2.8 = 1.1 Z7 = (73 - 71)/2.8 = 0.7 Z8 = (69 - 71)/2.8 = -0.7 Z9 = (69 - 71)/2.8 = -0.7 Z10 = (75 - 71)/2.8 = 1.4	Z = (X - m) s Z1 = (203 - 200)/28.9 = 0.1 Z2 = (174 - 200)/28.9 = -0.9 Z3 = (223 - 200)/28.9 = 0.8 Z4 = (175 - 200)/28.9 = -0.9 Z5 = (144 - 200)/28.9 = -1.9 Z6 = (219 - 200)/28.9 = 0.7 Z7 = (184 - 200)/28.9 = -0.6 Z8 = (237 - 200)/28.9 = 1.3 Z9 = (204 - 200)/28.9 = 0.1 Z10 = (237 - 200)/28.9 = 1.3
notice that the sums of the z-scores = 0 so the mean of the z-scores = 0	the standard deviations = 1 (a little off here due to round off)

Psychology 240 LecturesChapter 5 Statistics 1

Illinois State University J. Cooper Cutting Fall 1998, Section 04

If you have any questions, please feel free to contact me at cutting@main.psy.ilstu.edu.

Psychology 240 Lectures
Chapter 5
Statistics 1

Illinois State University
J. Cooper Cutting
Fall 1998, Section 04