Psychology 340 Syllabus
Statistics for the Social Sciences

Illinois State University
J. Cooper Cutting
Fall 2002



Some Basics about Probability, Sampling Distributions, and the Normal Distribution

Pop quiz 1

  • Our distributions
  • the Central Limit theorem
  • the Normal Distribution
  • Using the Normal distribution

    So far we've pretty much focused on descriptive statistics, which are ways of describing distributions (e.g., means and standard deviations).

    The goal of inferential statistics is to make claims about population parameters based on sample statistics.

    Typically we can't measure the entire population of individuals that we're interested in. So instead we select a sub-set of individuals, which we call a sample. Then we measure our sample, and use those measurements to make estimates of population values (which correspond to those measurements, e.g., means and standard deviations). Our best estimate for the mean of the population will be the mean of our sample.

    It sounds simple and straight forward, but consider the following:

    Suppose that you take 3 different samples from the same population. They are going to be different from one another. They will have different shapes, different means, and different variability. So how do you figure out what the best estimate of the population mean is?

    How many possible samples can we take? Infinite (if we are sampling with replacement)

    Luckily for us, the huge set of possible samples forms a simple, orderly, and predictable pattern (a sampling distribution). Because of this, we are able to base our predictions about sample characteristics on the distribution of sample means.

    The distribution of sample means

    A simple example

    Properties of the distribution of sample means


    Normal distribution is a commonly found distribution that is symmetrical and unimodal. It is defined by the following equation:

                                    Y = 
    
    

    A few things to note about Normal Distributions.


    Let's return briefly to our simplified example.


    Using the Unit Normal Table to answer questions of probability

    Here is the "best" way to find a probability from the table:

    Here is the "best" way to find a Z-score from a probability:

    Sometimes we need to find the probability that X will fall between two scores rather than simply above a score or below a score.

    You might want to know what percentage lies outside two points (essentially the opposite of the last situation).

    Another thing that you can use the unit normal table for is to find percentile ranks and interquartile ranges

    Note there is a short-cut for figuring out the IQR. Since the range is always + .67s, then you can compute the IQR as being (2)(.67)(m)

    Now let's bring Samples back into the picture.

    So the take home message is: the smaller your sample size, the larger your sampling error (standard error, ).

    If you have any questions, please feel free to contact me at jccutti@mail.ilstu.edu.