|
So far we've talked about one way to use inferential statistics: hypothesis testing. In this lab, we're going to look at another way to use inferential statistics: to estimate the population mean from the sample.
This lab will focus on the basic logic of estimation using the tests we've talked about this semester (i.e., z-test and t-tests).
When do we use estimates?
(2) If you already know that there is an effect but you want to know how big it is. (3) After we do hypothesis testing and have rejected the H0 .
We'll focus on two kinds of estimates of the population mean.
(2) interval estimates (confidence intervals) of the mean: using a range of values as your estimate of an unknown quantity. When an interval is accompanied with a specific level of confidence (or probability) , it is called a confidence interval. Both kinds of estimates are determined by the same equation, the difference is that for the point estimates, we'll just compute a single number (that's why it is called a point estimate), but for the interval estimate, we'll compute an interval between two points. Let's start at the conceptual level. Consider the following population distribution.
Suppose that we guess that the mean is somewhere between 71 & 99? How confident are we in this guess?
Remember what the confidence interval is: it is an interval of estimates of the population mean based on the data from a sample. So a 90% confidence interval means that for 90% of our interval estimates will contain the actual population mean (of course that also means the 10% of our confidence intervals won't contain the actual population mean).
Consider a point estimate of the mean. What will be the best single estimate of the population mean?
population sample means However, suppose that all we have is a single sample. Now what is our best guess?
(2) Recall, that most of our sample means will be pretty close to the population mean, so we have a good chance that our sample mean is close. How can we get an estimate where we'd have a better chance of being right? Instead of giving a point estimate, we can estimate an interval.
What do the z-units correspond to? The standard error (the standard deviation of the distribution of sample means). The standard error is essentially the average amount that a sample mean will deviate from the population mean. In other words, most of the means will be close to M , but some are further away. The variability of these sample means represents the standard distance between mu and , or the "standard" error distance. It defines the relationship between sample size and the accuracy with which represents mu.
So far we have discussed the general concept of estimation. Now we will formalize things a bit (i.e., do the math). Let's first talk about the logic of estimation, and then move onto the actual formulas that we'll use.
Okay, so what's the formula? The formula to use will depend on the inferential test that is appropriate for the situation. It turns out that the math that we did when we discussed hypothesis tests is essentially the same math that we'll use for estimation. As was the case with hypothesis testing, the research design determines the formula that we use. Remember,
Estimation of the population mean using one sample and the population standard deviation(the 1-sample z design)
step 2: and we see that µ = is our most reasonable estimate. Okay, so that's the formula for point estimation. What about for an interval estimation?
µ = + (z)() = 85 +
(1.65)(5/sqroot 25) = 86.65 The above example, used z-scores. The same logic will apply to the other statistics we've talked about (e.g., t-statistics). Here is a list of all the formulas for estimation by test:
(b) Suppose that the same result, mean = 275, had come from a sample of n = 250 students. What is the 95% confidence interval for the mean score m in the population of students. (c) Suppose that the same result, mean = 275, had come from a sample of n = 4000 students. What is the 95% confidence interval for the mean score m in the population of students.
Now try estimation using other research designs we've talked about (you'll need to figure out which test to use before you start the problem). (2) Suppose that you give a sample of 244 students a test of verbal skills twice, once under normal conditions and once with a noise distraction. You find that the mean of the difference scores is = 315 and SS of the difference scores is 25.
(b) Suppose that the same result, mean = 315, had come from a sample of n = 244 students. What is the 90% confidence interval for the mean score m in the population of students. (c) Suppose that the same result, mean = 325, had come from a sample of n = 244 students. What is the 80% confidence interval for the mean score m in the population of students. (4) A professor notices that students who get a A in physics have high grade point averages in their engineering courses. The professor selects a sample of n=16 engineering majors who have earned As in physics. The mean GPA in engineering courses for this sample is 3.30 with SS = 10. Use this sample to find the 99% confidence interval for the population mean. (5) How do confidence intervals behave?
(b) As the variability gets smaller, what happens to the confidence interval? (c) As you increase sample size n, what happens to the confidence interval?
A few cautions about estimation
|
|||||||||||||||||||||||||||
Estimating Confidence Intervals with SPSSEstimating the population mean of a sample (One-sample t-test)When you wish to estimate the population mean from a sample of scores in SPSS, the procedure is very simple. Click Analyze→Compare Means→One Sample t-test Select the variable in question in the list in at the left and click the arrow button to move the variable to the list at the right. Leave the Test Value box as 0. If a confidence level other than 95% is desired, click the Options button. Click OK when ready. Find the lower and upper bounds of the 95% confidence interval in the output. Estimating the population mean of the difference between two paired sample means (Paired-samples t-test)Proceed as if conducting a paired-samples t-test and then read the upper and lower bounds of the confidence interval from the output. Estimating the population mean of the difference between two independent sample means (Independent-samples t-test)Proceed as if conducting an independent-samples t-test and then read the upper and lower bounds of the confidence interval from the output. |