Difference Between Means
I surveyed 50 people from a poor area of town and 70 people from an affluent
area of town about their feelings towards minorities. I counted the
number of negative comments made. I was interested in comparing their
attitudes. The average number of negative
comments in the poor area was 14 and in the affluent area was
standard deviations were 5 and 4 respectively.
Let's determine a 95% confidence for the difference
in mean negative comments. First, we need some formulas.
For our investigation, we use s1 and s2 as point estimates for s1 and s2. We have
x1 = 14 x2 = 12 s1 = 5 s2 = 4 n1 = 50 n2 = 70
x1 - x2 = 14 - 12 = 2
The margin of error is
E = zcs = (1.96)(0.85) = 1.7
The confidence interval is
Note: To calculate the degrees of freedom, we can take the smaller of the two numbers n1 - 1 and n2 - 1. So in the prior example, a better estimate would use 49 degrees of freedom. The t-table gives a value of 2.014 for the t.95 value and the margin of error is
E = tcs = (2.014)(0.85) = 1.7119
which still rounds to 1.7. This is an example that demonstrates that using the t-table and z-table for large samples results in practically the same results.
Small Samples With Pooled Pooled Standard Deviations (Optional)
When the sample size is small, we can still run the statistics provided the distributions are approximately normal. If in addition we know that the two standard deviations are approximately equal, then we can pool the data together to produce a pooled standard deviation. We have the following theorem.
You've gotta love the beautiful formula!
After finding the pooled estimate we have that a confidence interval is given by
What is the difference between commuting patterns for students and professors. 11 students and 14 professors took part in a study to find mean commuting distances. The mean number of miles traveled by students was 5.6 and the standard deviation was 2.8. The mean number of miles traveled by professors was 14.3 and the standard deviation was 9.1. Construct a 95% confidence interval for the difference between the means. What assumption have we made?
x1 = 5.6 x2 = 14.3 s1 = 2.8 s2 = 9.1 n1 = 11 n2 = 14
The pooled standard deviation is
The point estimate for the mean is
14.3 - 5.6 = 8.7
Use the t-table to find tc for a 95% confidence interval with 23 degrees of freedom and find
tc = 2.07
8.7 (2.07)(7.09)(.403) = 8.7 5.9
The range of values is [2.8, 14.6]
The difference in average miles driven by students and professors is between 2.8 and 14.6. We have assumed that the standard deviations are approximately equal and the two distributions are approximately normal.
Difference Between Proportions
So far, we have discussed the difference between two means (both large and small samples). Our next task is to estimate the difference between two proportions. We have the following theorem
And a confidence interval for the difference of proportions is
Note: in order for this to be valid, we need all four of the quantities
p1n1 p2n2 q1n1 q2n2
to be greater than 5.
300 men and 400 women we asked how they felt about taxing Internet sales. 75 of the men and 90 of the women agreed with having a tax. Find a confidence interval for the difference in proportions.
p1 = 75/300 = .25 q1 = .75 n1 = 300
p2 = 90/400 = .225 q2 = .775 n2 = 400
We can calculate
We can conclude that the difference in opinions is between -8.5% and 3.5%.