Answer:
a) 0.6212 = 62.12% probability that the mean price for a sample of 30 federal income tax returns is within $16 of the population mean.
b) 0.7416 = 74.16% probability that the mean price for a sample of 50 federal income tax returns is within $16 of the population mean.
c) 0.8804 = 88.04% probability that the mean price for a sample of 100 federal income tax returns is within $16 of the population mean.
d) None of them ensure, that one which comes closer is a sample size of 100 in option c), to guarantee, we need to keep increasing the sample size.
Step-by-step explanation:
To solve this question, we need to understand the normal probability distribution and the central limit theorem.
Normal Probability Distribution
Problems of normal distributions can be solved using the z-score formula.
In a set with mean
and standard deviation
, the z-score of a measure X is given by:
![Z = \frac{X - \mu}{\sigma}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7B%5Csigma%7D)
The Z-score measures how many standard deviations the measure is from the mean. After finding the Z-score, we look at the z-score table and find the p-value associated with this z-score. This p-value is the probability that the value of the measure is smaller than X, that is, the percentile of X. Subtracting 1 by the p-value, we get the probability that the value of the measure is greater than X.
Central Limit Theorem
The Central Limit Theorem establishes that, for a normally distributed random variable X, with mean
and standard deviation
, the sampling distribution of the sample means with size n can be approximated to a normal distribution with mean
and standard deviation
.
For a skewed variable, the Central Limit Theorem can also be applied, as long as n is at least 30.
The CPA Practice Advisor reports that the mean preparation fee for 2017 federal income tax returns was $273. Use this price as the population mean and assume the population standard deviation of preparation fees is $100.
This means that ![\mu = 273, \sigma = 100](https://tex.z-dn.net/?f=%5Cmu%20%3D%20273%2C%20%5Csigma%20%3D%20100)
A) What is the probability that the mean price for a sample of 30 federal income tax returns is within $16 of the population mean?
Sample of 30 means that ![n = 30, s = \frac{100}{\sqrt{30}}](https://tex.z-dn.net/?f=n%20%3D%2030%2C%20s%20%3D%20%5Cfrac%7B100%7D%7B%5Csqrt%7B30%7D%7D)
The probability is the p-value of Z when X = 273 + 16 = 289 subtracted by the p-value of Z when X = 273 - 16 = 257. So
X = 289
![Z = \frac{X - \mu}{\sigma}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7B%5Csigma%7D)
By the Central Limit Theorem
![Z = \frac{X - \mu}{s}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7Bs%7D)
![Z = \frac{289 - 273}{\frac{100}{\sqrt{30}}}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7B289%20-%20273%7D%7B%5Cfrac%7B100%7D%7B%5Csqrt%7B30%7D%7D%7D)
![Z = 0.88](https://tex.z-dn.net/?f=Z%20%3D%200.88)
has a p-value of 0.8106
X = 257
![Z = \frac{X - \mu}{s}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7Bs%7D)
![Z = \frac{257 - 273}{\frac{100}{\sqrt{30}}}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7B257%20-%20273%7D%7B%5Cfrac%7B100%7D%7B%5Csqrt%7B30%7D%7D%7D)
![Z = -0.88](https://tex.z-dn.net/?f=Z%20%3D%20-0.88)
has a p-value of 0.1894
0.8106 - 0.1894 = 0.6212
0.6212 = 62.12% probability that the mean price for a sample of 30 federal income tax returns is within $16 of the population mean.
B) What is the probability that the mean price for a sample of 50 federal income tax returns is within $16 of the population mean?
Sample of 30 means that ![n = 50, s = \frac{100}{\sqrt{50}}](https://tex.z-dn.net/?f=n%20%3D%2050%2C%20s%20%3D%20%5Cfrac%7B100%7D%7B%5Csqrt%7B50%7D%7D)
X = 289
![Z = \frac{X - \mu}{\sigma}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7B%5Csigma%7D)
By the Central Limit Theorem
![Z = \frac{X - \mu}{s}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7Bs%7D)
![Z = \frac{289 - 273}{\frac{100}{\sqrt{50}}}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7B289%20-%20273%7D%7B%5Cfrac%7B100%7D%7B%5Csqrt%7B50%7D%7D%7D)
![Z = 1.13](https://tex.z-dn.net/?f=Z%20%3D%201.13)
has a p-value of 0.8708
X = 257
![Z = \frac{X - \mu}{s}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7Bs%7D)
![Z = \frac{257 - 273}{\frac{100}{\sqrt{50}}}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7B257%20-%20273%7D%7B%5Cfrac%7B100%7D%7B%5Csqrt%7B50%7D%7D%7D)
![Z = -1.13](https://tex.z-dn.net/?f=Z%20%3D%20-1.13)
has a p-value of 0.1292
0.8708 - 0.1292 = 0.7416
0.7416 = 74.16% probability that the mean price for a sample of 50 federal income tax returns is within $16 of the population mean.
C) What is the probability that the mean price for a sample of 100 federal income tax returns is within $16 of the population mean?
Sample of 30 means that ![n = 100, s = \frac{100}{\sqrt{100}}](https://tex.z-dn.net/?f=n%20%3D%20100%2C%20s%20%3D%20%5Cfrac%7B100%7D%7B%5Csqrt%7B100%7D%7D)
X = 289
![Z = \frac{X - \mu}{\sigma}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7B%5Csigma%7D)
By the Central Limit Theorem
![Z = \frac{X - \mu}{s}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7Bs%7D)
![Z = \frac{289 - 273}{\frac{100}{\sqrt{100}}}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7B289%20-%20273%7D%7B%5Cfrac%7B100%7D%7B%5Csqrt%7B100%7D%7D%7D)
![Z = 1.6](https://tex.z-dn.net/?f=Z%20%3D%201.6)
has a p-value of 0.9452
X = 257
![Z = \frac{X - \mu}{s}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7BX%20-%20%5Cmu%7D%7Bs%7D)
![Z = \frac{257 - 273}{\frac{100}{\sqrt{100}}}](https://tex.z-dn.net/?f=Z%20%3D%20%5Cfrac%7B257%20-%20273%7D%7B%5Cfrac%7B100%7D%7B%5Csqrt%7B100%7D%7D%7D)
![Z = -1.6](https://tex.z-dn.net/?f=Z%20%3D%20-1.6)
has a p-value of 0.0648
0.9452 - 0.0648 =
0.8804 = 88.04% probability that the mean price for a sample of 100 federal income tax returns is within $16 of the population mean.
D) Which, if any of the sample sizes in part (a), (b), and (c) would you recommend to ensure at least a .95 probability that the same mean is withing $16 of the population mean?
None of them ensure, that one which comes closer is a sample size of 100 in option c), to guarantee, we need to keep increasing the sample size.