1answer.
Ask question
Login Signup
Ask question
All categories
  • English
  • Mathematics
  • Social Studies
  • Business
  • History
  • Health
  • Geography
  • Biology
  • Physics
  • Chemistry
  • Computers and Technology
  • Arts
  • World Languages
  • Spanish
  • French
  • German
  • Advanced Placement (AP)
  • SAT
  • Medicine
  • Law
  • Engineering
sdas [7]
3 years ago
10

Various problems with data collection can cause some observations to be missing. Suppose a data set has 20 cases. Here are the v

alues of the variable x for 10 of these cases: 17, 6, 12, 14, 20, 23, 9, 12, 16, 21 The values for the other ten cases are missing. One way to deal with missing data is called imputation. The basic idea is that missing values are replaced, or imputed, with values that are based on an analysis of the data that are not missing. For a data set with a single variable, the usual choice of a value for imputation is the mean of the values that are not missing. (a) Compute the mean and standard deviation for the 10 cases for which x is not missing. (b) Create a new data set with 20 cases by using imputation where you set the values for the 10 missing cases equal to the mean that you computed in the previous part. Compute the mean and standard deviation for this new data set with 20 cases. (c) Summarize what you have learned about the possible effects of this type of imputation on the mean and the standard deviation.
Mathematics
1 answer:
mr Goodwill [35]3 years ago
4 0

Answer:

a) Mean= 15, standard deviation= 5.1575

b) Mean= 15, standard deviation= 3.6469

c) See below

Step-by-step explanation:

(a) Compute the mean and standard deviation for the 10 cases for which x is not missing.

The mean using the ten known values is

\bar x=\frac{17+6+12+14+20+23+9+12+16+21}{10}=15

The standard deviation is

\small s=\sqrt{\frac{(17-15)^2+(6-15)^2+(12-15)^2+(14-15)^2+(20-15)^2+(23-15)^2+(9-15)^2+(12-15)^2+(16-15)^2+(21-15)^2}{10}}=5.1575

(b) Create a new data set with 20 cases by using imputation where you set the values for the 10 missing cases equal to the mean that you computed in the previous part. Compute the mean and standard deviation for this new data set with 20 cases.

The new data set would be

17, 6, 12, 14, 20, 23, 9, 12, 16, 21, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15

The new mean for this set of values would be then

\bar x=\frac{\displaystyle\sum_{i=1}^{20}x_i}{20}=15

The new standard deviation is now

s=\sqrt{\frac{\displaystyle\sum_{i=1}^{20}(x_i-\bar x)^2}{20}}=3.6469

(c) Summarize what you have learned about the possible effects of this type of imputation on the mean and the standard deviation.

Obviously, this way of imputation is somewhat arbitrary and will produce a set of data undoubtedly skewed.  

It could possibly be a sensible way of imputing data if the amount of missing data is very little compared to the whole set, for example one or two data in a set of 100, but in this case we have 50% of missing data and it makes no sense this procedure.

You might be interested in
PLEASE HELP PLEASE PLEASE
PolarNik [594]
The answer is b y=-2/(x+3)-4
5 0
3 years ago
Read 2 more answers
If 12 men earn $810 in 10 days how much will 14 men earn in 8 days, if the daily wage is the same for each man
kogti [31]
1 man earns  810 / 12 = $67.5 in 10 days
by proportion 1 man will earn 0.8 *  67.5 = $54 in 8 days

So 14 men will earn 54 * 14 =  $756 in 8 days.
5 0
3 years ago
What is the diameter for circumference if the diameter is 28ft
amid [387]
I did my calculation and your answer should be 87.9
hope this helped :)
3 0
3 years ago
Read 2 more answers
A small business ships specialty homemade candies to anywhere in the world. Past records indicate that the weight of orders is n
Fantom [35]

Answer:

90% confidence interval for the true mean weight of orders is between a lower limit of 103.8645 grams and an upper limit of 116.1355 grams.

Step-by-step explanation:

Confidence interval for true mean weight is given as sample mean +/- margin of error (E)

sample mean = 110 g

sample sd = 14 g

n = 16

degree of freedom = n - 1 = 16 - 1 = 15

confidence level = 90% = 0.9

significance level = 1 - C = 1 - 0.9 = 0.1 = 10%

critical value (t) corresponding to 15 degrees of freedom and 10% significance level is 1.753

E = t × sample sd/√n = 1.753×14/√16 = 6.1355 g

Lower limit of sample mean = sample mean - E = 110 - 6.1355 = 103.8645 g

Upper limit of sample mean = sample mean + E = 110 + 6.1355 = 116.1355 g

90% confidence interval is (103.8645, 116.1355)

4 0
3 years ago
Read 2 more answers
How do I solve this question (19a)?
fredd [130]
Answer i think would be
l-4x+4l
---------
x-1
8 0
3 years ago
Other questions:
  • Which equation, in point-slope form, passes through (-2, 4) and has a slope of 3?
    11·1 answer
  • ~Please only answer if you know for sure~
    7·1 answer
  • You are adding two rational numbers with different signs. How can you tell if the sum will be positive negative or zero
    9·1 answer
  • Simplify. 8.2 - (-14.1)​
    8·2 answers
  • Which represents “4 more than one half a number”?
    8·2 answers
  • Use distributive property to rewrite the expression 3x(x+4)​
    13·1 answer
  • Help me out please an thank you
    13·2 answers
  • Please answer will mark brainliest thank you so much
    15·1 answer
  • Determine the slope of a function with the following two solutions:<br> (-1.5, 2) and (7.5, 5)
    7·2 answers
  • Which<br> graph represents an odd function?
    12·1 answer
Add answer
Login
Not registered? Fast signup
Signup
Login Signup
Ask question!