1answer.
Ask question
Login Signup
Ask question
All categories
  • English
  • Mathematics
  • Social Studies
  • Business
  • History
  • Health
  • Geography
  • Biology
  • Physics
  • Chemistry
  • Computers and Technology
  • Arts
  • World Languages
  • Spanish
  • French
  • German
  • Advanced Placement (AP)
  • SAT
  • Medicine
  • Law
  • Engineering
sdas [7]
3 years ago
10

Various problems with data collection can cause some observations to be missing. Suppose a data set has 20 cases. Here are the v

alues of the variable x for 10 of these cases: 17, 6, 12, 14, 20, 23, 9, 12, 16, 21 The values for the other ten cases are missing. One way to deal with missing data is called imputation. The basic idea is that missing values are replaced, or imputed, with values that are based on an analysis of the data that are not missing. For a data set with a single variable, the usual choice of a value for imputation is the mean of the values that are not missing. (a) Compute the mean and standard deviation for the 10 cases for which x is not missing. (b) Create a new data set with 20 cases by using imputation where you set the values for the 10 missing cases equal to the mean that you computed in the previous part. Compute the mean and standard deviation for this new data set with 20 cases. (c) Summarize what you have learned about the possible effects of this type of imputation on the mean and the standard deviation.
Mathematics
1 answer:
mr Goodwill [35]3 years ago
4 0

Answer:

a) Mean= 15, standard deviation= 5.1575

b) Mean= 15, standard deviation= 3.6469

c) See below

Step-by-step explanation:

(a) Compute the mean and standard deviation for the 10 cases for which x is not missing.

The mean using the ten known values is

\bar x=\frac{17+6+12+14+20+23+9+12+16+21}{10}=15

The standard deviation is

\small s=\sqrt{\frac{(17-15)^2+(6-15)^2+(12-15)^2+(14-15)^2+(20-15)^2+(23-15)^2+(9-15)^2+(12-15)^2+(16-15)^2+(21-15)^2}{10}}=5.1575

(b) Create a new data set with 20 cases by using imputation where you set the values for the 10 missing cases equal to the mean that you computed in the previous part. Compute the mean and standard deviation for this new data set with 20 cases.

The new data set would be

17, 6, 12, 14, 20, 23, 9, 12, 16, 21, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15

The new mean for this set of values would be then

\bar x=\frac{\displaystyle\sum_{i=1}^{20}x_i}{20}=15

The new standard deviation is now

s=\sqrt{\frac{\displaystyle\sum_{i=1}^{20}(x_i-\bar x)^2}{20}}=3.6469

(c) Summarize what you have learned about the possible effects of this type of imputation on the mean and the standard deviation.

Obviously, this way of imputation is somewhat arbitrary and will produce a set of data undoubtedly skewed.  

It could possibly be a sensible way of imputing data if the amount of missing data is very little compared to the whole set, for example one or two data in a set of 100, but in this case we have 50% of missing data and it makes no sense this procedure.

You might be interested in
What is equivalent to -1/4 - 5/3
Stolb23 [73]

-3/12 - 20/12 is equivalent to -1/4 - 5/3

7 0
3 years ago
How many cups of peaches will Fred need to make 60 cups of fruit salad
xeze [42]

How many cups of peaches does he need to make only 1 cup of fruit salad?

8 0
4 years ago
Mai thinks of a secret number. She says that her secret number is more than 11 units away from 50. Write an absolute value inequ
alexira [117]

Step-by-step explanation:

50 + 11 = 61

the number is equal to or more than 61

61 </= x

6 0
3 years ago
Distance between the points
Harman [31]
To understand the distance formula, you first need to understand the Pythagorean Theorem. For a refresher, the theorem states that the square of the legs of a right triangle is equal to the the square of its hypotenuse (the side opposite the right angle), or in symbols:

a^2+b^2=c^2, where a and b are the lengths of the legs, and c is the length of the hypotenuse. In the context of the x-y plane, the legs of the triangle correspond to separate x and y values on the plane, and the hypotenuse corresponds to a straight line between two points on that plane.

To find the distance between the points you've listed, (2√5,4) and (1,2√3), we'll first need to find the "legs" of the triangle. To find the length of the x leg, we'll just need the distance between the x values of the points, which we find to be 2√5-1. We do the same for the y component, which ends up being 4-2√3. Now that we have our legs, we're ready to find the hypotenuse - or the distance.

Going back to Pythagorus's equation, we have:

(2 \sqrt{5}-1)^{2}+(4-2 \sqrt{3})^{2}=d^2

where d, the hypotenuse of the triangle, means "distance."

To solve for d, we take the square root of both sides:

d= \sqrt{(2 \sqrt{5}-1)^2+(4-2 \sqrt{3} )^2}

And from there, all that's left to do is solve the right side of the equation, which just ends up being rote calculation.

Edit: I'll go through the steps of that calculation here. We'll start by expanding each of the squared terms inside the radical:

(2 \sqrt{5}-1)^2=(2 \sqrt{5}-1)(2 \sqrt{5}-1)=(2 \sqrt{5}-1)2 \sqrt{5}-(2 \sqrt{5}-1)
=(2 \sqrt{5})^2-2 \sqrt{5}-2 \sqrt{5}+1=20-4\sqrt{5}+1

(4-2\sqrt{3})^2=(4-2\sqrt{3})(4-2\sqrt{3})=(4-2\sqrt{3})4-(4-2\sqrt{3})2\sqrt{3}
=16-8\sqrt{3}-8\sqrt{3}+(2\sqrt{3})^2=16-16\sqrt{3}+12

Putting those values back under the radical:

\sqrt{20-4\sqrt{5}+1+16-16\sqrt{3}+12}

Collecting constants:

\sqrt{49-4\sqrt{5}-16\sqrt{3}}

If you wanted an exact answer, this messy-looking thing would be it, and you can verify those results on WolframAlpha if you'd like. If you want an approximation, just enter that expression in to the online calculator of your choice, and it should give out the value of approx. <span>3.51325.</span>

In general, if you want to solve for the distance between two points (y_{1},x_{1}) and (y_{2},x_{2}), the formula is:

d= \sqrt{(x_{2}-x_{1})^2+(y_{2}-y_{1})^2}
4 0
3 years ago
If the dimensions of a parallelogram are increased by a factor of two, how will the perimeter of the object be affected?
wolverine [178]
Perimeter would be affected by the same scale factor, in this case 2.  perimeter only has 1 dimension. 

Answer:  increas by a factor of 2
3 0
3 years ago
Read 2 more answers
Other questions:
  • Brian correctly used a method of completing the square to solve the equation x2+7x-11=0. Brian's first step was to rewrite the e
    11·1 answer
  • When n is a factor of a number, all factors of n are also factors of the number
    5·1 answer
  • The ages of people on the camp bus are listed.
    14·1 answer
  • Cual es la expresion decimal de - 18/3
    13·1 answer
  • 2.36 to the nearest whole number
    8·2 answers
  • Please help with this.
    15·2 answers
  • Midpoint of (4,3) and (9,10)
    5·1 answer
  • Write the equation of a function whose parent function function, f(x) =x+5, is shifter 3 units to the right a. g(x) = x+3 b. g(x
    15·2 answers
  • Leila says that 75% of a number will always be greater than 50% of any other number. Complete one inequality to support Leila's
    15·1 answer
  • After she graduated from college, Brianna began teaching at a salary of $28,750 a year. Which of the following choices shows Bri
    15·1 answer
Add answer
Login
Not registered? Fast signup
Signup
Login Signup
Ask question!