Solution to Quiz 2

The full quiz is here.  The answers appear below in boldface.  Comments, which are not part of the answers, are italicized.

For the batch {0, 0, 6, 6} compute the following or state why they are not defined:

  1. Mean: (0 + 0 + 6 + 6) / 4 = 3
  2. Five-letter summary: median depth is (4+1)/2 = 2h, hinge depths are (2h+1)/2 = 1h.  Therefore (min, lower hinge, median, upper hinge, max) = (0, 0, 3, 6, 6), the h-spread is 6, and the range is 6.
  3. Standard deviation: The residuals with respect to the mean are (-3, -3, 3, 3), their squares are all 9, their sum is 9*4 = 36.  The (usual) sd is therefore sqrt(36/3) = sqrt(12).  The mm estimate of sd is sqrt(36/4) = 3.  Either answer is acceptable, but if you computed the mm estimator, you needed to say so.
  4. Variance: this is the square of the sd, so the answer is either 12 or 9 (mm estimator).  It was a real waste of time to redo the computation in #3.  If you were using a calculator and got any answer even slightly different from 12, then your answer was wrong.  You should understand the formula well enough to recognize that the variance must be a multiple of 1/3 or 1/4 (the denominator in the formula).
  5. Skewness (use the method of moments estimator):  The residuals are (-3, -3, 3, 3) as before.  The cubes of the -3's balance the cubes of the 3's, so the skewness must be zero.  There was no need to compute the mm estimator of the sd in this case.
  6. MAD.  The median is the mean, so the residuals with respect to the median are the same as those relative to the mean; their absolute values are all 3, so their median--the MAD--is 3 as well.  Remember that MAD measures a spread: its computation definitely should not involve negative numbers.  Remember the absolute value (the "A" part of the acronym)!
  7. Geometric mean.  Not defined: two of the numbers (the zeros) are nonpositive.  It is possible to compute the fourth root of 0*0*6*6 = 0, but this does not mean the GM is defined.
  8. 25% trimmed mean.  This amount of trimming removes one zero and one six; the trimmed mean therefore is (0+6)/2 = 3.

Show your work (the answer alone is not sufficient).

Consider the batches A = {1, 2, 3}, B = {1, 2, 4}, C = {1, 3, 4}, D = {2, 3, 4}, E = {1, 10, 100}, and F = {-1, -10, -100}.

  1. Order the batches from least to greatest variance.

Evidently, A and D are related by a change of location, so Var(A) = Var(D).  B and C are related by a change of location and scale--multiply all values of B by -1, then add 5 to get the values in C--so Var(C) = (-1)2 * Var(B) = Var(B).  Similarly, Var(E) = Var(F).  It's easy to check that Var(A) < Var(B) < Var(E), so the correct order is

A=D, B=C, E=F.

  1. Order the batches from least to greatest skewness.

Using the same reasoning as in #9 we see Skew(A) = Skew(D), but Skew(C) = (-1)3 * Skew(B) = -Skew(B) and similarly Skew(F) = -Skew(E).  Clearly the batches with negative skewness are C and F.  It follows quickly that the correct order is

F, C, A=D, B, E.

In your ordering, if two or more batches have the same variance or skewness then please indicate that.

Scoring: Start with 100.  Subtract 1 for minor errors and trivial mistakes in computation.  Subtract 1 for minor calculation or transcription mistakes, 5 for major blunders, 10 for completely wrong answers.  The passing score is 96.

Return to the Environmental Statistics home page

This page is copyright (c) 2001 Quantitative Decisions.  Please cite it as