6.1.3 Exploration 6A: Data Summaries and Sensitivity

Open the data file C06 ExplorationA.xls [.rda]. This file contains 1000 observations from five random distributions of data. Each data set contains values in the range of 0 to 100. In this exploration, you are going to investigate two aspects of the data. You will look at the error in estimating the mean and standard deviation of data from different distributions (symmetric, positively skewed, negatively skewed, bimodal and uniform) and you will explore how these errors are affected by the way the data is summarized.

  1. Compute the actual values for the mean and standard deviation of each of the distributions (symmetric, positively skewed, negatively skewed, bimodal, and uniform).
  2. Below are two different summaries of the data into frequency tables, one using bins of width 10, and the other with width 25. Reproduce these summary tables using your software.
  3. Use the techniques of this section to estimate the mean and standard deviation from each of the data summaries. Enter the results in the table below.
  4. Once you have recorded the results of your calculations in the table below, think about how our assumptions work when estimating the mean and standard deviation. For which types of data are these estimates most accurate? Why? For which are the estimates least accurate? Why? Keep in mind, these errors may seem small; typically the most error you get with these data is about 0.5 to 1.0, but that’s about a 1% to 2% error in estimating these statistics! How does the bin width affect the accuracy of the estimates?







Bin Symmetric Positive Skew Negative Skew Bimodal Uniform






0 - 10 22 13 7 10 106






10 - 20 56 72 26 125 99






20 - 30 78 152 36 278 110






30 - 40 140 249 77 97 102






40 - 50 199 201 140 12 90






50 - 60 184 134 242 11 90






60 - 70 165 90 271 124 96






70 - 80 90 53 143 226 113






80 - 90 46 24 48 106 97






90 - 100 20 12 10 11 97






Data summary with bins of width 10.







Bin Symmetric Positive Skew Negative Skew Bimodal Uniform






0 - 25 108 158 46 287 264






25 - 50 387 529 240 235 243






50 - 75 398 255 597 247 238






75 - 100 107 58 117 231 255






Data summary with bins of width 25.







Symmetric Positively Negatively Bimodal Uniform
Skewed Skewed






Actual Mean






Mean (bin width = 10)






Mean (bin width = 25)






Actual St Dev






St Dev (bin width = 10)






St Dev (bin width = 25)