Open the data file C06 ExplorationA.xls [.rda]. This file contains 1000 observations from five random distributions of data. Each data set contains values in the range of 0 to 100. In this exploration, you are going to investigate two aspects of the data. You will look at the error in estimating the mean and standard deviation of data from different distributions (symmetric, positively skewed, negatively skewed, bimodal and uniform) and you will explore how these errors are affected by the way the data is summarized.
Bin | Symmetric | Positive Skew | Negative Skew | Bimodal | Uniform |
0 - 10 | 22 | 13 | 7 | 10 | 106 |
10 - 20 | 56 | 72 | 26 | 125 | 99 |
20 - 30 | 78 | 152 | 36 | 278 | 110 |
30 - 40 | 140 | 249 | 77 | 97 | 102 |
40 - 50 | 199 | 201 | 140 | 12 | 90 |
50 - 60 | 184 | 134 | 242 | 11 | 90 |
60 - 70 | 165 | 90 | 271 | 124 | 96 |
70 - 80 | 90 | 53 | 143 | 226 | 113 |
80 - 90 | 46 | 24 | 48 | 106 | 97 |
90 - 100 | 20 | 12 | 10 | 11 | 97 |
Data summary with bins of width 10.
Bin | Symmetric | Positive Skew | Negative Skew | Bimodal | Uniform |
0 - 25 | 108 | 158 | 46 | 287 | 264 |
25 - 50 | 387 | 529 | 240 | 235 | 243 |
50 - 75 | 398 | 255 | 597 | 247 | 238 |
75 - 100 | 107 | 58 | 117 | 231 | 255 |
Data summary with bins of width 25.
Symmetric | Positively | Negatively | Bimodal | Uniform | |
Skewed | Skewed | ||||
Actual Mean | |||||
Mean (bin width = 10) | |||||
Mean (bin width = 25) | |||||
Actual St Dev | |||||
St Dev (bin width = 10) | |||||
St Dev (bin width = 25) | |||||