The histograms and summary statistics describe the number of hits in the season by baseball players in two leagues.
Summary statistics for the number of hits by players in each league.
(a)
Use the shape of the distributions to select the appropriate measures of center and variability for the number of hits by players in each of the two leagues. Compare the number of hits by players in the two leagues using these measures. Explain what each value means in your comparison.
(b)
Each data set contains one outlier. What are the values of the two outliers? Explain how each value is determined to be an outlier.
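One common convention for deciding whether a value is an outlier is the 1.5 × IQR rule: a value is flagged if it lies more than 1.5 interquartile ranges below the first quartile or above the third quartile. The sketch below illustrates that rule. It assumes this is the convention intended here, the function name `find_outliers` and the list of hit counts are hypothetical (they are not the data from either league), and the quartiles are computed with Python's "inclusive" method, which can differ slightly from a textbook's method.

```python
import statistics


def find_outliers(values):
    """Return the values lying more than 1.5 * IQR beyond the quartiles."""
    # Quartile method is an assumption; textbooks may compute Q1 and Q3 differently.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return [v for v in values if v < lower_fence or v > upper_fence]


# Hypothetical hit counts for illustration only, NOT the data from either league.
hypothetical_hits = [12, 15, 17, 18, 20, 21, 22, 24, 25, 60]
print(find_outliers(hypothetical_hits))  # prints [60]
```

For this made-up list, Q1 = 17.25 and Q3 = 23.5, so the IQR is 6.25 and the fences are 7.875 and 32.875. Only 60 falls outside them.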
(c)
Elena suggests removing the outliers from each data set because they are so unusual. Is this the right action to take? Explain your reasoning.
(d)
If the outliers are removed, which would be more likely to change significantly: the mean or the median? Is the standard deviation or interquartile range more likely to change significantly? Explain your reasoning.
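One way to explore this kind of question is to compute each measure with and without an extreme value and compare the changes. The sketch below does that for a hypothetical list of hit counts (not the data from either league). The helper name `summarize` is an assumption for illustration, and the quartiles use Python's "inclusive" method, which may not match a textbook's method exactly.

```python
import statistics


def summarize(values):
    """Return the mean, median, sample standard deviation, and IQR of values."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),
        "iqr": q3 - q1,
    }


# Hypothetical hit counts for illustration only; 60 plays the role of the outlier.
hypothetical_hits = [12, 15, 17, 18, 20, 21, 22, 24, 25, 60]
without_outlier = [v for v in hypothetical_hits if v != 60]

before = summarize(hypothetical_hits)
after = summarize(without_outlier)

# Print each measure with the outlier, then without it.
for name in before:
    print(name, round(before[name], 2), round(after[name], 2))
```

Comparing the two columns of output for each measure shows how strongly a single extreme value affects it in this made-up example.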