Skip to content

incorrect explanation of variance increase for exercise 2.27 #31

@goretkin

Description

@goretkin

The exercise here: https://github.com/OpenIntroStat/openintro-statistics/blob/master/ch_summarizing_data/TeX/review_exercises.tex#L5

has a solution here:

(c)~The new score is more than 1 standard deviation away from the previous mean, so

But the justification is incorrect.

The update equation for the unbiased sample variance, supposing n-1 old measurements, and given the mean and sample variance on n-1 measurements, and a new nth measurement is:

sampvar_{new}&=\frac{n-2}{n-1}sampvar_{old}+\frac{1}{n}\left(x_{n}-\bar{x}_{old}\right)^{2}

(you can see this is identical to the second equation at https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Welford's_online_algorithm)

So the critical value of x_n so that the sample variance stays the same is

\frac{n}{n-1}sampvar&=\left(x_{n}-\bar{x}_{old}\right)^{2}

therefore, the there is a factor of 25/24 missing in the justification. I would submit a pull request, but I am not sure if the textbook covers these equations.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions