Chauvenet’s Criterion

Chauvenet’s Criterion

The following is from Chauvenet’s Criterion

The procedure assumes your data is normally distributed.

Chauvenet’s criterion is a way to identify outliers. The method works by creating an acceptable band of data around the mean. Eliminate any value that fall outside of the band.

Assuming you have a random sample of n values

  1. Find the sample's mean (m)

  2. Find the sample's standard deviation (s)

  3. Use the following formula to find the standardized deviation from the mean for each value in the sample

    image missing

    Run the formula n times (One time for each data point).
  4. Compare the value you get with the tables of Chauvenet’s criterion below. Reject values that exceed the criterion.

    For example if n=30, the table tells us a value's standard deviation should not exceed 2.394. If it does reject it.

    If you’ve find an abnormal amount of outliers, you might want to widen the band to include a certain percentage of data points. The empirical rule tells us that 95% of data lies within two standard deviations from the mean. Therefore you should eliminate a maximum of 5% of your data points.

    nτ
    31.383
    41.534
    51.645
    61.732
    71.803
    81.863
    91.915
    101.960
    112.000
    122.037
    nτ
    132.070
    142.100
    152.128
    162.154
    172.178
    182.200
    192.222
    202.241
    212.260
    222.278
    nτ
    232.295
    242.311
    252.326
    262.341
    272.355
    282.369
    292.382
    302.394
    312.406
    322.418
    nτ
    332.429
    342.440
    352.450
    362.460
    372.470
    382.479
    392.489
    402.498
    502.576
    1002.807
    nτ
    5003.291
    10003.481

Note: Deletion of outlier data is a controversial practice frowned on by many scientists and science instructors; while Chauvenet's criterion provides an objective and quantitative method for data rejection, it does not make the practice more scientifically or methodologically sound, especially in small sets or where a normal distribution cannot be assumed. Rejection of outliers is more acceptable in areas of practice where the underlying model of the process being measured and the usual distribution of measurement error are confidently known.
Chauvenet's criterion (Wikipedia)