Grubbs’ Test¶
Global Algorithm - One-Dimensional Algorithm
Grubbs’ test algorithm calculates the ratio of the deviation of each data point from the mean of the data set to the standard deviation of the data set. The basic formula is as follows:
\[G = \frac{\text{max}|(Y_i-\mu)|}{\sigma}\]
where \(G\) is the Grubbs’ test statistic, \(Y_i\) is the \(i\)th component of \(Y\), \(\mu\) is the mean value, and \(\sigma\) is the standard deviation of the data set. If
\[G > G^{\text{table}} \, \text{,}\]
then the data point can be considered outside the region of interest. \(G^{\text{table}}\) includes literature values for a certain significance level.
Input Parameters
Parameter | Type | Constraint | Description | Remarks |
---|---|---|---|---|
\(Y\) | \(Y \in \mathbb R^N\) | \(N \in \mathbb{N}\) | Input data sequence of length \(N\) | Not effective for a data vector with less than \(6\) constituents. |
\(\mu\) | \(\mu \in \mathbb{R}\) | Mean distribution of \(Y\) | ||
\(\sigma\) | \(\sigma\in \mathbb{R}\) | Standard deviation of \(Y\) | ||
\(G^{\text{table}}\) | Grubbs’ test values from literature |
Output Parameters
Parameter | Type | Constraint | Description | Remarks |
---|---|---|---|---|
\(\hat{Y}\) | \(\hat{Y} \in \mathbb R^N\) | Values in the \(Y\) list which are outside the region of interest are marked |
Single Steps using the Algorithm
References
- F. Grubbs, Procedures for Detecting Outlying Observations in Samples, Technometrics, vol. 11(1), pp. 1-21, 1969.