This article describes how to use a built-in QScript to check selected numeric data for outliers and create new copies of the data with the outliers removed. Outliers are defined as values that lie more than a chosen number of standard deviations from the variable's mean. You choose how many standard deviations to use; the default is 3. In the new copies, outlier values are replaced with missing values. Data that does not contain outliers is not copied.
- A data file loaded in Displayr
- One or more numeric variables
To run the script:
- Select a Numeric, Numeric - Multi, or Numeric - Grid variable set in the Data Sets tree.
- Click + > Ready-Made New Variables > Variable(s) with Outliers Removed or, from the object inspector, click TRANSFORMATIONS > Variable(s) with Outliers Removed.
- Enter a cut-off value. This is the number of standard deviations from the mean beyond which a value is treated as an outlier. The default value is 3.
A folder will be created in the Pages tree that contains tables for the selected data and any new copies of data with the outliers removed.
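The outlier rule described above can be illustrated with a short Python sketch. This is not the QScript that Displayr runs; the function name `remove_outliers`, the use of `None` for missing values, and the choice of the sample standard deviation are all assumptions made for illustration.

```python
import statistics


def remove_outliers(values, cutoff=3.0):
    """Return a copy of `values` with outliers replaced by None.

    Returns None when no outliers are found, mirroring the rule that
    data without outliers is not copied. (Hypothetical helper.)
    """
    # Ignore values that are already missing when computing mean and SD.
    observed = [v for v in values if v is not None]
    if len(observed) < 2:
        return None  # not enough data to compute a standard deviation

    mean = statistics.mean(observed)
    sd = statistics.stdev(observed)  # assumption: sample standard deviation
    if sd == 0:
        return None  # all values identical, so nothing can be an outlier

    found_outlier = False
    cleaned = []
    for v in values:
        if v is not None and abs(v - mean) > cutoff * sd:
            cleaned.append(None)  # outlier becomes a missing value
            found_outlier = True
        else:
            cleaned.append(v)
    return cleaned if found_outlier else None


data = [10] * 20 + [None, 100]
print(remove_outliers(data))            # 100 is replaced by None
print(remove_outliers([1, 2, 3, 4, 5])) # no outliers, so no copy is made
```

Raising the cut-off makes the rule stricter about what counts as an outlier: with `cutoff=5`, the value 100 in the example data is no longer flagged and no copy is produced.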