This article describes how to create a new variable that identifies the cases in a data set that have duplicated values in one or more variables. This feature is useful when you want to identify cases to delete from the data set.
Requirements
- A data set loaded into a Displayr document.
Please note these steps require a Displayr license.
Method
- Select one or more variables in the Data Sources tree.
- Click the variable hover button to the right of the variable and then select Ready-Made New Variables > Duplicates.
- A new variable called Duplicates will be added to the Data Sources tree.
- Drag the variable Duplicates onto the page to review counts.
- OPTIONAL: You can then use this variable as a filter to remove duplicate cases from the data set. See How to Delete Cases From a Data Set for details.
Next
How to Remove Duplicate Cases From a Data Set