This article describes how to create a new variable that identifies the cases in a data set that have duplicated values in one or more variables. This is useful when you want to identify cases to delete from the data set.
Requirements
- A data set loaded into a Displayr document.
Method
- Select one or more variables in the Data Sets tree.
- Click the variable hover button to the right of the variable and then select Ready-Made New Variables > Duplicates.
A new variable called Duplicates will be added to the Data Sets tree.
- Drag the variable Duplicates onto the page.
The results then appear on the page.
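For readers who want to see the underlying idea outside Displayr, here is a minimal Python sketch of how a duplicates flag can be computed from the selected variables. It is an illustration only, not Displayr's implementation: the function name `flag_duplicates`, the `flag_first` parameter, and the sample data are all hypothetical, and the article does not say whether Displayr flags the first occurrence of a repeated value, so the sketch supports both conventions.

```python
from collections import Counter


def flag_duplicates(cases, variables, flag_first=True):
    """Return one boolean per case: True if its values on `variables` repeat.

    Hypothetical helper for illustration; not part of Displayr.
    flag_first=True  flags every case whose value combination occurs more than once.
    flag_first=False flags only the second and later occurrences.
    """
    # Build one key per case from the selected variables.
    keys = [tuple(case[v] for v in variables) for case in cases]

    if flag_first:
        counts = Counter(keys)
        return [counts[k] > 1 for k in keys]

    seen = set()
    flags = []
    for k in keys:
        flags.append(k in seen)  # True only if this key appeared earlier
        seen.add(k)
    return flags


# Made-up sample data: cases 1 and 3 share the same email.
cases = [
    {"id": 1, "email": "a@example.com"},
    {"id": 2, "email": "b@example.com"},
    {"id": 3, "email": "a@example.com"},
]

print(flag_duplicates(cases, ["email"]))                    # [True, False, True]
print(flag_duplicates(cases, ["email"], flag_first=False))  # [False, False, True]
```

Passing more than one variable name flags a case only when the whole combination of values repeats, which mirrors selecting several variables before creating the Duplicates variable.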
Next
How to Remove Duplicate Cases From a Data Set
How to De-duplicate Raw Data Using R