remove_duplicates {cleanepi} | R Documentation |
When removing duplicates, users can specify a set of columns to consider with the 'target_columns' argument.
remove_duplicates(data, target_columns = NULL)
data |
An input data frame or linelist. |
target_columns |
A vector of column names to use when looking for
duplicates. When the input data is a |
A data frame or linelist without the duplicated values and without constant columns.
# Load the example linelist shipped with cleanepi and remove its duplicates
no_dups <- remove_duplicates(
  data = readRDS(system.file("extdata", "test_linelist.RDS",
                             package = "cleanepi")),
  target_columns = "linelist_tags"
)
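To illustrate what restricting the comparison to a set of columns means, here is a minimal base R sketch. It is not the cleanepi implementation: the helper `drop_dups_sketch()` and the toy data frame `df` are hypothetical, and the sketch assumes that the first row of each group of duplicates is kept. It only uses `duplicated()` from base R.

```r
# Toy data (hypothetical values, not taken from the cleanepi test linelist)
df <- data.frame(
  id  = c(1, 2, 2, 3, 3),
  sex = c("F", "M", "M", "F", "M"),
  age = c(30, 41, 41, 25, 25),
  stringsAsFactors = FALSE
)

# Hypothetical helper, for illustration only: keep the first row of each
# group of duplicates, comparing rows on the columns in `cols`
# (all columns when `cols` is NULL)
drop_dups_sketch <- function(data, cols = NULL) {
  if (is.null(cols)) {
    cols <- names(data)
  }
  data[!duplicated(data[, cols, drop = FALSE]), , drop = FALSE]
}

# Compare on every column: only fully identical rows are dropped
all_cols <- drop_dups_sketch(df)

# Compare on `id` only: rows sharing an id are treated as duplicates
id_only <- drop_dups_sketch(df, cols = "id")
```

With this toy data, comparing on all columns drops one row, while comparing on `id` alone drops two, because the two rows with `id` 3 differ in `sex` but share the same identifier.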