cdm_reader_mapper.DataBundle.flag_duplicates

cdm_reader_mapper.DataBundle.flag_duplicates#

DataBundle.flag_duplicates(inplace=False, **kwargs)[source]#

Flag detected duplicates in data.

Parameters:

inplace (bool) – If True overwrite data in DataBundle else return a copy of DataBundle with data containing flagged duplicates. Default: False

Return type:

DataBundle | None

Returns:

DataBundle or None – DataBundle containing duplicate flags in data or None if inplace=True.

Note

Before flagging duplicates, a duplictate check has to be done, DataBundle.duplicate_check().

Examples

Flag duplicates without overwriting data.

>>> flagged_tables = db.flag_duplicates()

Flag duplicates with overwriting data.

>>> db.flag_duplicates(inplace=True)
>>> flagged_tables = db.data

See also

DataBundle.remove_duplicates

Remove detected duplicates in data.

DataBundle.get_duplicates

Get duplicate matches in data.

DataBundle.duplicate_check

Duplicate check in data.

Note

For more information see DupDetect.flag_duplicates()