cdm_reader_mapper.DataBundle.get_duplicates

DataBundle.get_duplicates(**kwargs)

Get duplicate matches in data.

Return type:

DataFrame

Returns:

pd.DataFrame – DataFrame containing duplicate matches.

Note

Before getting duplicates, a duplicate check has to be run with DataBundle.duplicate_check().

Examples

>>> matches = db.get_duplicates()
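
The call order described in the note can be illustrated with a small stand-in. This is only a sketch: ToyBundle is a hypothetical class, not the library's DataBundle, and it assumes that two rows count as duplicates when all their column values are equal. It also assumes that asking for matches before any check raises an error; how the real library behaves in that case is not stated on this page.

```python
import pandas as pd


class ToyBundle:
    """Hypothetical stand-in for DataBundle; not the library's implementation."""

    def __init__(self, data: pd.DataFrame):
        self.data = data
        self._matches = None  # filled in by duplicate_check()

    def duplicate_check(self) -> "ToyBundle":
        # Assumption: rows are duplicates when every column value is equal.
        # keep=False marks all members of a duplicate group, not only the repeats.
        mask = self.data.duplicated(keep=False)
        self._matches = self.data[mask]
        return self

    def get_duplicates(self) -> pd.DataFrame:
        # Assumption: the toy class raises if no check has been run yet.
        if self._matches is None:
            raise RuntimeError("run duplicate_check() before get_duplicates()")
        return self._matches


# Toy data: the first and third rows are identical.
data = pd.DataFrame(
    {"station": ["A", "B", "A", "C"], "temp": [10.0, 11.5, 10.0, 9.0]}
)

db = ToyBundle(data)
db.duplicate_check()           # step 1: detect duplicates
matches = db.get_duplicates()  # step 2: retrieve the matches as a DataFrame
print(matches)
```

The stand-in stores the result of the check on the object so that get_duplicates() only has to return it, which mirrors the two-step workflow above without claiming anything about the matching method the library actually uses.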

See also

DataBundle.remove_duplicates

Remove detected duplicates in data.

DataBundle.flag_duplicates

Flag detected duplicates in data.

DataBundle.duplicate_check

Run a duplicate check on the data.

Note

For more information, see DupDetect.get_duplicates().