API reference#
This page provides an auto-generated summary of the cdm_reader_mapper API.
Read data from disk#
|
Read original marine-meteorological data, MDF data, or CDM tables from disk. |
|
Read MDF data that already conforms to a predefined data model. |
|
Read data files compliant with a user-specific data model. |
|
Read CDM-table-like files from the file system into a pandas.DataFrame. |
DataBundle#
|
Class for manipulating the MDF data and mapping it to the CDM. |
DataBundle’s method functions#
Information#
|
Get unique values of |
Manipulation#
|
Add information to a |
Make deep copy of a |
|
|
Replace columns in |
|
Stack multiple |
|
Stack multiple |
Selection#
|
Split |
|
Split |
|
Split |
|
Split |
|
Select rows from |
|
Select rows from |
|
Select rows from |
Validation#
|
Validate datetime information in |
|
Validate station id information in |
Map data to CDM tables#
|
Map |
Correction#
|
Correct datetime information in |
|
Correct platform type information in |
Duplicate check#
|
Duplicate check in |
|
Flag detected duplicates in |
|
Get duplicate matches in |
|
Remove detected duplicates in |
Write data to disk#
|
Write |
DataBundle’s property attributes#
Column labels of |
|
MDF pandas.DataFrame data. |
|
Dictionary of data types on |
|
A string representing the encoding to use in the |
|
Name of the MDF/CDM input model. |
|
MDF pandas.DataFrame validation mask. |
|
Data mode. |
|
Information on how to parse dates in |
Useful functions#
|
Apply ICOADS deck-specific datetime corrections. |
|
Apply ICOADS deck-specific platform ID corrections. |
|
Run a duplicate check on a dataset using recordlinkage. |
|
Map a pandas DataFrame to the CDM header and observational tables. |
|
Read original marine-meteorological data, MDF data, or CDM tables from disk. |
|
Read MDF data that already conforms to a predefined data model. |
|
Read data files compliant with a user-specific data model. |
|
Read CDM-table-like files from the file system into a pandas.DataFrame. |
|
Replace columns in one DataFrame using row-matching from another. |
|
Split a DataFrame using a boolean mask via |
|
Split rows where all mask columns are |
|
Split a DataFrame based on matching values in a given column. |
|
Split a DataFrame by selecting specific index labels. |
|
Count unique values per column in a DataFrame or an iterable of DataFrames. |
|
Validate datetime columns in a dataset according to the specified model. |
|
Validate ID column(s) in a dataset against deck-specific patterns. |
|
Write either MDF data or CDM tables to disk. |
|
Write a pandas.DataFrame to an MDF file on the file system. |
|
Write a pandas.DataFrame to a CDM-table file on the file system. |
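Several of the functions summarized above operate on ordinary pandas objects. As a rough illustration of the ideas behind the mask-based split, the column-value split, and the per-column unique counts, the sketch below uses only standard pandas calls. The helper names `split_by_mask` and `split_by_values`, and the column names `station_id` and `sst`, are hypothetical and chosen for this example; they are not the cdm_reader_mapper API, whose function names and signatures are not shown on this page.

```python
import pandas as pd


def split_by_mask(df: pd.DataFrame, mask: pd.Series):
    """Return (selected, rejected) rows according to a boolean mask.

    Hypothetical helper for illustration only.
    """
    # Align the mask to the data index; rows without a mask entry are rejected.
    mask = mask.reindex(df.index, fill_value=False).astype(bool)
    return df[mask], df[~mask]


def split_by_values(df: pd.DataFrame, column: str, values):
    """Return (matching, remaining) rows based on values in one column.

    Hypothetical helper for illustration only.
    """
    matches = df[column].isin(values)
    return df[matches], df[~matches]


# Illustrative data; column names are assumptions, not a real data model.
df = pd.DataFrame(
    {
        "station_id": ["A1", "B2", "A1", "C3"],
        "sst": [14.2, 15.1, 13.9, 16.0],
    }
)
valid = pd.Series([True, False, True, True], index=df.index)

selected, rejected = split_by_mask(df, valid)
a1, others = split_by_values(df, "station_id", ["A1"])

# Count how often each unique value occurs, column by column.
unique_counts = {col: df[col].value_counts().to_dict() for col in df.columns}
```

Returning both halves of each split keeps the rejected rows available for inspection instead of silently discarding them.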
DupDetect#
|
Class to detect, flag, and remove duplicate entries in a DataFrame using a comparison matrix from recordlinkage. |
|
Get result dataset with flagged duplicates. |
|
Identify duplicate matches based on the comparison matrix. |
|
Remove duplicate entries from the dataset. |
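The detect, flag, and remove steps listed for DupDetect can be pictured with a much simpler stand-in. The sketch below is not the DupDetect implementation: it replaces the recordlinkage comparison matrix with exact matching through `pandas.DataFrame.duplicated`, so it finds only rows that agree exactly on the chosen key columns. The column names and the sample records are invented for this example.

```python
import pandas as pd

# Illustrative reports; names and values are assumptions for this sketch.
reports = pd.DataFrame(
    {
        "report_id": ["r1", "r2", "r3", "r4"],
        "latitude": [52.0, 52.0, 10.5, 52.0],
        "longitude": [4.0, 4.0, -30.0, 4.0],
        "report_timestamp": [
            "2000-01-01 00:00",
            "2000-01-01 00:00",
            "2000-01-01 06:00",
            "2000-01-02 00:00",
        ],
    }
)
key_columns = ["latitude", "longitude", "report_timestamp"]

# Detect: mark every row that repeats an earlier row on the key columns.
is_duplicate = reports.duplicated(subset=key_columns, keep="first")

# Flag: keep all rows and record the duplicate status in a new column.
flagged = reports.assign(is_duplicate=is_duplicate)

# Remove: drop the rows marked as duplicates.
deduplicated = reports[~is_duplicate].reset_index(drop=True)
```

Keeping flagging and removal as separate steps lets the flagged table be reviewed before any rows are dropped.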