cdm_reader_mapper.split_by_boolean

cdm_reader_mapper.split_by_boolean#

cdm_reader_mapper.split_by_boolean(data, mask, boolean, reset_index=False, inverse=False, return_rejected=False)[source]#

Split a DataFrame using a boolean mask via split_dataframe_by_boolean.

Parameters:
  • data (pandas.DataFrame or Iterable[pd.DataFrame]) – DataFrame to be split.

  • mask (pandas.DataFrame or Iterable[pd.DataFrame]) – Boolean mask with the same length as data.

  • boolean (bool) – Determines mask interpretation:

    • True ? select rows where all mask columns are True.

    • False ? select rows where any mask column is False.

  • reset_index (bool, optional) – If True, reset the index of returned DataFrames.

  • inverse (bool, optional) – If True, invert the selection performed by the underlying function.

  • return_rejected (bool, optional) – If True, return rejected rows as the second output. If False, the rejected output is empty but dtype-preserving.

Return type:

tuple[DataFrame | ParquetStreamReader, DataFrame | ParquetStreamReader, Index | MultiIndex, Index | MultiIndex]

Returns:

(pandas.DataFrame or ParquetStreamReader, pandas.DataFrame or ParquetStreamReader, pd.Index or pd.MultiIndex, pd.Index or pd.MultiIndex) – Selected rows (all mask columns True), rejected rows, original indexes of selection and original indexes of rejection.