lale.lib.rasl.aggregate module

class lale.lib.rasl.aggregate.Aggregate(*args, _lale_trained=False, _lale_impl=None, **kwargs)

Bases: TrainedIndividualOp

Relational algebra aggregate operator.

This documentation is auto-generated from JSON schemas.

Parameters
  • columns (union type, optional, not for optimizer, default []) –

    Aggregations for producing output columns.

    • dict

      Dictionary of output column names and aggregation expressions.

    • or array of items : expression

      List of aggregation expressions. The output column name is determined by a heuristic based on the input column name and the transformation function.

  • group_by (union type, optional, not for optimizer, default []) –

    Columns to group by when computing aggregates.

    • expression

      Expression for the column name if there is a single column.

    • or array of items : expression

      List of expressions for columns.

  • exclude_value (Any, optional, not for optimizer, default None) – Exclude this value from the computation of aggregates. Useful for missing-value imputation.
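
The interplay of the three parameters can be illustrated with a small standard-library sketch. It is not lale's implementation and does not call lale: the helper aggregate_sketch, the (function, input column) pairs that stand in for aggregation expressions, and the sample rows are all hypothetical. It also assumes that exclude_value drops matching values before each aggregate is computed and that None means nothing is excluded.

```python
from collections import defaultdict
from statistics import mean


def aggregate_sketch(rows, group_by, columns, exclude_value=None):
    """Hypothetical stand-in for a grouped aggregate; not lale's code."""
    # Partition the rows by the values of the group_by columns.
    groups = defaultdict(list)
    for row in rows:
        key = tuple(row[name] for name in group_by)
        groups[key].append(row)

    result = []
    for key, members in groups.items():
        out = dict(zip(group_by, key))
        # columns maps each output column name to (function, input column).
        for out_name, (func, in_col) in columns.items():
            values = [
                r[in_col]
                for r in members
                # Assumption: None means "exclude nothing".
                if exclude_value is None or r[in_col] != exclude_value
            ]
            out[out_name] = func(values) if values else None
        result.append(out)
    return result


rows = [
    {"dept": "a", "salary": 10},
    {"dept": "a", "salary": -1},  # -1 plays the role of a missing value
    {"dept": "a", "salary": 30},
    {"dept": "b", "salary": 20},
]

summary = aggregate_sketch(
    rows,
    group_by=["dept"],
    columns={"mean_salary": (mean, "salary"), "max_salary": (max, "salary")},
    exclude_value=-1,
)
```

With exclude_value=-1, the placeholder row contributes to neither aggregate for group "a"; without it, the mean for that group would be pulled down by the placeholder.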

transform(X, y=None)

Transform the data.

Parameters

X (array, >=1 items of items : array of items : Any) – Output of the group-by operator: a pandas or PySpark grouped dataframe.

Returns

result – The outer array is over rows.

  • items : array of items : Any

    The inner array is over columns.

Return type

array
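
The row-major layout described by the return schema can be pictured with plain nested lists. The values and the column order below are made up for illustration, and this says nothing about the concrete object type the operator returns.

```python
# Outer list: one entry per output row (for example, one per group).
# Inner list: one entry per output column.
result = [
    ["a", 20.0, 30],
    ["b", 20.0, 20],
]

n_rows = len(result)       # length of the outer array
n_cols = len(result[0])    # length of each inner array

first_row = result[0]                      # all columns of the first row
third_column = [row[2] for row in result]  # one column across all rows
```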