lale.lib.sklearn.pca module¶
- class lale.lib.sklearn.pca.PCA(*, n_components=None, copy=True, whiten=False, svd_solver='auto', tol=0.0, iterated_power='auto', random_state=None, n_oversamples=10, power_iteration_normalizer='auto')¶
Bases:
PlannedIndividualOp
Principal component analysis transformer from scikit-learn for linear dimensionality reduction.
This documentation is auto-generated from JSON schemas.
- Parameters
n_components (union type, default None) –
None
If not set, keep all components.
or ‘mle’
Use Minka’s MLE to guess the dimension.
or float, >0.0, <1.0
Select the number of components such that the amount of variance that needs to be explained is greater than the specified percentage.
or integer, >=1, <=’X/items/maxItems’, not for optimizer
Number of components to keep.
See also constraint-2, constraint-3.
copy (boolean, not for optimizer, default True) – If false, overwrite data passed to fit.
whiten (boolean, default False) – When true, multiply the components vectors by the square root of n_samples and then divide by the singular values to ensure uncorrelated outputs with unit component-wise variances.
svd_solver (‘auto’, ‘full’, ‘arpack’, or ‘randomized’, default ‘auto’) –
Algorithm to use.
See also constraint-2, constraint-3, constraint-4.
tol (float, >=0.0, <=1 for optimizer, not for optimizer, default 0.0) – Tolerance for singular values computed by svd_solver arpack.
iterated_power (union type, not for optimizer, default 'auto') –
integer, >=0, <=10 for optimizer
Number of iterations for the power method computed by svd_solver randomized.
or ‘auto’
Pick automatically.
See also constraint-4.
random_state (union type, not for optimizer, default None) –
Seed of pseudo-random number generator for shuffling data.
None
RandomState used by np.random
or numpy.random.RandomState
Use the provided random state, only affecting other users of that same random state instance.
or integer
Explicit seed.
n_oversamples (integer, >=0, <=1000 for optimizer, optional, not for optimizer, default 10) – This parameter is only relevant when
svd_solver="randomized"
. It corresponds to the additional number of random vectors to sample the range of X so as to ensure proper conditioning. See randomized_svd for more details.power_iteration_normalizer (‘auto’, ‘QR’, ‘LU’, or ‘none’, optional, not for optimizer, default ‘auto’) – Power iteration normalizer for randomized SVD solver. Not used by ARPACK. See
randomized_svd
for more details.
Notes
constraint-1 : negated type of ‘X/isSparse’
This class does not support sparse input. See TruncatedSVD for an alternative with sparse data.
constraint-2 : union type
Option n_components mle can only be set for svd_solver full or auto.
n_components : negated type of ‘mle’
or svd_solver : ‘full’ or ‘auto’
constraint-3 : union type
Setting 0 < n_components < 1 only works for svd_solver full.
n_components : negated type of float, >0.0, <1.0
or svd_solver : ‘full’
constraint-4 : union type
Option iterated_power can be set for svd_solver randomized.
iterated_power : ‘auto’
or svd_solver : ‘randomized’
- fit(X, y=None, **fit_params)¶
Train the operator.
Note: The fit method is not available until this operator is trainable.
Once this method is available, it will have the following signature:
- Parameters
X (array of items : array of items : float) – Features; the outer array is over samples.
y (Any, optional) – Target for supervised learning (ignored).
- transform(X, y=None)¶
Transform the data.
Note: The transform method is not available until this operator is trained.
Once this method is available, it will have the following signature:
- Parameters
X (array of items : array of items : float) – Features; the outer array is over samples.
- Returns
result – Features; the outer array is over samples.
- Return type
array of items : array of items : float