# GaussianMixtureSummary

class pyspark.ml.clustering.GaussianMixtureSummary(java_obj=None)

Gaussian mixture clustering results for a given model.

New in version 2.1.0.

Attributes

| Attribute | Description |
| --- | --- |
| cluster | DataFrame of the predicted cluster (index) for each training data point. |
| clusterSizes | Size of (number of data points in) each cluster. |
| featuresCol | Name of the features column in predictions. |
| k | The number of clusters the model was trained with. |
| logLikelihood | Total log-likelihood for this model on the given data. |
| numIter | Number of iterations performed during training. |
| predictionCol | Name of the predicted-cluster column in predictions. |
| predictions | DataFrame produced by the model's transform method. |
| probability | DataFrame of probabilities of each cluster for each training data point. |
| probabilityCol | Name of the per-cluster predicted-probability column in predictions. |

Attributes Documentation

cluster

DataFrame of the predicted cluster (index) for each training data point.

New in version 2.1.0.

clusterSizes

Size of (number of data points in) each cluster.

New in version 2.1.0.
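To make the meaning of clusterSizes concrete, here is a minimal plain-Python sketch, assuming the sizes are simply the number of training points assigned to each cluster index from 0 to k - 1. The function name `cluster_sizes` and the sample data are hypothetical; this is not Spark's implementation.

```python
from collections import Counter


def cluster_sizes(predicted, k):
    """Count how many points were assigned to each of the k clusters.

    `predicted` is a list of cluster indices (hypothetical sample input).
    Clusters that received no points get a size of 0.
    """
    counts = Counter(predicted)
    return [counts.get(i, 0) for i in range(k)]


# Five points, three clusters; cluster 1 is empty.
predicted = [0, 2, 2, 0, 2]
print(cluster_sizes(predicted, k=3))  # prints [2, 0, 3]
```

Under this assumption the sizes always sum to the number of training points, and the list has exactly k entries.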

featuresCol

Name of the features column in predictions.

New in version 2.1.0.

k

The number of clusters the model was trained with.

New in version 2.1.0.

logLikelihood

Total log-likelihood for this model on the given data.

New in version 2.2.0.
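As a rough illustration of what a total log-likelihood measures, the sketch below sums the log of the mixture density over every data point for a one-dimensional Gaussian mixture. The function names, the restriction to one dimension, and the sample parameters are assumptions made for illustration; Spark works with multivariate Gaussians and its computation is not reproduced here.

```python
import math


def normal_pdf(x, mean, std):
    """Density of a univariate normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def total_log_likelihood(data, weights, means, stds):
    """Sum over points of log(sum over components of weight * density)."""
    total = 0.0
    for x in data:
        density = sum(
            w * normal_pdf(x, m, s) for w, m, s in zip(weights, means, stds)
        )
        total += math.log(density)
    return total


# Hypothetical two-component mixture and four sample points.
data = [-1.1, -0.9, 4.8, 5.2]
ll = total_log_likelihood(data, weights=[0.5, 0.5], means=[-1.0, 5.0], stds=[1.0, 1.0])
print(ll)
```

Higher (less negative) values indicate that the fitted mixture assigns more density to the data, which is why this quantity is commonly compared across models fitted with different k.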

numIter

Number of iterations performed during training.

New in version 2.4.0.

predictionCol

Name of the predicted-cluster column in predictions.

New in version 2.1.0.

predictions

DataFrame produced by the model’s transform method.

New in version 2.1.0.

probability

DataFrame of probabilities of each cluster for each training data point.

New in version 2.1.0.
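Each row of the probability DataFrame can be read as a vector with one entry per cluster. The sketch below shows, in plain Python, how a hard cluster assignment could be derived from such a vector, assuming the predicted cluster is the index of the largest probability. The function name `predicted_cluster` and the sample vector are hypothetical.

```python
def predicted_cluster(probabilities):
    """Return the index of the largest probability (first one on ties)."""
    return max(range(len(probabilities)), key=probabilities.__getitem__)


# Hypothetical membership probabilities of one point for k = 3 clusters.
row = [0.1, 0.7, 0.2]
print(predicted_cluster(row))  # prints 1
```

The probabilities in a row sum to 1, so a value close to 1 signals a confident assignment, while near-equal values signal a point lying between clusters.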

probabilityCol

Name of the per-cluster predicted-probability column in predictions.

New in version 2.1.0.