Class FValueTest

Object
org.apache.spark.ml.stat.FValueTest

public class FValueTest extends Object
FValue test for continuous data.
  • Constructor Details

    • FValueTest

      public FValueTest()
  • Method Details

    • test

      public static Dataset<Row> test(Dataset<Row> dataset, String featuresCol, String labelCol)
      Parameters:
      dataset - DataFrame of continuous labels and continuous features.
      featuresCol - Name of features column in dataset, of type Vector (VectorUDT)
      labelCol - Name of label column in dataset, of any numerical type
      Returns:
      DataFrame containing the test result for every feature against the label. This DataFrame will contain a single Row with the following fields: - pValues: Vector - degreesOfFreedom: Array[Long] - fValues: Vector Each of these fields has one value per feature.
    • test

      public static Dataset<Row> test(Dataset<Row> dataset, String featuresCol, String labelCol, boolean flatten)
      Parameters:
      dataset - DataFrame of continuous labels and continuous features.
      featuresCol - Name of features column in dataset, of type Vector (VectorUDT)
      labelCol - Name of label column in dataset, of any numerical type
      flatten - If false, the returned DataFrame contains only a single Row, otherwise, one row per feature.
      Returns:
      (undocumented)