Java stub for mllib Statistics.colStats(X: RDD[Vector]). TODO: figure out the return type.
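As a rough illustration of what column statistics over a collection of vectors involve, here is a minimal pure-Python sketch. It is not the stub's implementation: the function name col_stats, the dictionary keys, and the choice of unbiased sample variance are all assumptions made for the example, and the stub's actual return type is still open per the TODO above.

```python
def col_stats(rows):
    """Column-wise summary of equally sized numeric rows (illustrative only)."""
    if not rows:
        raise ValueError("need at least one row")
    n = len(rows)
    width = len(rows[0])
    if any(len(r) != width for r in rows):
        raise ValueError("all rows must have the same length")

    cols = list(zip(*rows))  # transpose: one tuple per column
    means = [sum(c) / n for c in cols]
    # Assumption: unbiased sample variance (divide by n - 1).
    variances = [
        sum((v - m) ** 2 for v in c) / (n - 1) if n > 1 else 0.0
        for c, m in zip(cols, means)
    ]
    return {
        "count": n,
        "mean": means,
        "variance": variances,
        "min": [min(c) for c in cols],
        "max": [max(c) for c in cols],
        "num_nonzeros": [sum(1 for v in c if v != 0) for c in cols],
    }
```

In this sketch each row stands in for one Vector of the RDD; a distributed version would aggregate the same quantities per partition and merge them.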
Java stub for mllib Statistics.corr(x: RDD[Double], y: RDD[Double], method: String).
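To make the role of the method argument concrete, the following sketch computes a correlation between two equally long sequences in plain Python. It assumes "pearson" is an accepted method name and implements only that one; the function name corr and the NaN result for constant input are choices made for this example, not documented behavior of the stub.

```python
from math import sqrt


def corr(x, y, method="pearson"):
    """Correlation of two equally long numeric sequences (illustrative only)."""
    if method != "pearson":
        # Assumption: only the Pearson method is sketched here.
        raise ValueError("unsupported method in this sketch: %r" % method)
    if len(x) != len(y) or not x:
        raise ValueError("x and y must be non-empty and of equal length")

    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        return float("nan")  # correlation is undefined for constant input
    return cov / (norm_x * norm_y)
```

Perfectly linearly related inputs give a value close to 1 or -1, up to floating-point rounding.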
Java stub for mllib Statistics.corr(X: RDD[Vector], method: String). Returns the correlation matrix serialized into a byte array understood by the deserializers in pyspark.
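The idea of handing a matrix across the Java/Python boundary as a byte array can be sketched with Python's struct module. The layout below (two big-endian 32-bit integers for the dimensions, followed by row-major 64-bit floats) is a hypothetical format invented for this example; it is not the format pyspark uses, which is defined in python/pyspark/mllib/_common.py.

```python
import struct


def serialize_matrix(rows):
    """Pack a dense matrix into bytes using a made-up layout (not pyspark's)."""
    n_rows = len(rows)
    n_cols = len(rows[0]) if rows else 0
    header = struct.pack(">ii", n_rows, n_cols)
    # struct.pack raises struct.error if a row has the wrong length.
    body = b"".join(struct.pack(">%dd" % n_cols, *row) for row in rows)
    return header + body


def deserialize_matrix(data):
    """Inverse of serialize_matrix for the same hypothetical layout."""
    n_rows, n_cols = struct.unpack_from(">ii", data, 0)
    offset = struct.calcsize(">ii")
    rows = []
    for _ in range(n_rows):
        rows.append(list(struct.unpack_from(">%dd" % n_cols, data, offset)))
        offset += 8 * n_cols  # each double occupies 8 bytes
    return rows
```

Both sides must agree on byte order, element type, and header layout, which is why the real format is fixed in one shared place.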
Loads and serializes labeled points saved with RDD#saveAsTextFile.
Java SparkContext
file or directory path in any Hadoop-supported file system URI
minimum number of partitions
serialized labeled points stored in a JavaRDD of byte arrays
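For orientation, this sketch parses a single text line into a label and a feature list. It assumes each saved line looks like "(label,[f1,f2,...])", which is an assumption about the text form of a labeled point with dense features; the function name parse_labeled_point is hypothetical, and other line formats are not handled.

```python
def parse_labeled_point(line):
    """Parse '(label,[f1,f2,...])' into (label, features). Format is assumed."""
    text = line.strip()
    if not (text.startswith("(") and text.endswith(")")):
        raise ValueError("expected a line wrapped in parentheses: %r" % line)

    # Split at the first comma: label on the left, bracketed features on the right.
    label_text, _, features_text = text[1:-1].partition(",")
    features_text = features_text.strip()
    if not (features_text.startswith("[") and features_text.endswith("]")):
        raise ValueError("expected bracketed features: %r" % line)

    inner = features_text[1:-1].strip()
    features = [float(tok) for tok in inner.split(",")] if inner else []
    return float(label_text), features
```

Applied to every line of every file under the given path, this would yield the records that the stub then serializes for Python.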
Java stub for Python mllib RandomRDDGenerators.normalRDD().
Java stub for Python mllib RandomRDDGenerators.normalVectorRDD().
Java stub for Python mllib RandomRDDGenerators.poissonRDD().
Java stub for Python mllib RandomRDDGenerators.poissonVectorRDD().
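The generators above produce collections of i.i.d. samples, either scalars or fixed-length vectors. The sketch below mimics that shape locally with nested lists standing in for partitions. Everything here is an assumption made for illustration: the function names, the use of seed + partition index to seed each partition, and Knuth's multiplication method for Poisson sampling are not taken from the stubs.

```python
import math
import random


def normal_sample(rng):
    """One draw from the standard normal distribution."""
    return rng.gauss(0.0, 1.0)


def poisson_sample(rng, mean):
    """One Poisson draw via Knuth's method (suitable for small means)."""
    limit = math.exp(-mean)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def random_vectors(sampler, num_rows, num_cols, num_partitions, seed):
    """Return a list of partitions, each a list of rows of sampled values."""
    base, extra = divmod(num_rows, num_partitions)
    partitions = []
    for index in range(num_partitions):
        # Assumption: each partition gets its own deterministic generator.
        rng = random.Random(seed + index)
        size = base + (1 if index < extra else 0)
        partitions.append(
            [[sampler(rng) for _ in range(num_cols)] for _ in range(size)]
        )
    return partitions
```

For example, random_vectors(lambda rng: poisson_sample(rng, 4.0), 5, 3, 2, 42) yields two partitions holding three and two rows of non-negative integers; calling it again with the same seed reproduces the same values.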
Predict the labels of the given data points. This is a Java stub for Python DecisionTreeModel.predict().
A JavaRDD with serialized feature vectors
JavaRDD of serialized predictions
Predict the label of the given data point. This is a Java stub for Python DecisionTreeModel.predict().
Serialized feature vector for data point
predicted label
Java stub for Python mllib ALS.train(). This stub returns a handle to the Java object rather than the object's content. The Python code must take extra care to ensure the handle is freed on exit; see the Py4J documentation.
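One way the Python side can make sure such a handle is released is to wrap it in an object that detaches it exactly once. The sketch below is self-contained and uses FakeGateway, a hypothetical stand-in that only records detach calls; with a real Py4J gateway the wrapper would call the gateway's own detach method instead. The class names and the context-manager design are assumptions for illustration, not the pyspark implementation.

```python
class FakeGateway:
    """Hypothetical stand-in for a Py4J gateway; records detached handles."""

    def __init__(self):
        self.detached = []

    def detach(self, handle):
        self.detached.append(handle)


class ModelHandle:
    """Owns a Java-side handle and releases it at most once."""

    def __init__(self, gateway, java_handle):
        self._gateway = gateway
        self._java_handle = java_handle

    def close(self):
        # Idempotent: a second call finds no handle and does nothing.
        handle = getattr(self, "_java_handle", None)
        if handle is not None:
            self._gateway.detach(handle)
            self._java_handle = None

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        self.close()

    def __del__(self):
        # Fallback for callers that never call close() explicitly.
        self.close()
```

Using the wrapper in a with block releases the handle deterministically; the __del__ fallback only covers the case where the caller forgets, and its timing depends on the garbage collector.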
Java stub for Python mllib DecisionTree.train(). This stub returns a handle to the Java object rather than the object's content. The Python code must take extra care to ensure the handle is freed on exit; see the Py4J documentation.
Training data
Categorical features info, as Java map
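The categorical features info map can be pictured as a dictionary on the Python side. The sketch below assumes the common convention that an entry index -> k marks the feature at that index as categorical with k categories encoded as 0, 1, ..., k - 1, and that features missing from the map are continuous. That convention and the function name check_categorical_features are assumptions for this example.

```python
def check_categorical_features(features, categorical_features_info):
    """Validate one feature vector against an {index: num_categories} map."""
    for index, num_categories in categorical_features_info.items():
        if not 0 <= index < len(features):
            raise ValueError("feature index %d is out of range" % index)
        value = features[index]
        # A categorical value must be an integer category id in [0, k).
        if value != int(value) or not 0 <= value < num_categories:
            raise ValueError(
                "feature %d has value %r, expected a category in [0, %d)"
                % (index, value, num_categories)
            )
    return True
```

With the map {0: 3, 2: 2}, the vector [2.0, 0.7, 1.0] passes, since feature 1 is treated as continuous, while [3.0, 0.7, 1.0] is rejected because feature 0 only has categories 0, 1, and 2.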
Java stub for Python mllib ALS.trainImplicit(). This stub returns a handle to the Java object rather than the object's content. The Python code must take extra care to ensure the handle is freed on exit; see the Py4J documentation.
Java stub for Python mllib KMeans.train().
Java stub for Python mllib LassoWithSGD.train().
Java stub for Python mllib LinearRegressionWithSGD.train().
Java stub for Python mllib LogisticRegressionWithSGD.train().
Java stub for NaiveBayes.train().
Java stub for Python mllib RidgeRegressionWithSGD.train().
Java stub for Python mllib SVMWithSGD.train().
Java stub for Python mllib RandomRDDGenerators.uniformRDD().
Java stub for Python mllib RandomRDDGenerators.uniformVectorRDD().
:: DeveloperApi :: The Java stubs necessary for the Python mllib bindings.
See python/pyspark/mllib/_common.py for the mutually agreed upon data format.