pyspark.RDD.variance

RDD.variance() → float

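Compute the population variance of this RDD’s elements, that is, the mean squared deviation from the mean (dividing by N). For the sample variance (dividing by N - 1), see RDD.sampleVariance().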

New in version 0.9.1.

Returns
float

the population variance of all elements in the RDD

Examples

>>> sc.parallelize([1, 2, 3]).variance()
0.666...
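
The result above is what you get when the sum of squared deviations is divided by N rather than N - 1. The following is a minimal plain-Python sketch of that arithmetic, runnable without a Spark context. It is not Spark's implementation, and the helper name `population_variance` is hypothetical, introduced only for illustration.

```python
import statistics


def population_variance(values):
    """Hypothetical helper: mean squared deviation from the mean (divides by N)."""
    values = list(values)
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / n


data = [1, 2, 3]

# Divides by N = 3; this is the quantity the doctest above shows as 0.666...
pop_var = population_variance(data)

# Divides by N - 1 = 2; shown for contrast with the population variance.
sample_var = statistics.variance(data)

print(pop_var, sample_var)
```

For this input the mean is 2 and the squared deviations sum to 2, so the population variance is 2/3 while the sample variance is 1. The standard library's `statistics.pvariance` computes the same population quantity and can serve as a cross-check.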