Core Spark functionality.
Spark's broadcast variables, used to broadcast immutable datasets to all nodes.
ALPHA COMPONENT: GraphX is a graph processing framework built on top of Spark.
IO codecs used for compression.
DataFrame-based machine learning APIs to let users quickly assemble and configure practical machine learning pipelines.
RDD-based machine learning APIs (in maintenance mode).
This package contains the default implementation of the decision tree algorithm.
Support for approximate results.
Provides several RDD implementations.
Spark's scheduling components.
Pluggable serializers for RDD and shuffle data.
Allows the execution of relational queries, including those expressed in SQL, using Spark.
Spark Streaming functionality.