Seamlessly work with both graphs and collections.
GraphX unifies ETL, exploratory analysis, and iterative graph computation within a single system. You can view the same data as both graphs and collections, transform and join graphs with RDDs efficiently, and write custom iterative graph algorithms using the Pregel API.
Comparable performance to the fastest specialized graph processing systems.
GraphX competes on performance with the fastest graph systems while retaining Spark's flexibility, fault tolerance, and ease of use.
Choose from a growing library of graph algorithms.
In addition to a highly flexible API, GraphX comes with a variety of graph algorithms, many of which were contributed by our users.
GraphX is developed as part of the Apache Spark project. It thus gets tested and updated with each Spark release.
If you have questions about the library, ask on the Spark mailing lists.
GraphX is in the alpha stage and welcomes contributions. If you'd like to submit a change to GraphX, read how to contribute to Spark and send us a patch!
To get started with GraphX: