pyspark.sql.DataFrame.toJSON

DataFrame.toJSON(use_unicode: bool = True) → pyspark.rdd.RDD[str][source]

Converts a DataFrame into a RDD of string.

Each row is turned into a JSON document as one element in the returned RDD.

New in version 1.3.0.

Parameters
use_unicodebool, optional, default True

Whether to convert to unicode or not.

Returns
RDD

Examples

>>> df = spark.createDataFrame([(2, "Alice"), (5, "Bob")], schema=["age", "name"])
>>> df.toJSON().first()
'{"age":2,"name":"Alice"}'