pyspark.sql.functions.map_from_arrays

pyspark.sql.functions.map_from_arrays(col1: ColumnOrName, col2: ColumnOrName) → pyspark.sql.column.Column[source]

Creates a new map from two arrays.

New in version 2.4.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters
col1Column or str

name of column containing a set of keys. All elements should not be null

col2Column or str

name of column containing a set of values

Returns
Column

a column of map type.

Examples

>>> df = spark.createDataFrame([([2, 5], ['a', 'b'])], ['k', 'v'])
>>> df = df.select(map_from_arrays(df.k, df.v).alias("col"))
>>> df.show()
+----------------+
|             col|
+----------------+
|{2 -> a, 5 -> b}|
+----------------+
>>> df.printSchema()
root
 |-- col: map (nullable = true)
 |    |-- key: long
 |    |-- value: string (valueContainsNull = true)