pyspark.sql.functions.datediff

pyspark.sql.functions.datediff(end: ColumnOrName, start: ColumnOrName) → pyspark.sql.column.Column

Returns the number of days from start to end. The result is negative when end is earlier than start.

New in version 1.5.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters
end : Column or str

The end date column (the date to count to).

start : Column or str

The start date column (the date to count from).

Returns
Column

The difference in days between the two dates, computed as end minus start.

Examples

>>> from pyspark.sql.functions import datediff
>>> df = spark.createDataFrame([('2015-04-08', '2015-05-10')], ['d1', 'd2'])
>>> df.select(datediff(df.d2, df.d1).alias('diff')).collect()
[Row(diff=32)]
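
To sanity-check the arithmetic and the argument order without a Spark session, the same day count can be reproduced in plain Python. This is only an illustrative analogue, not how Spark computes the value, and the helper name days_between is hypothetical. It assumes both inputs are ISO 'YYYY-MM-DD' strings.

```python
from datetime import date


def days_between(end: str, start: str) -> int:
    """Hypothetical helper: whole days from start to end.

    Mirrors the (end, start) argument order of datediff; both
    arguments are assumed to be ISO 'YYYY-MM-DD' strings.
    """
    return (date.fromisoformat(end) - date.fromisoformat(start)).days


# Same dates as the example above: d2 is the end, d1 is the start.
forward = days_between("2015-05-10", "2015-04-08")

# Swapping the arguments flips the sign of the result.
backward = days_between("2015-04-08", "2015-05-10")

print(forward, backward)
```

Because the end date comes first in the signature, passing the two columns in the opposite order gives the same magnitude with the opposite sign.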