pyspark.sql.functions.xpath_number

pyspark.sql.functions.xpath_number(xml: ColumnOrName, path: ColumnOrName) → pyspark.sql.column.Column

Evaluates the XPath expression path against the XML string xml and returns the result as a double. Returns zero if no match is found, or NaN if a match is found but the value is non-numeric.

New in version 3.5.0.

Examples

>>> import pyspark.sql.functions as sf
>>> spark.createDataFrame(
...     [('<a><b>1</b><b>2</b></a>',)], ['x']
... ).select(sf.xpath_number('x', sf.lit('sum(a/b)'))).show()
+-------------------------+
|xpath_number(x, sum(a/b))|
+-------------------------+
|                      3.0|
+-------------------------+