Package org.apache.spark.sql.util
Class ArrowUtils
Object
org.apache.spark.sql.util.ArrowUtils
Constructor Summary
Constructors
ArrowUtils()
Method Summary
Modifier and Type | Method | Description
static DataType | fromArrowField(org.apache.arrow.vector.types.pojo.Field field) |
static StructType | fromArrowSchema(org.apache.arrow.vector.types.pojo.Schema schema) |
static DataType | fromArrowType(org.apache.arrow.vector.types.pojo.ArrowType dt) |
static org.apache.arrow.memory.RootAllocator | rootAllocator() |
static org.apache.arrow.vector.types.pojo.Field | toArrowField(String name, DataType dt, boolean nullable, String timeZoneId, boolean largeVarTypes) | Maps field from Spark to Arrow.
static org.apache.arrow.vector.types.pojo.Schema | toArrowSchema(StructType schema, String timeZoneId, boolean errorOnDuplicatedFieldNames, boolean largeVarTypes) | Maps schema from Spark to Arrow.
static org.apache.arrow.vector.types.pojo.ArrowType | toArrowType(DataType dt, String timeZoneId, boolean largeVarTypes) | Maps data type from Spark to Arrow.
Constructor Details
ArrowUtils
public ArrowUtils()
Method Details
rootAllocator
public static org.apache.arrow.memory.RootAllocator rootAllocator()
toArrowType
public static org.apache.arrow.vector.types.pojo.ArrowType toArrowType(DataType dt, String timeZoneId, boolean largeVarTypes)
Maps a data type from Spark to Arrow. NOTE: timeZoneId is required for TimestampType.
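The timeZoneId requirement noted for toArrowType can be illustrated with a small stdlib-only toy model. This is a sketch, not Spark's implementation, and it does not call ArrowUtils. The class name ArrowTypeSketch, the string type names, and the exception types are all hypothetical. The Utf8/LargeUtf8 and Binary/LargeBinary choices and the microsecond timestamp unit are assumptions about how largeVarTypes and timeZoneId are used.

```java
import java.util.Objects;

// Toy model only: types are plain strings, not Spark DataType or Arrow ArrowType objects.
final class ArrowTypeSketch {

    // Hypothetical helper mirroring the parameter list of toArrowType(dt, timeZoneId, largeVarTypes).
    static String toArrowTypeName(String sparkType, String timeZoneId, boolean largeVarTypes) {
        Objects.requireNonNull(sparkType, "sparkType");
        switch (sparkType) {
            case "string":
                // Assumption: largeVarTypes selects the 64-bit-offset variant.
                return largeVarTypes ? "LargeUtf8" : "Utf8";
            case "binary":
                return largeVarTypes ? "LargeBinary" : "Binary";
            case "timestamp":
                // The rule from the NOTE: a timestamp cannot be mapped without a time zone id.
                if (timeZoneId == null) {
                    throw new IllegalArgumentException("timeZoneId is required for timestamp");
                }
                return "Timestamp(MICROSECOND, " + timeZoneId + ")";
            case "int":
                // Types without a time zone ignore timeZoneId entirely.
                return "Int(32, signed)";
            default:
                throw new UnsupportedOperationException("not modelled in this sketch: " + sparkType);
        }
    }

    public static void main(String[] args) {
        System.out.println(toArrowTypeName("string", null, false));
        System.out.println(toArrowTypeName("timestamp", "UTC", false));
    }
}
```

In this model, only the timestamp branch reads timeZoneId, so callers that never map timestamps could pass null; whether the real method tolerates that is not stated on this page.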
fromArrowType
public static DataType fromArrowType(org.apache.arrow.vector.types.pojo.ArrowType dt)
toArrowField
public static org.apache.arrow.vector.types.pojo.Field toArrowField(String name, DataType dt, boolean nullable, String timeZoneId, boolean largeVarTypes)
Maps a field from Spark to Arrow. NOTE: timeZoneId is required for TimestampType.
fromArrowField
public static DataType fromArrowField(org.apache.arrow.vector.types.pojo.Field field)
toArrowSchema
public static org.apache.arrow.vector.types.pojo.Schema toArrowSchema(StructType schema, String timeZoneId, boolean errorOnDuplicatedFieldNames, boolean largeVarTypes)
Maps a schema from Spark to Arrow. NOTE: timeZoneId is required if the StructType contains a TimestampType.
fromArrowSchema
public static StructType fromArrowSchema(org.apache.arrow.vector.types.pojo.Schema schema)