Spark Release 3.2.4

Spark 3.2.4 is a maintenance release containing stability fixes. This release is based on the branch-3.2 maintenance branch of Spark. We strongly recommend all 3.2 users to upgrade to this stable release.

Notable changes

  • [SPARK-38173]: Quoted column cannot be recognized correctly when quotedRegexColumnNames is true
  • [SPARK-39399]: proxy-user not working for Spark on k8s in cluster deploy mode
  • [SPARK-40817]: Remote spark.jars URIs ignored for Spark on Kubernetes in cluster mode
  • [SPARK-40819]: Parquet INT64 (TIMESTAMP(NANOS,true)) now throwing Illegal Parquet type instead of automatically converting to LongType
  • [SPARK-41162]: Anti-join must not be pushed below aggregation with ambiguous predicates
  • [SPARK-41254]: YarnAllocator.rpIdToYarnResource map is not properly updated
  • [SPARK-41360]: Avoid BlockManager re-registration if the executor has been lost
  • [SPARK-41376]: Executor netty direct memory check should respect spark.shuffle.io.preferDirectBufs
  • [SPARK-41388]: getReusablePVCs should ignore recently created PVCs in the previous batch
  • [SPARK-41415]: SASL Request Retries
  • [SPARK-41554]: Decimal.changePrecision produces ArrayIndexOutOfBoundsException
  • [SPARK-41732]: Session window: analysis rule “SessionWindowing” does not apply tree-pattern based pruning
  • [SPARK-41952]: Upgrade Parquet to fix off-heap memory leaks in Zstd codec
  • [SPARK-41989]: PYARROW_IGNORE_TIMEZONE warning can break application logging setup
  • [SPARK-42071]: Register scala.math.Ordering$Reverse to KyroSerializer
  • [SPARK-42090]: Introduce sasl retry count in RetryingBlockTransferor
  • [SPARK-42157]: spark.scheduler.mode=FAIR should provide FAIR scheduler
  • [SPARK-42168]: CoGroup with window function returns incorrect result when partition keys differ in order
  • [SPARK-42259]: ResolveGroupingAnalytics should take care of Python UDAF
  • [SPARK-42462]: Prevent docker-image-tool.sh from publishing OCI manifests
  • [SPARK-42478]: Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory
  • [SPARK-42596]: OMP_NUM_THREADS not set to number of executor cores by default
  • [SPARK-42697]: /api/v1/applications return 0 for duration
  • [SPARK-42747]: Fix incorrect internal status of LoR and AFT
  • [SPARK-42785]: When spark submit without –deploy-mode, will face NPE in Kubernetes Case
  • [SPARK-42906]: Replace a starting digit with x in resource name prefix
  • [SPARK-42967]: Fix SparkListenerTaskStart.stageAttemptId when a task is started after the stage is cancelled
  • [SPARK-43004]: vendor==vendor typo in ResourceRequest.equals()

Dependency Changes

While being a maintenance release we did still upgrade some dependencies in this release they are:

You can consult JIRA for the detailed changes.

We would like to acknowledge all community members for contributing patches to this release.


Spark News Archive