Friday, July 28, 2023

Removing Duplicates in Hudi Partitions with Insert_Overwrite API and Spark SQL

lakehouse

No comments:

Post a Comment

SPJ Joins in Iceberg how to use them | Faster Join Avoid Shuffle

demo Create Spark Session ¶ In [1]: from pyspark.sql import SparkSe...