Number of Spark DataFrame partitions created when reading data from a Hive table
I have a question about the number of partitions in a Spark DataFrame.
Suppose I have a Hive table (employee) with the columns (name, age, id, location):
CREATE TABLE employee (name String, age String, id Int) PARTITIONED BY (location String);
The employee table has 10 distinct locations, so its data is split into 10 partitions (one directory per location) in HDFS.
Suppose I create a Spark DataFrame (df) by reading all the data of the Hive (employee) table.
How many Spark partitions will be created for the DataFrame (df)?
df.rdd.partitions.size = ??
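
For reference, this is a minimal sketch of how I would check the value empirically. It assumes Spark 2.x or later built with Hive support and that the employee table is visible through the configured metastore. The object name and application name are hypothetical, and the master URL is expected to come from spark-submit.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical driver object; intended to be launched with spark-submit.
object EmployeePartitionCount {
  def main(args: Array[String]): Unit = {
    // enableHiveSupport() lets spark.table resolve names in the Hive metastore.
    val spark = SparkSession.builder()
      .appName("employee-partition-count") // hypothetical application name
      .enableHiveSupport()
      .getOrCreate()

    // Read the whole Hive table, across all of its location partitions.
    val df = spark.table("employee")

    // getNumPartitions reports the same value as df.rdd.partitions.size.
    println(s"Spark partitions: ${df.rdd.getNumPartitions}")

    spark.stop()
  }
}
```

The same two lines (spark.table and getNumPartitions) can also be run directly in spark-shell, where a session named spark already exists.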