Spark & โโHCatalog?
I feel comfortable downloading HCatalog with Pig and was wondering if Spark could be used instead of Pig. Unfortunately I'm pretty new to Spark ...
Can you provide any input on how to get started? Are there Spark libraries to use? Any examples? I've done all the exercises at http://spark.apache.org/ but they focus on RDD and don't go any further.
Any help would be grateful ...
Regards, Pawel
+3
source to share
3 answers
You can refer to the following link for using the HCLog InputFormat wrapper with Spark; which was written before SparkSQL.
https://gist.github.com/granturing/7201912
+1
source to share