Monday 24 August 2020

Containerization of PySpark Using Kubernetes by Ajaykumar Baljoshi via @sigmoidInc

 This article demonstrates the approach of how to use Spark on Kubernetes. It also includes a brief comparison between various cluster managers available for Spark.

I thought this was a really good article with a great level of detail. If you are interested in doing this in real life I recommend you read this first as there are code snippets and it will get you ahead of the curve.

No comments:

Post a Comment

Note: only a member of this blog may post a comment.