Kubernetes Spark Operator
Kubernetes Spark Operator
Last updated
Was this helpful?
Kubernetes Spark Operator
Last updated
Was this helpful?
Kubernetes에서 Spark Application를 구동하려면 Spark-submit, Spark on Kubernetes Operator를 사용할 수 있다.
여기서는 Kubernetes Spark Operator를 사용 것에 대해 알아본다.
헬름 차트를 사용해 Spark Operator를 설치한다.
Spark Job namespace 생성한다.
Spark Operator 헬름 차트를 레포에 등록하고 Operator는 spark-operator, Application은 spark-jobs namespace를 지정하여 Operator를 설치한다.
Spark Job 을 생성하고 실행한다.
완료된 Spark Job 을 삭제한다.
필요하면 Spark Operator를 삭제한다.
https://aws.amazon.com/ko/blogs/compute/running-cost-optimized-spark-workloads-on-kubernetes-using-ec2-spot-instances/ https://www.datamechanics.co/blog-post/setting-up-managing-monitoring-spark-on-kubernetes https://dev.to/stack-labs/my-journey-with-spark-on-kubernetes-in-python-1-3-4nl3 https://kubernetes.io/docs/tasks/administer-cluster/namespaces/ https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/blob/master/docs/quick-start-guide.md#running-the-examples https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/issues/454https://spark.apache.org/downloads.html https://mirror.navercorp.com/apache/spark/spark-3.1.2/spark-3.1.2-bin-hadoop3.2.tgz https://www.youtube.com/watch?v=SqKlPiv_RRg https://www.slideshare.net/seungyongoh3/rearchitecting-data-platform-with-kubernetes