feat: Support for Spark Client in Kubeflow SDK by Shekharrajak · Pull Request #158 · kubeflow/sdk

Shekharrajak · 2025-11-11T19:12:12Z

This PR introduces the Kubeflow Spark Client - a cloud-native
Python client for managing Apache Spark applications on
Kubernetes. It provides a unified, Pythonic interface
for submitting, monitoring, and managing Spark jobs using the
Kubeflow Spark Operator.

KEP: #163

Few examples added in examples/spark directory to play with.

# Setup the env using 
cd examples/spark

./setup_test_environment.sh

# run simple example 
python test_spark_client_integration.py

# spark connect 
./setup_spark_connect.sh
python ipython_spark_connect_demo.py

Slack thread: https://cloud-native.slack.com/archives/C074588U7EG/p1763656387742729?thread_ts=1763568656.642239&cid=C074588U7EG

google-oss-prow · 2025-11-11T19:12:18Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign astefanutti for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

andreyvelich · 2025-11-12T06:08:47Z

/retitle feat: Support for Spark Client in Kubeflow SDK

andreyvelich · 2025-11-12T06:09:37Z

Thanks a lot for this @Shekharrajak 🚀
We will review this PR after KubeCon + CloudNativeCon NA!
cc @kubeflow/kubeflow-sdk-team

andreyvelich · 2025-11-12T06:15:52Z

/cc @akshaychitneni @shravan-achar @bigsur0 @vara-bonthu @nabuskey @ChenYi015 @jacobsalway @aagumin @ImpSy

google-oss-prow · 2025-11-12T06:16:02Z

@andreyvelich: GitHub didn't allow me to request PR reviews from the following users: aagumin, shravan-achar, bigsur0.

Note that only kubeflow members and repo collaborators can review this PR, and authors cannot review their own PRs.

Details

In response to this:

/cc @akshaychitneni @shravan-achar @bigsur0 @vara-bonthu @nabuskey @ChenYi015 @jacobsalway @aagumin @ImpSy

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

kramaranya · 2025-11-12T22:57:07Z

Shall we create a KEP first since this is quite a big addition to Kubeflow SDK?

Shekharrajak · 2025-11-14T18:09:57Z

Shall we create a KEP first this is quite a big addition to Kubeflow SDK?

I can create one, but this PR is having the same pattern as trainer client.

aagumin · 2025-11-17T08:34:05Z

Is there a plan for the SDK to support working with Spark Connect? For example, a data scientist might have a dynamic infrastructure where they can create a Spark Connect cluster on demand.
It would also be great to see the required Kubernetes RBAC in the documentation so that all examples work. Ideally, it would be limited to CRDs and pods/logs.

andreyvelich · 2025-11-17T16:51:04Z

@Shekharrajak Let's create a simple KEP which identifies use-cases and users patterns to interact with SparkApplication CRD.
I doesn't need to be super detailed like HPO: https://github.com/kubeflow/sdk/tree/main/docs/proposals/46-hyperparameter-optimization, but we can discuss initial API design there.

andreyvelich · 2025-11-17T16:54:51Z

Is there a plan for the SDK to support working with Spark Connect? For example, a data scientist might have a dynamic infrastructure where they can create a Spark Connect cluster on demand.
It would also be great to see the required Kubernetes RBAC in the documentation so that all examples work. Ideally, it would be limited to CRDs and pods/logs.

Yeah, I think we can talk about solutions to connect to existing Spark cluster, and where Kubeflow SDK APIs might be helpful.

I know that @lresende and @fresende added instructions on how to connect Jupyter Notebooks to Spark cluster via Jupyter Enterprise Gateway, but we can discuss various options for Spark Connect too.
https://www.kubeflow.org/docs/components/spark-operator/user-guide/notebooks-spark-operator/

Shekharrajak · 2025-11-17T17:41:03Z

@Shekharrajak Let's create a simple KEP which identifies use-cases and users patterns to interact with SparkApplication CRD. I doesn't need to be super detailed like HPO: https://github.com/kubeflow/sdk/tree/main/docs/proposals/46-hyperparameter-optimization, but we can discuss initial API design there.

Created the doc: #163 Please have a look.

Signed-off-by: shekharrajak <shekharrajak@live.com>

updated the spark connect backend and examples Signed-off-by: shekharrajak <shekharrajak@live.com>

Support for Spark connect backend in Spark Client

updated the spark connect backend and examples Signed-off-by: shekharrajak <shekharrajak@live.com>

Spark connect backend for Spark Client

Update Python docstrings to use SparkSessionClient and BatchSparkClient

consistent apis like trainer client

…_status - Add get_job_logs(submission_id, executor_id, follow) method for retrieving logs - Rename wait_for_job to wait_for_job_status for TrainerClient API consistency - Update docstring examples to reflect the new method name

- Add get_job_status(job_id) method to retrieve the current status of a job - Update documentation to include usage examples for the new method

Shekharrajak · 2026-01-13T14:48:05Z

Since there is few design changes - started fresh here #225

google-oss-prow bot requested review from kramaranya and szaher November 11, 2025 19:12

google-oss-prow bot added the size/XXL label Nov 11, 2025

google-oss-prow bot changed the title ~~Spark Client~~ feat: Support for Spark Client in Kubeflow SDK Nov 12, 2025

google-oss-prow bot requested review from ChenYi015, ImpSy, akshaychitneni, jacobsalway, nabuskey and vara-bonthu November 12, 2025 06:15

Shekharrajak mentioned this pull request Nov 17, 2025

feat(docs): KEP- Spark Client for Kubeflow SDK #163

Merged

Shekharrajak force-pushed the feature/spark-client branch 4 times, most recently from b3b3941 to 210ad60 Compare November 19, 2025 05:46

initial version of spark client connecting to k8s spark cluster

fa9c89e

Signed-off-by: shekharrajak <shekharrajak@live.com>

Shekharrajak force-pushed the feature/spark-client branch from 210ad60 to fa9c89e Compare November 19, 2025 08:50

Shekharrajak mentioned this pull request Nov 20, 2025

Support for Spark connect backend in Spark Client Shekharrajak/sdk#1

Merged

Spark connect backend for Spark Client

0ef2c7c

updated the spark connect backend and examples Signed-off-by: shekharrajak <shekharrajak@live.com>

Shekharrajak and others added 12 commits November 20, 2025 23:15

Merge pull request #1 from Shekharrajak/feature/spark-connect

9d70256

Support for Spark connect backend in Spark Client

Spark connect backend for Spark Client

20296e5

updated the spark connect backend and examples Signed-off-by: shekharrajak <shekharrajak@live.com>

Merge pull request #2 from Shekharrajak/feature/spark-connect

92f4b6e

Spark connect backend for Spark Client

Update Python docstrings to use SparkSessionClient and BatchSparkClient

1c52349

Merge pull request #3 from Shekharrajak/feature/spark-connect

b697d64

Update Python docstrings to use SparkSessionClient and BatchSparkClient

consistent apis like trainer client

5fc0308

Merge pull request #4 from Shekharrajak/feature/spark-connect

7ebac5d

consistent apis like trainer client

Merge branch 'kubeflow:main' into feature/spark-client

3fe30c9

feat(spark): implement get_job_status method for job monitoring

c05b01e

- Add get_job_status(job_id) method to retrieve the current status of a job - Update documentation to include usage examples for the new method

updates after community call discussion and KEP update

bedae57

examples updated

28486b0

Shekharrajak closed this Jan 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Support for Spark Client in Kubeflow SDK#158

feat: Support for Spark Client in Kubeflow SDK#158
Shekharrajak wants to merge 14 commits intokubeflow:mainfrom
Shekharrajak:feature/spark-client

Shekharrajak commented Nov 11, 2025 •

edited

Loading

Uh oh!

google-oss-prow bot commented Nov 11, 2025

Uh oh!

andreyvelich commented Nov 12, 2025

Uh oh!

andreyvelich commented Nov 12, 2025

Uh oh!

andreyvelich commented Nov 12, 2025

Uh oh!

google-oss-prow bot commented Nov 12, 2025

Uh oh!

kramaranya commented Nov 12, 2025 •

edited

Loading

Uh oh!

Shekharrajak commented Nov 14, 2025

Uh oh!

aagumin commented Nov 17, 2025

Uh oh!

andreyvelich commented Nov 17, 2025

Uh oh!

andreyvelich commented Nov 17, 2025

Uh oh!

Shekharrajak commented Nov 17, 2025

Uh oh!

Shekharrajak commented Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Shekharrajak commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

google-oss-prow bot commented Nov 11, 2025

Uh oh!

andreyvelich commented Nov 12, 2025

Uh oh!

andreyvelich commented Nov 12, 2025

Uh oh!

andreyvelich commented Nov 12, 2025

Uh oh!

google-oss-prow bot commented Nov 12, 2025

Uh oh!

kramaranya commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Shekharrajak commented Nov 14, 2025

Uh oh!

aagumin commented Nov 17, 2025

Uh oh!

andreyvelich commented Nov 17, 2025

Uh oh!

andreyvelich commented Nov 17, 2025

Uh oh!

Shekharrajak commented Nov 17, 2025

Uh oh!

Shekharrajak commented Jan 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Shekharrajak commented Nov 11, 2025 •

edited

Loading

kramaranya commented Nov 12, 2025 •

edited

Loading