[SparkConnect] Support url path in SparkConnect connection string #56816

Description

@hiboyang

Spark Connect currently uses connection strings such as sc://host1:15002, which do not support a path in the URL; for example, a value like sc://host1/path1:15002 is not supported.

In Kubernetes, users can set up an Ingress to expose the Spark Connect driver endpoint under a matching URL path, e.g. http://host1/path1 can be routed to the driver endpoint. The Spark Connect client needs to be updated to support this scenario.

This could be a client-side change. For example, the Python Spark Connect client could connect to the server based on the host name and intercept gRPC calls so that the path does not interfere with the gRPC protocol.
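
To make the idea concrete, here is a minimal sketch of the parsing step only, not the prototype itself. It assumes the path would be written after the port in the usual URL form (sc://host1:15002/path1/;token=abc) rather than the sc://host1/path1:15002 form quoted above, and that parameters keep the existing `;key=value` style. The names `parse_connect_url` and `prefixed_method` are hypothetical and are not part of the Spark Connect client.

```python
from urllib.parse import urlsplit

DEFAULT_PORT = 15002  # default Spark Connect port, as in sc://host1:15002


def parse_connect_url(url):
    """Split a connection string into channel target, path prefix, and params.

    Hypothetical helper: assumes the form sc://host[:port][/path][/;k=v;...].
    """
    parts = urlsplit(url)
    if parts.scheme != "sc":
        raise ValueError(f"expected an sc:// URL, got {url!r}")
    if not parts.hostname:
        raise ValueError(f"no host found in {url!r}")

    # Everything before the first ';' is the path; the rest is parameters.
    raw_path, _, raw_params = parts.path.partition(";")
    trimmed = raw_path.strip("/")
    path_prefix = "/" + trimmed if trimmed else ""

    params = {}
    for item in raw_params.split(";"):
        if not item:
            continue
        key, _, value = item.partition("=")
        params[key] = value

    # The gRPC channel is opened against host:port only, without the path.
    target = f"{parts.hostname}:{parts.port or DEFAULT_PORT}"
    return {"target": target, "path_prefix": path_prefix, "params": params}


def prefixed_method(path_prefix, method):
    """Prepend the Ingress path to a gRPC method path (illustrative only)."""
    return f"{path_prefix}{method}"
```

With this split, the channel would be created from `target` alone, and the path prefix would be handled separately (for instance by an interceptor or by the Ingress rewriting the path), so that the `/service/method` path gRPC sends over HTTP/2 stays intact on the server side. How the prefix is actually applied is left open here.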

I am working on a prototype and am open to feedback and suggestions.
