Skip to main content

Databricks

A Databricks connection lets you access your cloud data warehouse. It’s optimized for analytics and built to support modern data warehousing workloads with high performance and scalability.

Feature support

The table below outlines whether the connection supports certain Prophecy features.

FeatureSupported
Read data with a Source gemYes
Write data with a Target gemYes
Browse data in the Environment browserYes

Parameters

To create a connection with Databricks, enter the following parameters:

ParameterDescription
Connection NameName to to identify your connection
JDBC URLURL to connect to your SQL warehouse
Example: jdbc:databricks://<databricks-instance>:443/default;transportMode=http;ssl=1;AuthMech=3;httpPath=/sql/1.0/warehouses/<warehouse-id>
CatalogDefault catalog for reading and writing data
SchemaDefault schema for reading and writing data
Authentication methodHow you want to authenticate your Databricks account (OAuth or PAT)

Authentication methods

You can configure your Databricks connection with one of three authentication methods:

  • Databricks OAuth. Prophecy prompts you to sign in with Databricks.
  • Personal Access Token (PAT). Use a secret to enter your PAT.

Databricks permissions

When you create an Databricks connection in Prophecy, access permissions are tied to the credentials you use. This is because Prophecy uses your credentials to execute all data operations, such as reading from or writing to tables.

To fully leverage a Databricks connection in Prophecy, you need the following Databricks permissions:

  • Create Schema
  • Create or Replace Table
  • Drop Table
  • Insert Into
  • Create Volume PROPHECY_ORCHESTRATOR_VOLUME
  • Access to /Volumes/<catalog>/<schema>/PROPHECY_ORCHESTRATOR_VOLUME
  • Remove File permission inside the volume
  • Copy Into (from the volume)

Sharing connections within teams

Connections are stored inside fabrics, which are assigned to certain teams. The way Prophecy manages how Databricks connections are shared within teams depends on the authentication method used to configure the connection.

OAuth

When you create a Databricks connection with OAuth:

  • Team members who use the connection will have to sign in with OAuth using their personal credentials.
  • Permissions will be inherited from their personal credentials.

Personal Access Token (PAT)

When you create a Databricks connection using a PAT:

  • All team members that have access to the fabric can use the connection in their projects.
  • No additional authentication is required. Team members automatically inherit the access and permissions of the original connection.
caution

Be mindful of the access level granted by the stored credentials. Anyone on the team will have the same permissions—including access to sensitive data if allowed.

To manage this securely, consider creating a dedicated fabric and team for high-sensitivity connections. This way, only approved users have access to those credentials.

Fetching data

Prophecy fetches data from Databricks connections in the following ways:

  • When you browse an Databricks connection in the Environment browser, Prophecy fetches data on demand as you expand folders. You can manually refresh the Environment browser to see updated files.

  • When a pipeline runs, Source gems will read the latest available version of the data. If the schema of your data in Databricks changes, Prophecy will automatically use the new schema.