Skip to main content

Connect Databricks Unity Catalog

Required Inputs

Prepare:

Databricks workspace URL.
Databricks account ID, if account-level APIs are used.
Authentication method approved by your Databricks administrator.
Catalogs, schemas, and workspaces in scope.
Decision on whether notebook, job, and pipeline metadata can be included.

Scope The First Sync

Start with a narrow scope:

One catalog.
One or two schemas.
A limited set of jobs or pipelines.
One destination Purview collection.

Expand after the first validation succeeds.

Metadata Collected

Depending on configuration and permissions, Lineage Advisor may collect:

Catalog, schema, table, view, and column names.
Unity Catalog lineage relationships where available.
Job, task, notebook, and pipeline identifiers.
Runtime and operational metadata needed to explain lineage.
Owners, tags, descriptions, and classifications where available and enabled.

Lineage Advisor does not need table data values to publish lineage metadata.

Validation Checks

The connector validates:

Workspace endpoint connectivity.
Authentication.
Catalog and schema visibility.
Lineage table or API availability.
Minimum metadata read permissions.

Operational Guidance

Use a dedicated service principal or token.
Avoid personal access tokens for production.
Rotate secrets under your enterprise policy.
Keep initial catalog scope small until lineage quality is verified.

Required Inputs
Scope The First Sync
Metadata Collected
Validation Checks
Operational Guidance