This guide covers the minimum steps needed to get connected with the Data Experience.
The required plugins all run by default, so they only need to be configured. More detailed information on the Data Experience and each of the integrations can be found in the sidebar.
Navigate to Config Manager > Data Experience
1. Expand the keys on the sidebar dataExperience > registry > integrations > snowflake
1. Under the sources key, select the Option 2 tab and add a new item to the list
   - Enter SNOWFLAKE_JWT in the authenticator field
   - Enter the username of the user the public key was assigned to
   - Enter the privateKey
   - Enter the warehouse that should be used for executing queries. If omitted, the user's default warehouse will be used.
   - Enter the role that should be used for executing queries. If omitted, the user's default role will be used.
Scroll to the bottom of the page and click the Save changes button
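As a rough illustration of what the Snowflake steps above produce, the sketch below models one source entry as a Python dictionary and checks it before you type it into Config Manager. The key path and the `authenticator`, `privateKey`, `warehouse`, and `role` names come from the steps; the `username` key name, the placeholder values, and the `check_snowflake_source` helper are assumptions for illustration only. Your deployment may require further fields that are not covered here.

```python
# Fields the steps above ask you to fill in. The "username" key name is an assumption.
REQUIRED_FIELDS = ("authenticator", "username", "privateKey")


def check_snowflake_source(source: dict) -> list[str]:
    """Return a list of problems found in one source entry (empty if none)."""
    problems = []
    for field in REQUIRED_FIELDS:
        if not source.get(field):
            problems.append(f"missing required field: {field}")
    authenticator = source.get("authenticator")
    if authenticator and authenticator != "SNOWFLAKE_JWT":
        problems.append("authenticator should be SNOWFLAKE_JWT")
    return problems


example_source = {
    "authenticator": "SNOWFLAKE_JWT",
    "username": "PORTAL_INGEST",  # placeholder user name
    "privateKey": "<contents of the private key>",  # placeholder, never commit real keys
    # "warehouse" and "role" are omitted here, so the user's defaults would apply
}

# Nesting that mirrors the sidebar path dataExperience > registry > integrations > snowflake
config = {
    "dataExperience": {
        "registry": {"integrations": {"snowflake": {"sources": [example_source]}}}
    }
}

print(check_snowflake_source(example_source))
```

An empty list means the entry has every field the steps mention; it does not confirm that Snowflake will accept the key.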
Create a Service Principal in Databricks. This can be at either the account or workspace level.
This service principal must be granted the USE CATALOG and USE SCHEMA privileges on each catalog and schema you wish to ingest datasets from. See the Databricks API documentation for more details.
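If you manage privileges with SQL, a small helper like the one below can generate the grant statements for the two privileges named above. This is a sketch only: the `grant_statements` function, the catalog and schema names, and the principal value are hypothetical, and it assumes the service principal is referenced by its application ID. Review the output against your Databricks setup before running anything.

```python
def grant_statements(principal: str, schemas_by_catalog: dict[str, list[str]]) -> list[str]:
    """Build GRANT statements for USE CATALOG and USE SCHEMA.

    `principal` is assumed to be the service principal's application ID.
    """
    statements = []
    for catalog, schemas in schemas_by_catalog.items():
        statements.append(f"GRANT USE CATALOG ON CATALOG `{catalog}` TO `{principal}`;")
        for schema in schemas:
            statements.append(
                f"GRANT USE SCHEMA ON SCHEMA `{catalog}`.`{schema}` TO `{principal}`;"
            )
    return statements


# Placeholder names, replace with your own catalogs and schemas.
for statement in grant_statements("my-sp-application-id", {"main": ["sales", "finance"]}):
    print(statement)
```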
Generate OAuth credentials for the service principal, and take note of the client ID and secret. Be mindful of the lifetime set for these credentials and remember to rotate the secrets before they expire to prevent ingestion failures.
Back in Portal, navigate to Config Manager > Data Experience > Databricks
For each workspace you’d like to ingest datasets from, add a new source with the workspace URL and service principal credentials. The integration will discover all catalogs, schemas, and tables which the service principal has access to.
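Before adding a source, it can help to confirm that the client ID and secret actually work. The sketch below builds (but does not send) a client-credentials token request against a workspace, assuming the Databricks OAuth machine-to-machine token endpoint at `/oidc/v1/token` with the `all-apis` scope. The `build_token_request` helper and the example URL and credentials are hypothetical; this is not how the integration itself authenticates, only a way to test credentials by hand.

```python
import base64
import urllib.parse
import urllib.request


def build_token_request(workspace_url: str, client_id: str, client_secret: str) -> urllib.request.Request:
    """Build a POST request for an OAuth client-credentials token.

    Assumes the workspace-level token endpoint is <workspace_url>/oidc/v1/token.
    """
    token_url = workspace_url.rstrip("/") + "/oidc/v1/token"
    body = urllib.parse.urlencode(
        {"grant_type": "client_credentials", "scope": "all-apis"}
    ).encode()
    # The client ID and secret are sent with HTTP Basic authentication.
    basic = base64.b64encode(f"{client_id}:{client_secret}".encode()).decode()
    return urllib.request.Request(
        token_url,
        data=body,
        headers={
            "Authorization": f"Basic {basic}",
            "Content-Type": "application/x-www-form-urlencoded",
        },
        method="POST",
    )


# Placeholder values, replace with your workspace URL and service principal credentials.
request = build_token_request("https://example.cloud.databricks.com/", "id", "secret")
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` should return a JSON body containing an access token when the credentials are valid; an HTTP error at this point usually means the secret is wrong or has expired.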
Configure how often datasets are ingested from your sources to the data registry.
From Config Manager > Data Experience, expand the keys on the sidebar dataExperience > registry > integrations > defaults > schedule > frequency > cron
Enter a valid crontab string. The default is 0 */6 * * * (every 6 hours).
Scroll to the bottom of the page and click the Save changes button
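To sanity-check a schedule before saving it, the sketch below expands the minute and hour fields of a crontab string and lists the times it would fire each day. It is a deliberately small illustration and not the scheduler Portal uses: it only understands `*`, `*/n`, plain numbers, and comma lists, it rejects schedules that restrict the day or month, and it says nothing about which time zone the schedule runs in. The function names are hypothetical.

```python
def expand_field(field: str, low: int, high: int) -> list[int]:
    """Expand a cron field limited to `*`, `*/n`, `n`, and comma lists of those."""
    values = set()
    for part in field.split(","):
        if part == "*":
            values.update(range(low, high + 1))
        elif part.startswith("*/"):
            values.update(range(low, high + 1, int(part[2:])))
        else:
            values.add(int(part))
    return sorted(values)


def daily_fire_times(cron: str) -> list[str]:
    """List the HH:MM times at which a daily cron schedule fires."""
    minute, hour, day_of_month, month, day_of_week = cron.split()
    if (day_of_month, month, day_of_week) != ("*", "*", "*"):
        raise ValueError("this sketch only handles schedules that run every day")
    return [
        f"{h:02d}:{m:02d}"
        for h in expand_field(hour, 0, 23)
        for m in expand_field(minute, 0, 59)
    ]


print(daily_fire_times("0 */6 * * *"))  # ['00:00', '06:00', '12:00', '18:00']
```

With the default value, datasets would be ingested four times a day, at minute 0 of every sixth hour.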
Wait for the first sync to complete. When this happens depends on how you've configured your schedules in steps 3 and 4. You can monitor progress on the Data Overview page, accessible from Portal's navigation.