Skip to main content

Data Experience Quick Start

This guide contains the minimal steps you'll need to take to get connected with the Data Experience. The plugins needed are all running by default so they just have to be configured.

More detailed information can be found for the Data Experience and each of the integrations in the sidebar.

What you'll achieve

  • Ingest data warehouse tables as APIs into Portal's Software Catalog
  • Make datasets searchable alongside your software components
  • Provide ownership and lifecycle information for data governance

Prerequisites

  • Admin access to Portal's Config Manager
  • Access to create credentials in your preferred integration(s)

Step 1: Configure Authentication

  1. Create a GCP service account with the following roles:
    • roles/bigquery.dataViewer
    • roles/bigquery.jobUser
  2. Download the JSON credentials file
  3. Navigate to Config Manager > Data Experience
  4. Expand the keys on the sidebar dataExperience > registry > integrations > bigquery
  5. Add an item to the sources list:
    • Enter your GCP project ID
    • Paste the service account JSON into the credentials field
  6. Scroll to the bottom of the page and click the Save changes button

(Optional) Step 2: Configure Registry Ingest Schedule

Configure how often datasets are ingested from your sources to the data registry.

  1. From Config Manager > Data Experience, expand the keys on the sidebar dataExperience > registry > integrations > defaults > schedule > frequency > cron
  2. Enter a valid crontab string. This is 0 */6 * * * (every 6 hours) by default
  3. Scroll to the bottom of the page and click the Save changes button

Step 3: Test & Verify

  1. Wait for the first sync to complete - when this happens will depend on how you've configured your schedules in steps 3 and 4. You can monitor progress by visiting the Data Overview page, accessibile from Portal's navigation.
  2. Search for your datasets in Portal's search

Next Steps

Troubleshooting