How to Create a Data Product
This guide walks you through creating a new data product: a curated dataset derived from existing data sources.
Prerequisites
Before creating a data product, ensure you have the following:
- Source connection – A connection to the database or warehouse containing the data you want to derive from. Learn how to set up a source connection.
- Destination connection – A connection to the database or warehouse where your data product will be stored. Learn how to set up a destination connection.
Step 1: Navigate to Search and Create a New Dataset
- From the left navigation menu, expand Catalog and click Search.
- Click the + Create dataset button in the top-right corner of the page (or in the center if no datasets exist yet).
Step 2: Enter Basic Info and Select Data Product Type
- In the Title field, enter a memorable name for your data product.
- Optionally, add a Description to provide context about the data product. Markdown is supported.
- Under Dataset type, select Data product.
- Optionally, expand Add metadata to attach documents, link datasets, define terms, assign ownership, or classify the data with topics for easier discovery.
- Under the Data section, click Connect data to define your source query.
Step 3: Define Your Source Query
- From the Destination connection dropdown, select the connection where your data product will be created.
- In the query editor, enter your SQL query to define the source data for your data product. This query can select from one or more existing datasets in the Catalog.
- Click Preview results to run the query interactively and verify the output.
- Once you are satisfied with the results, click Done to confirm your query.
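As an illustration of what a source query and its preview might look like, here is a minimal sketch. It makes several assumptions: the `customers` and `orders` tables, their columns, and the `preview` helper are all hypothetical, and an in-memory SQLite database stands in for your actual warehouse. The SQL dialect and table naming in your environment may differ.

```python
import sqlite3

# Hypothetical source datasets. In the product these would already exist
# in the Catalog; here an in-memory SQLite database stands in for them.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'EMEA'), (2, 'APAC');
    INSERT INTO orders VALUES (10, 1, 120.0), (11, 1, 80.0), (12, 2, 50.0);
""")

# Example source query: derives one curated table from two existing datasets.
SOURCE_QUERY = """
    SELECT c.region          AS region,
           COUNT(o.order_id) AS order_count,
           SUM(o.amount)     AS total_amount
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id
    GROUP BY c.region
"""

def preview(connection, query, limit=10):
    """Run the query with a row cap, similar in spirit to previewing results."""
    cursor = connection.execute(f"SELECT * FROM ({query}) LIMIT {int(limit)}")
    columns = [d[0] for d in cursor.description]
    return columns, cursor.fetchall()

columns, rows = preview(conn, SOURCE_QUERY)
print(columns)
for row in sorted(rows):
    print(row)
```

Giving every output column an explicit alias, as in this sketch, keeps the resulting column names predictable and makes the preview easier to check before you confirm the query.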
Step 4: Create the Data Product
- Review your configuration: confirm that the title, dataset type (Data product), destination connection, and query are correct.
- Click the Create dataset button in the lower-left corner.
- The system will begin processing your data product. Depending on the size of the data and the source and destination database types, this may take anywhere from several seconds to several hours.
Step 5: Verify and Use Your Data Product
- After creation, you will be redirected to the dataset view page.
- Wait for ingestion and finalization to complete. On the first run this may take 30 seconds or more, and considerably longer for large datasets.
- Once complete, a green checkmark will appear in the top-right corner, indicating the data product is ready.
- You can now interact with the data product like any other dataset—view the table, explore lineage, check health metrics, share with others, and more.
Step 6: View Data Lineage
- Click the Lineage tab to see how your data product connects to its source datasets.
- The lineage view is automatically populated based on your source query—no manual configuration required.
- Click any column to see its column-level lineage, which traces exactly which source columns flow into your data product.