Creating and using Datalabs
Step 1: Determine which data you need
Check which datasets your specific use case requires and prepare the data accordingly.
Step 2: Upload your datasets
First, upload your prepared datasets to the Dataset Portal using one of these methods:
- Manual upload in the UI (files < 1GB)
- Data connectors (files ≥ 1GB)
- Python SDK (automated pipelines)
Ensure your data is properly formatted according to the dataset requirements shown in the dataset types documentation.
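The choice between the three upload methods above can be sketched as a small helper. This is only an illustration of the size rule, not part of any Decentriq tooling: the function name `suggest_upload_method` is hypothetical, and reading "1GB" as 2**30 bytes is an assumption (the portal may count decimal gigabytes instead).

```python
# Assumption: "1GB" is read as 2**30 bytes; switch to 10**9 if the portal
# counts decimal gigabytes.
ONE_GB = 1024 ** 3


def suggest_upload_method(size_bytes, automated=False):
    """Map a file size to one of the three upload methods listed above.

    Hypothetical helper for illustration only.
    """
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    if automated:
        # Automated pipelines use the Python SDK regardless of file size.
        return "Python SDK"
    if size_bytes < ONE_GB:
        return "Manual upload in the UI"
    return "Data connector"
```

In practice the size could come from `os.path.getsize(path)` before deciding how to upload a prepared file.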
Step 3: Create and configure your Datalab
Navigate to Datalabs: In the appropriate portal, go to Datalabs → Create Datalab
Configure the Datalab:
- Add a descriptive title
- Select the dataset types to connect (based on the use cases the Datalab needs to support)
- Choose the matching ID type (see supported types)

Connect Datasets:
If the Datalab has outdated data that needs to be refreshed, first deprovision the existing dataset.

Then provision new datasets:
- Click the green Provision dataset button
- Select Choose from my stored datasets
- Wait for validation to confirm that the file is formatted as expected (see detailed formatting guide)

Run validation: Click Validate Datalab (below the datasets) to start the validation process
Step 4: Validation process
The Datalab validation ensures that your datasets meet the required formatting standards, user IDs are consistent across all tables, and data quality requirements are met. Validation may take up to a few hours to complete, depending on your data size.
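Because validation can take hours, it may be worth checking user ID consistency locally before uploading. The sketch below is one possible pre-check and not the validation Decentriq actually runs. It assumes CSV tables that share an ID column named `user_id`, and it treats one table as the reference that every other table's IDs must appear in. The column name, the table names, and that interpretation of "consistent" are all assumptions.

```python
import csv
import io


def read_ids(csv_text, id_column="user_id"):
    """Collect the non-empty IDs found in one table's ID column.

    `id_column` is a hypothetical column name; adjust it to your schema.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    if reader.fieldnames is None or id_column not in reader.fieldnames:
        raise ValueError(f"missing column: {id_column}")
    ids = set()
    for row in reader:
        value = (row.get(id_column) or "").strip()
        if value:
            ids.add(value)
    return ids


def unmatched_ids(reference, tables):
    """Return, per table name, the IDs that are absent from the reference set."""
    report = {}
    for name, ids in tables.items():
        missing = ids - reference
        if missing:
            report[name] = missing
    return report


# Hypothetical sample tables, used only to show the check.
matching_csv = "user_id,matching_id\nu1,a@example.com\nu2,b@example.com\n"
segments_csv = "user_id,segment\nu1,sports\nu3,news\n"

report = unmatched_ids(read_ids(matching_csv), {"segments": read_ids(segments_csv)})
print(report)  # {'segments': {'u3'}}
```

An empty report means every ID in the other tables also appears in the reference table. It says nothing about the formatting or data quality checks that the Datalab validation also performs.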
Once all validations have completed successfully, the label turns green to indicate that the Datalab is ready to be provisioned to a Media DCR.

Validation report
The validation report is key to evaluating the consistency and quality of the onboarded data. It contains only aggregated statistics and no user-level personal information, and it highlights the strengths and weaknesses of your first-party data.

Publishers are strongly advised to review the Datalab dashboard with a Decentriq expert to confirm that they are ready for a Media DCR. Please contact Decentriq to book a session.
Step 5: Provisioning to Media DCRs
Once your Datalab is validated and shows a "Ready" status, you can provision it to Media DCRs. See the Media DCR Data Tab for detailed information about the provisioning process and monitoring results.
A single Datalab can be provisioned to multiple Media DCRs simultaneously, enabling efficient collaboration across different partnerships. To keep your Datalab current with fresh data, see Refreshing the base audience.
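The two rules in this step (provision only when the status is "Ready", and one Datalab may serve several Media DCRs) can be pictured with a toy model. `DatalabModel`, its fields, and the DCR identifiers are hypothetical and are not Decentriq SDK objects. Only the "Ready" status comes from the text above; the default status label is invented for the sketch.

```python
from dataclasses import dataclass, field


@dataclass
class DatalabModel:
    """Toy stand-in for a Datalab; not a Decentriq SDK class."""

    title: str
    status: str = "Not validated"  # hypothetical label for an unvalidated Datalab
    provisioned_to: set = field(default_factory=set)

    def provision_to(self, dcr_id):
        """Record a provisioning, allowed only once the Datalab is "Ready"."""
        if self.status != "Ready":
            raise RuntimeError("Datalab must be validated and Ready before provisioning")
        self.provisioned_to.add(dcr_id)


lab = DatalabModel(title="Publisher first-party data")
lab.status = "Ready"  # in the real product this follows a successful validation
lab.provision_to("media-dcr-a")  # hypothetical DCR identifiers
lab.provision_to("media-dcr-b")
print(sorted(lab.provisioned_to))  # ['media-dcr-a', 'media-dcr-b']
```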