decentriq_platform.legacy.data_science

Functions

provision_raw_dataset

def provision_raw_dataset(
data: _io.BytesIO,
*,
name: str,
session: decentriq_platform.session.Session,
key: decentriq_platform.storage.Key,
data_node: str,
data_room_id: str,
description: str = '',
secret_store_options: Optional[decentriq_platform.client.SecretStoreOptions] = None,
) -> str

Convenience function for uploading data to a raw leaf node in a Data Science Data Room.

Parameters:

  • data: The input data to be uploaded. Use one of the reader functions provided in this package to read CSV-like data.
  • name: A descriptive name to assign to the dataset. This will help with finding the dataset when re-provisioning it to another Data Clean Room.
  • session: The session with which to communicate with the enclave.
  • key: A key for encrypting the data to be uploaded.
  • data_node: The name of the data node as seen in the Decentriq UI.
  • data_room_id: The ID of the data room to which the dataset should be published. This is the ID you get when publishing a data room.
  • description: An optional description of the dataset.

Returns: The manifest hash (dataset ID) if the upload succeeded.
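
The keyword-only call shape can be sketched as follows. This is a minimal illustration, not part of the package: `upload_raw_file` is a hypothetical helper, and the provisioning function is passed in as an argument so the snippet runs without the SDK installed. How `session` and `key` are created is not covered on this page and is left to the caller.

```python
import io
from typing import Any, Callable


def upload_raw_file(
    provision: Callable[..., str],
    payload: bytes,
    *,
    name: str,
    session: Any,
    key: Any,
    data_node: str,
    data_room_id: str,
    description: str = "",
) -> str:
    """Hypothetical helper: wrap raw bytes in a BytesIO and forward them.

    `provision` is expected to accept the same arguments as
    provision_raw_dataset (one positional `data`, the rest keyword-only).
    """
    data = io.BytesIO(payload)  # the documented type of `data`
    return provision(
        data,
        name=name,
        session=session,
        key=key,
        data_node=data_node,
        data_room_id=data_room_id,
        description=description,
    )


# Intended use, assuming decentriq_platform is installed and that `session`
# and `key` already exist (names and values below are placeholders):
#
#   from decentriq_platform.legacy.data_science import provision_raw_dataset
#   dataset_id = upload_raw_file(
#       provision_raw_dataset,
#       b"raw file contents",
#       name="My raw dataset",
#       session=session,
#       key=key,
#       data_node="my_raw_node",
#       data_room_id=data_room_id,
#   )
```

The returned string is the manifest hash described above, so it may be worth storing it alongside the dataset name for later lookup.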

provision_tabular_dataset

def provision_tabular_dataset(
data: _io.BytesIO,
*,
name: str,
session: decentriq_platform.session.Session,
key: decentriq_platform.storage.Key,
data_node: str,
data_room_id: str,
description: str = '',
secret_store_options: Optional[decentriq_platform.client.SecretStoreOptions] = None,
) -> str

Convenience function for uploading data to a tabular data node in a Data Science Data Room.

Parameters:

  • data: The input data to be uploaded. Use one of the reader functions provided in this package to read CSV-like data.
  • name: A descriptive name to assign to the dataset. This will help with finding the dataset when re-provisioning it to another Data Clean Room.
  • session: The session with which to communicate with the enclave.
  • key: A key for encrypting the data to be uploaded.
  • data_node: The name of the data node as seen in the Decentriq UI.
  • data_room_id: The ID of the data room to which the dataset should be published. This is the ID you get when publishing a data room.
  • description: An optional description of the dataset.

Returns: The manifest hash (dataset ID) if the upload succeeded.
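
Because provision_raw_dataset and provision_tabular_dataset share the same signature, calling code can choose between them by node kind. The sketch below illustrates one way to do that. `provision_by_node_kind` and the `"raw"` / `"tabular"` labels are hypothetical names introduced here, and the functions are supplied by the caller so the snippet runs without the SDK installed.

```python
import io
from typing import Any, Callable, Dict

ProvisionFn = Callable[..., str]


def provision_by_node_kind(
    node_kind: str,
    provisioners: Dict[str, ProvisionFn],
    data: io.BytesIO,
    **kwargs: Any,
) -> str:
    """Hypothetical helper: call the provisioning function for a node kind.

    `kwargs` are forwarded unchanged, so they should be the keyword-only
    arguments documented above (name, session, key, data_node, ...).
    """
    try:
        provision = provisioners[node_kind]
    except KeyError:
        raise ValueError(f"unknown node kind: {node_kind!r}") from None
    return provision(data, **kwargs)


# Intended use, assuming decentriq_platform is installed:
#
#   from decentriq_platform.legacy.data_science import (
#       provision_raw_dataset,
#       provision_tabular_dataset,
#   )
#   provisioners = {
#       "raw": provision_raw_dataset,
#       "tabular": provision_tabular_dataset,
#   }
#   dataset_id = provision_by_node_kind(
#       "tabular", provisioners, data, name=..., session=..., key=...,
#       data_node=..., data_room_id=...,
#   )
```

For tabular nodes, `data` should still come from one of the package's reader functions, as noted in the parameter list.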