decentriq_platform.legacy.data_science
Functions
provision_raw_dataset
def provision_raw_dataset(
data: _io.BytesIO,
*,
name: str,
session: decentriq_platform.session.Session,
key: decentriq_platform.storage.Key,
data_node: str,
data_room_id: str,
description: str = '',
secret_store_options: Optional[decentriq_platform.client.SecretStoreOptions] = None,
) ‑> str
Convenience function for uploading data to a raw leaf node in a Data Science Data Room.
Parameters:
data
: The input data to be uploaded. Use one of the reader functions provided in this package to read CSV-like data.name
: A descriptive name to assign to the dataset. This will help with finding the dataset when re-provisioning it to another Data Clean Room.session
: The session with which to communicate with the enclave.key
: A key for encrypting the data to-be-uploaded.data_node
: The name of the data node as seen in the Decentriq UI.data_room_id
: To which data room the dataset should be published. This is the id you get when publishing a data room.description
: An optional description of the dataset.
Returns: The manifest hash (dataset id) in case the upload succeeded.
provision_tabular_dataset
def provision_tabular_dataset(
data: _io.BytesIO,
*,
name: str,
session: decentriq_platform.session.Session,
key: decentriq_platform.storage.Key,
data_node: str,
data_room_id: str,
description: str = '',
secret_store_options: Optional[decentriq_platform.client.SecretStoreOptions] = None,
) ‑> str
Convenience function for uploading data to a tabular data node in a Data Science Data Room.
Parameters:
data
: The input data to be uploaded. Use one of the reader functions provided in this package to read CSV-like data.name
: A descriptive name to assign to the dataset. This will help with finding the dataset when re-provisioning it to another Data Clean Room.session
: The session with which to communicate with the enclave.key
: A key for encrypting the data to-be-uploaded.data_node
: The name of the data node as seen in the Decentriq UI.data_room_id
: To which data room the dataset should be published. This is the id you get when publishing a data room.description
: An optional description of the dataset.
Returns: The manifest hash (dataset id) in case the upload succeeded.