Create an Asset from your Data Catalog
A regular synchronization takes place between the Platform and your local Data Catalog.
Any Assets in your Catalog that are tagged for inclusion in the Platform are automatically created and the associated Asset Data is copied onto the Platform so that it can be accessed via a Data Share.
Assets created from your Catalog are in a LIVE state on the Platform which means they are ready to be consumed or added to a Product.
You must first prepare the data in Your Data Catalog (Assets)
The Asset in your Data Catalog is automatically created in the Platform with the following default information:
Key Information | |
Name | Asset Name in English from the Catalog (mandatory) Max 100 char |
Source | The type of Connector specified in the Catalog |
Location | On Platform |
Asset Type | From the Catalog (mandatory) |
Created | Date and time of the first synchronization |
Created By | <service account> |
Released | Date and time of the first synchronization |
Released By | <service account> |
Display Name | Asset Name in English from the Catalog (mandatory) Asset Name in Arabic from the Catalog (optional) Max 200 char |
Description | Asset Description in English from the Catalog (mandatory) Asset Description in Arabic from the Catalog (optional) Max 1000 char |
Category Tags | None |
Metadata | |
Size | Automatically calculated, non editable |
Format | Automatically calculated, non editable |
Columns | Automatically calculated for tabular assets, non editable |
Rows | Automatically calculated for tabular assets, non editable |
Created | Date and time of the first synchronization, non editable |
Last data refresh | Date and time of when the Asset Data was last updated, non editable |
Custom Metadata | |
Data Classification | From the Catalog (mandatory) |
Tags | From the Catalog (optional) |
Data Quality Scores (dynamic) | From the Catalog (optional) |
Dictionary (for tabular assets) | |
Column Name | Automatically identified (in English), non editable |
Data Type | The data type held in the Databricks repository Automatically identified, non editable |
Tags | Entered directly in the Databricks repository, non editable |
Masking | Entered directly in the Databricks repository, non editable |
Column Description | Column Description in English from the Catalog (optional) Column Description in Arabic from the Catalog (optional) |
Sample | |
By default, a sample of up to 10 random rows is displayed for tabular assets. Sample data can help a Data Consumer decide if the Asset is right for them. It is possible toEdit an Asset and make the sample not visible to users if required. | |
Given to all registered users who are Contacts associated with the specific Asset in the Data Catalog. There must be at least one contact for the Asset for it to be created. | |
Share |
|
Text from the Catalog is truncated if it exceeds the number of characters specified on the Asset Overview screen for each relevant field. An '…' is displayed to indicate if there was a truncation.
The status of the Asset is LIVE.
If this Asset is deleted it is recreated when the synchronization happens
If the default Asset information is updated any changes are overwritten when the next synchronization happens
Next Steps
Your Data Catalog (Assets)
What is Data Classification ?
A ADGovClassification
property reflects the highest applicable classification label associated with an Asset.
All Assets must have a Classification before being made available in the Platform.
The permitted Classifications and associated weightings are as follows:
Open (weight: 1)
Confidential (weight: 2)
Sensitive (weight: 3)
Secret (weight: 4)
What are Dynamic Data Quality Scores?
The Data Quality Scores associated with an Asset are dynamic and can include the following:
Uniqueness
Consistency
Completeness
Accuracy
BlankValues
These are used to calculate the Dynamic Data Quality Scores for any Product that contains the Asset.