Create an Asset using a Connector
Connectors allow you to access data in a secure location that requires specific credentials to access it.
Note: You must have the necessary Roles associated with your User Profile
There are 2 options to initiate this process:
Go to My Collection on the Navigation bar
Click on the +New button at the top left
Select Asset
Or
Go to MANAGE on the Navigation bar
Select Assets
There are 4 steps involved with creating an Asset
Note: To create an Asset you must first Create a Connector
1. Start Setup
Enter Name - this can be a work-in-progress name for your Asset which can be changed at any time.
Select the Type of Asset to be created
Table
Files
Select the Source of data for your Asset
Select a Connector to access your Source of data, from a list of pre-defined Connectors
Provide additional details required to locate the exact location of your data. This may vary depending on the Source of your Connector.
Specify whether the path is to a single file or folder containing files and folders
Once this step is complete the Status of the Asset is set to IN SETUP
2. Check Data Access
This step makes sure the Source of your data can be accessed by the platform.
If this step fails then check that your Connector has been setup correctly.
3. Check Data Format
This step determines the structure of your tabular data.
Where the data is a CSV file then a schema review is required before setup can be completed. This is to allow any adjustments to the schema that has been discovered. This could be adjustments to table properties or data types. Often, no adjustment is necessary. Once this has been reviewed and the correct structure is available, the schema must be saved.
4. Finish Setup
The data held at your Source location is copied onto the Platform using the credentials specified in the Connector.
If the Type of Asset is Table then the following information is auto-generated for your Asset:
Metadata | |
---|---|
Size | The size the data in kb |
Format | Table or Files |
Columns | The number of columns |
Rows | The number of rows |
Created | The date and time (UTC) the Asset was created |
Last data refresh | Date of Asset Creation |
Data Dictionary (for Tabular Assets) | |
Column Name | Automatically identified |
Data Type | Automatically identified |
Tags | Entered directly in the Databricks repository, non editable |
Masking | Entered directly in the Databricks repository, non editable |
Column Description | Can be entered |
Sample | |
By default, a sample of up to 10 random rows is displayed for tabular assets. Sample data can help a Data Consumer decide if the Asset is right for them. It is possible toEdit an Asset and make the sample not visible to users if required. |
If the Type of Asset is Files then the following information is auto-generated for your Asset:
Metadata | |
---|---|
Size | The size the data in kb |
Created | Date of Asset Creation |
Last data refresh | Date of Asset Creation |
The Status of the Asset is updated from IN SETUP to DRAFT
Enter a Display Name for the Asset that will be visible to users browsing the Exchange if the Asset has been added to a Product.
Enter a Description for the Asset
Add pre-defined Categories and Tags to make them easier to find in My Collection