You can create multiple data profiles and target the profilers to run for the selected assets. This would generate the data profile score for the selected assets and helps identify issues related to completeness and validity of data. Multiple data profiles can be created for a single data connection.You must set up a successful data connection before creating a data profile.
- On the main navigation menu, click and select + Create Profile.
In the Source page:
- Select the datasets and fields that you want to profile and click Next. Default analysis rule is used by default to profile the selected datasets.
- In the Profile page:
- Default analysis rule is used by default to profile the selected datasets. Retain the configuration and click Next.
- In the Finalize page:
- Enter the Name and Description of the profile.
- Enable the scheduler and select the frequency at which the profile should execute.
- Click Save to create the profile. Optionally, you can click Save & Run to create a profile and run the profiling instantly.
- The profile is saved, and you are redirected to the Profilers page.
Once the profile is created, you can search for a specific data profile. You can search data profiles with a complete or partial name. The results will be displayed based on the entered keyword in the search bar.
You can view the profiling results for these assets from the Profiling page.
Duplicate profile
You can create a copy of an existing data profile, change profiling details, and save it.
- On the main navigation menu, click tab and click the ellipsis icon corresponding to the profile that you want to create and click Duplicate.
- Select the datasets and fields that you want to profile and click Next. Default analysis rule is used by default to profile the selected datasets.
- Click Next.
- In the final step, you can schedule the profile to run at the scheduled interval.
- Click Save. The profile is saved, and you are redirected to the Profilers page.
- You can click Save & Run to save run the profiling instantly.
Edit profile
You can edit an existing data profile to make changes or edit a data profile after duplicating it.
To edit an existing data profile:
- On the main navigation menu, click tab and click the ellipsis icon corresponding to the profile that you want to edit and click Edit.
- Select the datasets and fields that you want to profile and click Next. Default analysis rule is used by default to profile the selected datasets.
- Click Next.
- To finalize the profile, you can schedule the profile.
- Click Save. The profile is saved, and you are redirected to the Data Profiles page.
- You can click Save & Run to save and instantly run the profiling.
Delete profile
- On the main navigation menu, click tab and click the ellipsis icon corresponding to the profile that you want to delete and click Delete.
- Click Yes to confirm.
- The profile is deleted.
Sort Profiles
You may need to locate the data profile from a long list of existing profiles. You can quickly locate a data profile by sorting the data profiles.The data profiles are sorted by Last Run as default.
Data Profile
The data profiles can be sorted in alphabetical order of their name.
To sort by data profile name:
- On the main navigation menu, click tab.
- Click the Data Profile column.The profiles are sorted in alphabetically ascending order.
- You can click the column again to change the order of sorting.
Completeness
You can sort data profiles by the completeness of data. Completeness is the percentage of complete and incomplete rows detected in your profiled data.
- On the main navigation menu, click .
- Click the Completeness column. On clicking the column first time, the data profiles are sorted from lower value to higher value of completeness.
- You can click the column again to change the order of sorting.
Last Run
The last run on sorts data profiles based on the time when the profiling was run.The data profiles are sorted by default in descending order—the profile that is run recently is shown at the top.
- On the main navigation menu, click tab.
- Click the Last Run column. On clicking the column first time, the data profiles are sorted in the ascending order.
- You can click the column again to change the order of sorting.
Run Profile
Once the connection is set up and a data profile is created, you can run profiling on the data. Profiling shows the completeness in terms of complete and incomplete tables in the data.
- On the main navigation menu, click .
- Click the ellipisis icon corresponding to the data profile that you want to run.
- Click Run Profile. The profile runs and displays a confirmation message after a successful run.
- After a successful run, click the data profile name to view profiling results. In case profiling fails, an error icon is displayed next to the data profile name. Hover over the icon to view error log details.
Stop running profile
- On the main navigation menu, click .
- Click the ellipsis icon corresponding to the running profile that you want to stop.
- Click Stop Profile.
- The running profile stops before profiling is completed.