Values in set percent
In this example - check the data of bigquery-public-data.austin_crime.crime
using
values_in_set_percent
check.
The goal is to set up a validity check on district column in order to check how many percent of given values are in
set. In this case, check how many percent of data are in ['UK','A','B','C', D']
districts.
Adding connection
GCP
Download and install Google Cloud CLI. After installing Google Cloud CLI, log in to your GCP account (you can start one for free), by running:
After setting up the GCP account, create a GCP project. That will be the GCP billing project used to run SQL sensors on the public datasets provided by Google.
The examples are using a name of the GCP billing project, received as an environment variable GCP_PROJECT. Set and export this variable before starting DQO shell.
Navigate to the example directory and run the check
After starting the example, run the following commands in the DQO shell:
This command will let up login or sign up for the cloud.dqo.ai account. The data quality checks will be executed. The result files will be pushed to cloud.dqo.aiNow, you can open the browser and navigate to https://cloud.dqo.ai/ and review the sensor results on the dashboards.