AWS Glue Data Quality

Data quality describes how well a dataset serves its specific purpose, in that way data consumers can trust the data and leverage it to improve decision-making, leading to the development of new business strategies or optimization of existing ones.

AWS Glue Data Quality is a service managed by Amazon which allows users to measure and monitor the quality of data using a domain specific language called Quality Definition Language (QDL) to define a set of rules that the data must abide by.

Data quality can be done with AWS Glue in two ways: using AWS Glue Data Catalog or AWS Glue ETL jobs.

AWS Glue Data Catalog: with this service you can define your own rules or let AWS Glue analyze your data and recommend you ones, this service does not offer autoscaling so the workloads should be examined beforehand.

AWS Glue ETL jobs: with this service you can define your own rules, you do not get rule recommendations though, this service has autoscaling so the workloads could scale easily.

Source: docs.aws.amazon.com
AWS Glue Data Quality