Configuring Data Profiling and DQ Scores

You can configure data quality (DQ) score options and data profiling parameters.

Configuring data profiling parameters involves specifying:

  • Whether data profiling analyzes character data for maximum and minimum values
  • The number of most frequent patterns to display
  • The number of least frequent patterns to display

To configure data profiling parameters, follow these steps:

  1. Go to Application Menu > Miscellaneous > Settings > Metadata Manager.
  2. Click the Data Quality tab and then click the Settings tab.
  3. The following page appears.

  4. Click .
  5. Use the following options:
    Analyze character data for Max/Min
    This option specifies whether data profiling analyzes character data for maximum and minimum values. Switch Analyze character data for Max/Min to ON to include character data in the maximum and minimum analysis.
    Most Frequent Patterns
    This option specifies how many of the most frequent patterns are displayed in the Data Profiling Pattern Summary report. To set this number, type it in the Most Frequent Patterns box.
    For example, if you type 3 in the box, the three most frequent patterns are displayed in the report.
    Least Frequent Patterns
    This option specifies how many of the least frequent patterns are displayed in the Data Profiling Pattern Summary report. To set this number, type it in the Least Frequent Patterns box.
    For example, if you type 3 in the box, the three least frequent patterns are displayed in the report.
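
The effect of these three options can be illustrated with a small sketch. This is not the product's implementation: the function names, the parameter names, and the pattern rule (each digit becomes 9, each letter becomes A) are assumptions made only for illustration.

```python
from collections import Counter


def to_pattern(value: str) -> str:
    """Hypothetical pattern rule: digits -> '9', letters -> 'A', others kept."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A")
        else:
            out.append(ch)
    return "".join(out)


def profile_column(values, analyze_char_max_min=True,
                   most_frequent=3, least_frequent=3):
    """Summarize a character column the way the three settings describe."""
    counts = Counter(to_pattern(v) for v in values)
    ranked = counts.most_common()  # (pattern, count), highest count first
    result = {
        # Top N patterns by count.
        "most_frequent": ranked[:most_frequent],
        # Bottom N patterns by count (stable sort keeps tie order).
        "least_frequent": sorted(ranked, key=lambda kv: kv[1])[:least_frequent],
    }
    if analyze_char_max_min:
        # For character data, max/min here means plain string comparison.
        result["max"] = max(values)
        result["min"] = min(values)
    return result


sample = ["AB-123", "CD-456", "EF-789", "12345", "67890", "X1", "hello"]
summary = profile_column(sample, most_frequent=2, least_frequent=2)
print(summary)
```

With most_frequent set to 2, only the two most common patterns in the sample are returned, which mirrors typing 2 in the Most Frequent Patterns box. Setting analyze_char_max_min to False leaves the maximum and minimum out of the summary.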

To configure a DQ score option, follow these steps:

  1. Under the DQ Scores section, click .
  2. The DQ Score Options page appears.

  3. Click .
  4. A new row is added in the DQ Score Options grid.

  5. Double-click the cell under the Key column to enter the key.
  6. Double-click the cell under the Value column to enter the value.
  7. Note: Turn Publish to OFF to remove the DQ score option from the DQ Scores list.

  8. Click .
  9. The DQ score option is added to the DQ Scores list.
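
As a rough mental model of the grid, each DQ score option can be thought of as a key and value pair with a Publish switch, where only published options show up in the DQ Scores list. The class name, field names, and sample keys and values below are hypothetical and do not reflect the product's internal data model.

```python
from dataclasses import dataclass


@dataclass
class DQScoreOption:
    """One row of the DQ Score Options grid (hypothetical model)."""
    key: str
    value: str
    publish: bool = True  # ON by default in this sketch (an assumption)


def dq_scores_list(options):
    """Return only the options whose Publish switch is ON."""
    return [opt for opt in options if opt.publish]


# Placeholder keys and values, for illustration only.
options = [
    DQScoreOption(key="K1", value="V1"),
    DQScoreOption(key="K2", value="V2", publish=False),
]
print([opt.key for opt in dq_scores_list(options)])
```

Turning Publish to OFF for a row corresponds to setting publish=False here, which removes that option from the resulting list without deleting the row itself.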

You can schedule a data profiling job and assess data quality in Metadata Manager. For more information on profiling data, refer to the Profiling Data at Table Level topic.