If you have checked a record for fuzzy duplicates, you can view the fuzzy duplicate check history for that record. The history shows all fuzzy duplicate checks that are done for the record.
You can clean up the logged fuzzy duplicate check history manually or in recurring mode.
For example, you want to keep fuzzy duplicate check history records for one month. Each week, you can do a cleanup, deleting history records older than one month.
The steps of this topic explain how to clean up the fuzzy duplicate check history.
1. | Go to Data quality studio > Periodic tasks > Clean-up duplicate check history. |
2. | Define the number of days for which you want to keep the fuzzy duplicate check history records. All history records that are older than the defined number of days, are deleted. |
  | In the Retention days field, enter a number. |
3. | Sub-task: Set up batch processing. |
3.1 | Expand the Run in the background section and fill in the fields as desired. |
  |
Note: The fuzzy duplicate check history cleanup always runs in batch. |
3.2 | Usually, you clean up the fuzzy duplicate check history in recurring mode. |
  | Click Recurrence and fill in the fields as desired. |
3.3 | Click OK. |
4. | Click OK. |
Related to | Notes |
---|---|
Monitor and clean up data quality execution history |
  |