Abstract
A computer-implemented method for detecting a set of inconsistent data records in a database including multiple records comprises: selecting a data quality rule representing a functional dependency for the database; transforming the data quality rule into at least one rule vector with hashed components; selecting a set of attributes of the database; transforming at least one record of the database, selected on the basis of the selected attributes, into a record vector with hashed components; and computing a dot product of the rule and record vectors to generate a measure representing violation of the data quality rule by the record.
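
The abstract does not say how the vectors are constructed, so the following is only a minimal sketch of one way the steps could fit together, not the patented construction. It assumes a functional dependency `lhs -> rhs`, takes the first record seen for each left-hand-side value as the reference for the rule vector, and treats a nonzero dot product as a violation. The names `h`, `record_vector`, `rule_vector`, `violation_measure`, and `find_inconsistent` are hypothetical, as is the choice of SHA-256 for hashing.

```python
import hashlib


def h(value: str) -> int:
    """Hash a value to a deterministic positive integer (assumed scheme)."""
    digest = hashlib.sha256(value.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") + 1  # +1 keeps the component nonzero


def record_vector(record, lhs, rhs):
    """Project a record onto the selected attributes and hash each side."""
    lhs_key = "\x1f".join(str(record[a]) for a in lhs)
    return (h(lhs_key), h(str(record[rhs])))


def rule_vector(reference, lhs, rhs):
    """Build a rule vector from a reference record (assumption of this sketch).

    For a reference with hashed sides (a, b) the rule vector is (b, -a), so its
    dot product with any record vector (a, b') equals a * (b - b').
    """
    a, b = record_vector(reference, lhs, rhs)
    return (b, -a)


def violation_measure(rule, rec):
    """Dot product of rule and record vectors; zero means consistent."""
    return sum(r * v for r, v in zip(rule, rec))


def find_inconsistent(records, lhs, rhs):
    """Return indices of records that disagree with the first record sharing their lhs."""
    rules = {}  # hashed lhs value -> rule vector from the first record seen
    flagged = []
    for i, rec in enumerate(records):
        vec = record_vector(rec, lhs, rhs)
        rule = rules.setdefault(vec[0], rule_vector(rec, lhs, rhs))
        if violation_measure(rule, vec) != 0:
            flagged.append(i)
    return flagged


records = [
    {"zip": "10001", "city": "New York"},
    {"zip": "10001", "city": "Newark"},
    {"zip": "60601", "city": "Chicago"},
    {"zip": "10001", "city": "New York"},
]
flagged = find_inconsistent(records, lhs=["zip"], rhs="city")
```

In this sketch the second record is flagged because it shares a `zip` with the first record but has a different `city`. The measure only says that two records disagree under the rule; deciding which of them is actually wrong would need further evidence.
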
| Original language | English |
|---|---|
| Patent number | US2013226879 |
| IPC | G06F 17/30 A I |
| Priority date | 28 Feb 2012 |
| Publication status | Published - 29 Aug 2013 |