Discovering relaxed functional dependencies based on multi-attribute dominance [extended abstract]
Caruccio L.;Deufemia V.;Naumann F.;Polese G.
2021-01-01
Abstract
To assess the quality of data, it is useful to extract properties and relationships among the data. However, exceptions and approximations need to be considered in real-world settings. To this end, relaxed functional dependencies (RFDs) are data dependencies that account for both exceptions and similarities in the data, but their discovery is an extremely complex problem, partly because similarity and validity thresholds must be specified. The RFD discovery algorithm presented in this paper exploits the concept of dominance to derive similarity thresholds automatically. The discovery performance and effectiveness of the proposed algorithm are assessed through a comparative evaluation with state-of-the-art approaches.
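To make the two kinds of thresholds concrete, the following is a minimal sketch of checking whether a single, already given RFD holds on a small table. It is not the discovery algorithm of the paper and does not use dominance. The function names, the absolute-difference distance, the sample rows, and the pair-based violation ratio used as a validity threshold are all assumptions made for illustration.

```python
from itertools import combinations


def abs_diff(a, b):
    # Hypothetical distance function for numeric attributes.
    return abs(a - b)


def violating_pairs(rows, lhs, rhs):
    """Return index pairs that are similar on the LHS but not on the RHS.

    lhs: dict mapping attribute name -> maximum allowed distance
    rhs: (attribute name, maximum allowed distance)
    """
    rhs_attr, rhs_max = rhs
    bad = []
    for i, j in combinations(range(len(rows)), 2):
        r, s = rows[i], rows[j]
        lhs_similar = all(abs_diff(r[a], s[a]) <= t for a, t in lhs.items())
        if lhs_similar and abs_diff(r[rhs_attr], s[rhs_attr]) > rhs_max:
            bad.append((i, j))
    return bad


def holds(rows, lhs, rhs, max_violation_ratio=0.0):
    # Assumed validity measure: share of tuple pairs that violate the RFD.
    n_pairs = len(rows) * (len(rows) - 1) // 2
    if n_pairs == 0:
        return True
    return len(violating_pairs(rows, lhs, rhs)) / n_pairs <= max_violation_ratio


# Made-up sample data, for illustration only.
rows = [
    {"age": 30, "salary": 1000},
    {"age": 31, "salary": 1050},
    {"age": 31, "salary": 2000},
    {"age": 50, "salary": 3000},
]

# Candidate RFD: age (distance <= 1) -> salary (distance <= 100)
strict = holds(rows, {"age": 1}, ("salary", 100))
tolerant = holds(rows, {"age": 1}, ("salary", 100), max_violation_ratio=0.4)
```

In this sketch the same candidate dependency is rejected when no exceptions are allowed and accepted when a share of violating pairs is tolerated, which shows why choosing the thresholds by hand is delicate and why deriving them automatically is attractive.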