scalaamazon-web-servicesamazon-deequ

Uniqueness check in Deequ


I'm currently exploring Deequ library and I'm trying to understand whether it's possible to check for the uniqueness of a combination of column.

This code

.hasUniqueness(Seq("col1", "col2"), Check.IsOne))

seems to calculate uniqueness for each column separately (correct if I'm wrong)

Thanks


Solution

  • I am one of the authors of Deequ. Your code snippet should calculate the uniqueness of the combined columns. If you feel that something is wrong with the result, then I would encourage you to open an issue at https://github.com/awslabs/deequ/issues and provide some sample code so that we can reproduce the error.

    Best, Sebastian