I have one data flow with multiple sources and joins, and I want to create a new column based on the distinct count of column1 in the current stream versus another source, for example comparing customerId in customers_tables vs. customerId in the invoice table. In SQL I do it like this:
(select count(DISTINCT customerID) from invoice) AS CustomersWithNoSales,
Is there a way to make this comparison using an Aggregate transformation in a data flow?
Thanks.
I had to add the other table as a new source and then join it with the existing stream. After that, I was able to use an Aggregate transformation with the countDistinct function on the two fields, one from each source.
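For anyone who wants to see what the join followed by the distinct counts works out to, here is a rough sketch in plain Python. It is not data flow script, only the same logic on a few made-up rows. The sample data, the left outer join type, the `cust_`/`inv_` column prefixes and the helper names are all my own assumptions for illustration. The helper skips nulls the way SQL's COUNT(DISTINCT ...) does; it is worth checking that countDistinct in the data flow treats nulls the same way.

```python
# Hypothetical sample rows standing in for the two sources.
customers = [{"customerId": 1}, {"customerId": 2}, {"customerId": 3}, {"customerId": 4}]
invoices = [
    {"invoiceId": 10, "customerId": 1},
    {"invoiceId": 11, "customerId": 1},
    {"invoiceId": 12, "customerId": 3},
]


def left_join(left, right, key):
    """Left outer join on `key`; unmatched left rows get None on the right side."""
    right_by_key = {}
    for row in right:
        right_by_key.setdefault(row[key], []).append(row)

    joined = []
    for left_row in left:
        matches = right_by_key.get(left_row[key])
        if matches:
            for right_row in matches:
                joined.append({"cust_" + key: left_row[key], "inv_" + key: right_row[key]})
        else:
            joined.append({"cust_" + key: left_row[key], "inv_" + key: None})
    return joined


def count_distinct(rows, column):
    """Count distinct non-null values of `column` (assumed null handling)."""
    return len({row[column] for row in rows if row[column] is not None})


# Step 1: join the customers stream with the invoice source.
joined = left_join(customers, invoices, "customerId")

# Step 2: aggregate with a distinct count on the field from each source.
total_customers = count_distinct(joined, "cust_customerId")
customers_with_invoices = count_distinct(joined, "inv_customerId")
customers_without_invoices = total_customers - customers_with_invoices

print(total_customers, customers_with_invoices, customers_without_invoices)
```

With these sample rows the two distinct counts differ because customers 2 and 4 have no invoice, so the difference between them gives the customers with no sales.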