I have a database which contains erroneous data. When corrected data comes in, I want to replace the original data, preferably in place. Would stack + dedupe work? Does Dedupe always keep the first instance, or does it pick randomly?
I always want to use the new corrected data.
I was going to subtract the rows to be updated from the incorrect data, then add the corrected rows back, but there isn't a unique identifying column. Each data value is identified by 3 separate columns.
Dedupe keeps the first occurrence of each row and drops the later duplicates; it does not pick randomly.
Can you post a simple example of the data you have and the data you want?
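For what it's worth, here is a rough sketch of the stack + dedupe idea in plain Python rather than in the tool itself. It assumes the dedupe is applied to the three identifying columns only (not the whole row) and that the corrected rows are stacked on top, so that "keep the first" means "keep the correction". The column names `site`, `year` and `item` and the function `stack_and_dedupe` are made up for illustration.

```python
def stack_and_dedupe(corrected, original, key_cols):
    """Stack corrected rows above original rows, then keep the first row per key."""
    seen = set()
    result = []
    for row in corrected + original:  # corrected rows come first, so they win
        key = tuple(row[c] for c in key_cols)
        if key not in seen:
            seen.add(key)
            result.append(row)
    return result


# Hypothetical sample data: three identifying columns plus one value column.
original = [
    {"site": "A", "year": 2020, "item": "x", "value": 10},
    {"site": "A", "year": 2020, "item": "y", "value": 99},  # erroneous
    {"site": "B", "year": 2021, "item": "x", "value": 30},
]
corrected = [
    {"site": "A", "year": 2020, "item": "y", "value": 20},
]

merged = stack_and_dedupe(corrected, original, ["site", "year", "item"])
```

Note that this does not replace the row in place: the corrected row ends up at the top of the result, so a sort would be needed afterwards if row order matters.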
I managed to find a solution. The problem was that there wasn't a single identifying column for subtract to use, so I used the concat cols transform to combine the 3 columns into one.
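In case it helps anyone else, the approach looks roughly like this when sketched in plain Python (not the tool's own transforms). The helper names `add_key` and `replace_corrected`, the column names, and the `|` separator are all assumptions for illustration. The separator is assumed never to appear in the data, which stops values like "ab" + "c" and "a" + "bc" from producing the same key.

```python
def add_key(rows, key_cols, sep="|"):
    """Concatenate the identifying columns into a single 'key' column."""
    return [dict(row, key=sep.join(str(row[c]) for c in key_cols)) for row in rows]


def replace_corrected(original, corrected, key_cols):
    """Remove original rows that have a correction, then add the corrected rows."""
    orig_keyed = add_key(original, key_cols)
    corr_keyed = add_key(corrected, key_cols)
    corrected_keys = {row["key"] for row in corr_keyed}
    # "Subtract": drop every original row whose key appears in the corrections.
    kept = [row for row in orig_keyed if row["key"] not in corrected_keys]
    # "Stack": append the corrected rows to what is left.
    return kept + corr_keyed


# Hypothetical sample data: three identifying columns plus one value column.
original = [
    {"site": "A", "year": 2020, "item": "x", "value": 10},
    {"site": "A", "year": 2020, "item": "y", "value": 99},  # erroneous
    {"site": "B", "year": 2021, "item": "x", "value": 30},
]
corrected = [
    {"site": "A", "year": 2020, "item": "y", "value": 20},
]

updated = replace_corrected(original, corrected, ["site", "year", "item"])
```

The corrected rows land at the end of the result here, so the replacement is not strictly in place, and the helper `key` column can be removed again once the rows are combined.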