Selecting one row based on date from duplicates

I have rows of duplicates by ID. I need to know if there is a way to choose the row in each duplicated set that has the most recent date by the date column.

You should be able to Sort by the date column (descending, so the most recent date comes first), then Dedupe by id.

If you want to preserve the original row order, use Row num to add a row number column before the Sort, then Sort by the row number column after the Dedupe. Finally, use Remove cols to drop the row number column.
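
For anyone who wants to see the logic of those steps spelled out, here is a rough Python sketch. It is only an illustration of the idea, not how the transforms are implemented; the sample rows, the column names `id` and `date`, and the helper `latest_per_id` are all made up for the example.

```python
from datetime import date

# Hypothetical sample data: rows with duplicate ids and different dates.
rows = [
    {"id": "A", "date": date(2023, 1, 5)},
    {"id": "B", "date": date(2023, 3, 1)},
    {"id": "A", "date": date(2023, 6, 9)},
    {"id": "B", "date": date(2022, 12, 25)},
]

def latest_per_id(rows):
    # "Row num" step: remember each row's original position.
    numbered = [dict(row, row_num=i) for i, row in enumerate(rows)]
    # "Sort" step: newest date first, so the most recent row per id is on top.
    numbered.sort(key=lambda r: r["date"], reverse=True)
    # "Dedupe" step: keep only the first row seen for each id.
    seen = set()
    kept = []
    for r in numbered:
        if r["id"] not in seen:
            seen.add(r["id"])
            kept.append(r)
    # Sort by row number to restore the original order, then drop that column.
    kept.sort(key=lambda r: r["row_num"])
    return [{k: v for k, v in r.items() if k != "row_num"} for r in kept]

result = latest_per_id(rows)
for r in result:
    print(r["id"], r["date"])
```

With this sample data, B's 2023-03-01 row and A's 2023-06-09 row are kept, and B comes first because its kept row appeared earlier in the original data.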

So dedupe always takes the top reference for any duplicate which is what allows the sort to work, correct?
Thank you for your help!

Yes, that is correct.

E.g.:

Key Value
A 1
A 2
B 3
B 4

Is changed by Dedupe on the Key column to:

Key Value
A 1
B 3
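
The same keep-the-top-row behaviour can be sketched in a few lines of Python. This is just an illustration using the Key/Value rows from the example; `dedupe_keep_first` is a hypothetical helper name, not part of any tool.

```python
def dedupe_keep_first(rows, key):
    # Walk the rows top to bottom and keep the first row for each key value.
    seen = set()
    kept = []
    for row in rows:
        if row[key] not in seen:
            seen.add(row[key])
            kept.append(row)
    return kept

# (Key, Value) rows from the example above.
rows = [("A", 1), ("A", 2), ("B", 3), ("B", 4)]
result = dedupe_keep_first(rows, key=0)  # key=0 means dedupe on the Key column
print(result)  # [('A', 1), ('B', 3)]
```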

Awesome, very helpful, thanks.
