"Shuffle" transform

lainyj · April 8, 2025, 5:25pm

I find myself needing to randomize for a lot of reports recently. Example: I have 400 shifts worked in the past 30 days. I dedupe to remove duplicates and am left with each employee's name once, regardless of how many shifts that person worked. I then need to randomize/shuffle the rows so that I can slice and get 20 random names out of, say, 115 names.

If this is already a feature, then I apologize!

Olaf · April 8, 2025, 6:17pm

Have a look at the Random transform: https://www.easydatatransform.com/help/latest/mac/html/index.html

You can sort by the newly generated values and remove the column afterwards; the remaining rows are then in a random order.

Admin · April 8, 2025, 6:47pm

As @Olaf says, you can:

use the Random transform to add a column of random numbers
then use the Sort transform to sort by the random number column
then use the Sample transform to keep the top 20 rows
then use the Remove Cols transform to remove the column added
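
For anyone who wants to see the logic of those four steps outside the tool, here is a minimal Python sketch. It is not how Easy Data Transform implements anything; the function name `pick_random_rows`, the made-up employee names, and the seed value are all assumptions for illustration.

```python
import random

def pick_random_rows(rows, k, seed=None):
    """Hypothetical helper mirroring the four steps above."""
    rng = random.Random(seed)
    # Step 1 (Random): attach a random number to every row.
    keyed = [(rng.random(), row) for row in rows]
    # Step 2 (Sort): order the rows by the random number.
    keyed.sort(key=lambda pair: pair[0])
    # Step 3 (Sample): keep only the top k rows.
    top = keyed[:k]
    # Step 4 (Remove Cols): drop the random number again.
    return [row for _, row in top]

# Made-up data: 400 shifts worked by 115 distinct employees.
shifts = [f"Employee {i % 115}" for i in range(400)]
# Dedupe while keeping first-seen order.
names = list(dict.fromkeys(shifts))
sample = pick_random_rows(names, 20, seed=42)
print(len(names), len(sample))
```

Sorting by a throwaway random key is just one way to shuffle; it is used here only because it matches the transform chain step for step.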

But it is simpler to just use:

Monotone94 · April 8, 2025, 11:29pm

As the documentation says:

•The default Seed value is based on the system clock when the transform was first created.

This is good, but it would also be useful to have one of the following:

A button within Random to generate a new seed, or
A tick-box which when enabled generates new seeds for that Random transform*. The circumstances in which that generation occurs could be
- whenever the transform is Run (including Auto-run), or alternatively
- the first run only after the transform is opened, whether or not Auto.

I mean to suggest only one of those sub-options, not both, and I prefer the latter. The benefit is simply that it saves me from randomly generating and entering my own seeds, while retaining the existing behaviour as the default.

*edited to change “all Random transforms” to the more sensible and consistent “that Random transform”.

Admin · April 9, 2025, 7:09am

@Monotone94
Why do you need to keep changing the seed?

Monotone94 · April 9, 2025, 7:41am

If the data is both similar in content and ordered, then the same seed produces substantially the same result. That normal behaviour is often what is desired, but too often I want the subset selection to be more variable. The Sample transform with random selection works in the same fashion as Random.
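
To illustrate what I mean, here is a rough Python sketch of the general behaviour of seeded generators. It uses Python's own `random` module, not Easy Data Transform's generator; `sample_rows` and the data are hypothetical, and taking a new seed from the clock on each run is only my assumption of what a "new seed" option might do.

```python
import random
import time

# Made-up, ordered input: 115 distinct names.
rows = [f"Employee {i}" for i in range(115)]

def sample_rows(rows, k, seed):
    """Hypothetical helper: pick k rows using a generator built from seed."""
    return random.Random(seed).sample(rows, k)

# Fixed seed: the same ordered input always yields the same selection.
fixed_a = sample_rows(rows, 20, seed=1234)
fixed_b = sample_rows(rows, 20, seed=1234)
print(fixed_a == fixed_b)

# New seed per run (here taken from the clock): the selection varies between runs.
fresh_seed = time.time_ns()
varied = sample_rows(rows, 20, seed=fresh_seed)
print(len(varied))
```

With the fixed seed the two selections are identical, which is the repeatability that the current default gives; the clock-based seed is the variability I am asking for as an option.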

Admin · April 9, 2025, 8:16am

I will make a note of it.

prashant · April 21, 2025, 10:44am

If you decide to do this, the play button I copied from Import/Write to file would be great for redoing the seed and regenerating the options.