Reusing transforms on a different input file

Titus · August 12, 2021, 8:12pm

Sorry if this has been covered in answers to other questions, but I saved a series of transforms on one file that I wanted to reuse on a different input file having the exact same format, but different data.

Unfortunately, when I disconnect the original input file and try to connect the new one to the transforms, all the transforms that require column selections now have them unchecked. I have to re-check each one. Is there a way for it to remember which columns were selected for each transform, or to save column selections?

Admin · August 12, 2021, 10:28pm

@Titus
If you disconnect the input it will reset all the column related parameters downstream, for consistency. So we don’t recommend that. Instead just change the file that the input is pointing to. E.g. by clicking on the ‘…’ browse button in the Right pane with the input item selected.

If you are trying to do the same set of transforms on lots of files then please look at batch processing:
https://www.easydatatransform.com/help/1k/windows/html/batch_processing.html

astro · August 17, 2021, 4:22pm

I had my moment when quickly I wanted to add a transformation in the beginning to solve a problem at the end faster.

I disconnected and all but the rename transformation were reseted. (Lucky me because there was a lot to rename)

If the parameters could stay “locked” during the sessions that would be a very handy feature.

Admin · August 17, 2021, 4:50pm

It is tricky because how do you ‘lock’ something to column 9 when there is no column 9?

Note that you can insert a transform between 2 existing transforms.

insert-transform-animation

This generally avoids the issue (not always, this is something we hope to fix soon).

astro · August 17, 2021, 5:41pm

I see. It is a one time mistake anyway. (Or hopefully so.)

But it is somehow an understandable reflex due to the design. When you see the transformations, there comes immediately the idea to ones mind to just quickly copy the branch and stick it to another input.

So I need to think about a naming strategy for the inputs then. Maybe a folder that renames the inputs accordingly. Is there a best practice or do people regularly do this go via the batch function? (have not come to the moment to use that one, i.e. learn about it.)

Admin · August 17, 2021, 6:19pm

I completely understand. But we have to maintain internal consistency, which regretably can lead to nuking of column related parameters downstream. But we may find a cleverer way to deal with this issue in future.

Note that you can click the ‘…’ browse button next to the location of a file and choose a different file. if this file has the same (or more) columns then no column information will be lost.

The batch feature is useful when you want to apply the same .transform to lots of files.

astro · August 17, 2021, 6:39pm

Have not seen that one. Thank you very much. (That was kind of embarrassing… )

outrigger999 · June 29, 2022, 7:34am

I have a slightly different scenario. I’d like to take a fairly large number of files (different data but same columns) and reuse the transforms by attaching them to the initial stack. I understand that I have to leave one dataset there from the previous or opened transform so that it doesn’t reset down the line. How can I easily attach files (let’s say 20 of them) to the original stack without going one by one and clicking the plus button and attaching each one to the stack? When I bring in that many files, the entire tree gets very small. Is there a way to select all the new files and connect them to the stack? Thanks…

Olaf · June 29, 2022, 2:01pm

Have you checked the batch functionality, which is great for identically structured files (it support * in the file names, too). In case you can not use it in the existing flow. Create a batch job merging all files into one and use the result for the exiting flow.

outrigger999 · June 29, 2022, 7:02pm

Hey thanks for the quick reply. I’ll try that!

Admin · June 30, 2022, 9:26pm

As Olaf says, you could use the batch feature to perform the transforms on each input file. If you want you can append the batch results to a single file. See example 2 here:

mk2109 · August 10, 2022, 4:52am

Is there a way to switch the Input document, without needing to be identical to the previous Input file?

I save all the input docs as a “template,” but safekeeping documents seems inefficient.
Even with the Input file intact, any small difference breaks the whole Transformation file.

image1883×554 184 KB

I’m relying on notes to transform, which defeats the automation intent.

It’s ironic that I can’t automate a transformation unless I input a formatted file, so what am I doing wrong.

Admin · August 10, 2022, 6:31am

In what way is the new input file different to the old input file?

If they have the same column structure, you should be able to click the ‘…’ button in the Right pane of the input and choose the new file.

If the column structure is different, you will probably need to use a Stack to get them in a consistent order. See:

Some transforms (such as Split Cols) can produce a different number of columns, depending on the input. This can make things a bit more complicated.

mk2109 · August 11, 2022, 8:11am

Well, that’s the problem; I assume the new file will be the same format. I have redundant steps and make transformations to encompass possible changes.

Matching 20 column headers to run a transformation takes as much time as manually formatting the spreadsheet. Even so, the differences are so small that they’re not caught, like a white space or an abbreviation.

Admin · August 11, 2022, 8:24am

If the columns are in different orders with different names, I’m not how we can fully automate that.

Perhaps you could supply a real world example of the differences between 2 inputs. Only the first few rows are needed and you can change any sensitive data.

Anonymous · August 11, 2022, 5:01pm

Hi,

If problem is having column name different or in wrong order or both different and in wrong order, but the data is the same in the column to process, then hope the following steps help you out.

Step 1:
Create your column name the way it suppose to be and copy paste them as input from Clipboard

col1,col2,col3,col4

Step 2:
Add Stack transform and set align columns by: Header name

Step 3:
Now take only the column header from your input file and add only one row, with the column numbers the column heading is representing

inputfile
col 1,c0l4,Col3,column2
1,4,3,2

copy and paste them as input from Clipboard and attach it to the Stack, and you will see which column matched and which did not by having the row value under the columns and on right side you will see all columns that did not match. This will solve your issue of locating the differences in column names due to white space, spelling, abreviation and such.

Step 4:
Add another Stack transform and have it’s first input from the Step 1 right column name input from Clipboard

Step 5:
Add your input file

Step 6:
Add Rename Column transform and rename the columns that did not match

Step 7:
Now simply do your transform after the Stack

Now if you remove Step 5 and 6, your Step 7 transforms will stay intact as long as your Step 1 column name do not change.