Reading multiple files into one Input dataset

We are currently working on allowing you to read multiple files into a single input dataset.

For example, you will be able to read every file with the extension ‘csv’ in the folder ‘c:\Users\Desktop\multitest\’ into a single Input item.

You can also choose to recurse into sub-folders.
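
To make the idea of matching files by extension, with optional recursion, more concrete, here is a rough Python sketch. It only illustrates the concept and is not how the feature is implemented; the function name `find_files` and its parameters are made up for this example.

```python
from pathlib import Path


def find_files(folder, extension="csv", recurse=False):
    """Return the sorted paths of files in `folder` with the given extension.

    Hypothetical helper for illustration only. With recurse=True,
    sub-folders are searched as well.
    """
    root = Path(folder)
    pattern = f"*.{extension}"
    # rglob walks sub-folders; glob looks at the top-level folder only.
    matches = root.rglob(pattern) if recurse else root.glob(pattern)
    return sorted(p for p in matches if p.is_file())
```

For example, `find_files(r"c:\Users\Desktop\multitest", "csv", recurse=True)` would list every CSV file in that folder and its sub-folders.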

For Excel files you will be able to choose which sheets to read using a wildcard.
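
As a rough illustration of wildcard sheet selection, the Python sketch below filters a list of sheet names against a pattern. It assumes shell-style wildcards (`*` for any run of characters, `?` for a single character); the exact wildcard syntax the feature will use is not stated here, and `select_sheets` is a made-up name.

```python
from fnmatch import fnmatchcase


def select_sheets(sheet_names, pattern):
    """Return the sheet names that match a shell-style wildcard pattern.

    Hypothetical helper for illustration only. Matching is
    case-sensitive, which is an assumption of this sketch.
    """
    return [name for name in sheet_names if fnmatchcase(name, pattern)]
```

With sheets named `Sales 2023`, `Sales 2024` and `Notes`, the pattern `Sales*` would pick the first two and skip `Notes`.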

The datasets will be automatically stacked by column number. You can also stack by column name by defining a schema.
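
The difference between the two stacking modes matters when files have the same columns in a different order. The Python sketch below shows the general idea only; it is not the actual implementation. The function names, the `(header, rows)` table representation, the use of the first file's header, and the padding of missing values with empty strings are all assumptions made for this example.

```python
def stack_by_position(tables):
    """Stack tables by column number.

    Each table is a (header, rows) pair. The first table's header is
    kept; shorter rows are padded with "" (an assumption of this sketch).
    """
    width = max(len(header) for header, _ in tables)
    first_header = list(tables[0][0])
    # Invent placeholder names for any columns beyond the first header.
    header = first_header + [f"col{i + 1}" for i in range(len(first_header), width)]
    rows = [
        list(row) + [""] * (width - len(row))
        for _, table_rows in tables
        for row in table_rows
    ]
    return header, rows


def stack_by_name(tables, schema):
    """Stack tables by column name, using `schema` as the output columns.

    Columns missing from a table are filled with "" (an assumption).
    Rows are assumed to be as long as their table's header.
    """
    rows = []
    for header, table_rows in tables:
        index = {name: i for i, name in enumerate(header)}
        for row in table_rows:
            rows.append([row[index[name]] if name in index else "" for name in schema])
    return list(schema), rows


a = (["id", "name"], [["1", "Ann"]])
b = (["name", "id"], [["Bob", "2"]])

by_position = stack_by_position([a, b])
by_name = stack_by_name([a, b], ["id", "name"])
```

Here stacking by position puts `Bob` in the `id` column because the second file has its columns swapped, while stacking by name with the schema `id, name` lines the values up correctly.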

This will allow you to read and stack hundreds or thousands of files or Excel sheets into a single Input in one operation.

All the files read need to be the same type (e.g. all CSV or all Excel).

This has been a significant amount of work, and there is more to do: integrating it with Batch processing and command line arguments, writing documentation, testing, etc. But we hope to have a snapshot release you can try soon.
