First, I’m very interested in the Transform; I can see many valid and invalid uses of it.
The manipulation of raw data is concerning to me, especially if the manipulated data is shared with folks who don’t realize the data set has been altered.
I teach a class at Dartmouth on Forensic Analytics and Fraudulent Data using Benford’s law as a way of identifying data sets that have been tampered with. I can’t believe how much “scientific data” has been manipulated.
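For readers who haven't met Benford’s law: it predicts that in many naturally occurring data sets the leading digit d appears with probability log10(1 + 1/d), so about 30% of values start with 1 and under 5% start with 9. Below is a minimal sketch of a first-digit check, assuming plain Python and a chi-square statistic as the comparison. The function names are hypothetical, and this is neither the course's method nor a feature of the software discussed here.

```python
import math
from collections import Counter


def benford_expected():
    """Expected leading-digit proportions under Benford's law: log10(1 + 1/d)."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Leading nonzero digit of a nonzero number, ignoring sign and scale."""
    # Scientific notation always puts the leading digit first, e.g. '1.23e-03'.
    return int(f"{abs(x):.15e}"[0])


def benford_chi_square(values):
    """Chi-square statistic comparing observed leading digits with Benford's law.

    Zeros are skipped because they have no leading digit.
    """
    digits = [first_digit(v) for v in values if v != 0]
    n = len(digits)
    if n == 0:
        raise ValueError("need at least one nonzero value")
    counts = Counter(digits)
    expected = benford_expected()
    return sum(
        (counts.get(d, 0) - n * expected[d]) ** 2 / (n * expected[d])
        for d in range(1, 10)
    )


# Powers of 2 are a classic Benford-like sequence.
benford_like = [2 ** k for k in range(100)]
# Every leading digit equally often: not Benford-like.
uniform_digits = list(range(100, 1000))

print(benford_chi_square(benford_like))
print(benford_chi_square(uniform_digits))
```

With 8 degrees of freedom, the 5% critical value for the chi-square statistic is about 15.51. A value well above that suggests the leading digits do not follow Benford’s law, which is a reason to look closer rather than proof of tampering, since plenty of legitimate data sets (bounded ranges, assigned IDs) do not follow the law either.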
Would it be possible to have a comment generated in the output/processing window indicating the number of outliers that were changed?
“If you torture the data long enough, it will confess to anything.”
Yes. Pretty much all the transforms tell you how many rows have been modified.
In future we might add a report you can generate that outlines what was done during a run. It will have to be an outline to keep it to a manageable size.