Questions tagged [standardization]
7 questions
4
votes
1 answer
using Standardization and Normalization in the same pipeline
I have a pyspark ML pipeline that uses PCA reduction and ANN. My understanding is that PCA performs best when given standardized values while NN perform best when given normalized values. Does it make sense to standardize the values before PCA and…
Mike Pone
- 43
- 6
2
votes
0 answers
How should I input and output feature and target timeseries to timeseries transformer
I am trying out PatchTST timeseries transformer (paper, code) on a timeseries data that I have. The way PatchTST handles data is as follows:
Note that on line 78-79, the repo does following:
self.data_x = data[border1:border2]
self.data_y =…
Mahesha999
- 299
- 1
- 9
2
votes
1 answer
Standard Scaling After Log Transformation
I have a quick question about whether or not to standardize features after a log transformation.
I have one feature that is heavily skewed and requires the log transformation, for the other features I'm applying the standard scaler.
My question is,…
atn291
- 21
- 2
1
vote
1 answer
Combining standardizing and normalizing my input data for ML gives the best results, why?
When I combine standardizing and normalizing my input data for my hybrid ANN model, it gives the best results.
But I can't find anywhere, why. I based it on a paper's approach but they don't justify their practice either. Anyone knows…
Mia May
- 11
- 2
0
votes
0 answers
Should I use StandardScaler on a dataset with binary,descreet and continuos data?
I have a dataset consisting of mixed type of features, I already transformed the categorical ones in descrete and binary. Because the dataset is highly dimensional I want to use PCA to reduce it. I used StandardScaler to the dataset then I use PCA…
0
votes
1 answer
Potential Sign Issues in a Composite Performance Metric for Model Selection
I am analyzing the results of various machine learning models for a regression task, using four metrics: RMSE, MAE, MAPE, and $R^2$. My approach involves two types of analyses:
Individual Metric Analysis: Where each metric is…
Felipe
- 21
- 6
0
votes
1 answer
Why is Standard Scaler recommended for Normally distributed data?
I often read the recommendation of using StandardScaler for Normally distributed data, otherwise MinMax Scaling.
For example the answers here.
Curious to know the reason/maths behind it.
I get that the idea is to bring the distribution to standard…