Questions tagged [skewness]

6 questions
5
votes
1 answer

Linear Regression bad results after log transformation

I have a dataset that has the following columns: The variable I'm trying to predict is "rent". My dataset looks a lot similar to what happens in this notebook. I tried to normalize the rent column and the area column using log transformation since…
2
votes
0 answers

Why might transforming my features improve the performance on a simple decision tree?

The features & target in my dataset are very skewed. Could anyone explain why transforming the features & target (I'm using a Yeo-Johnson transformation) is significantly improving the performance of my model when using a decision tree?
1
vote
1 answer

how to fix left and right skewness

I know that left and right skewness means it has a long tail on either the left(left skewness) or right(right skewness). However, the example below is an example of right skewness. data = pd.DataFrame({ "Income": [15000, 22000, 30000, 35000,…
RushHour
  • 145
  • 6
1
vote
0 answers

Why is resnet regression model (on a skewed data with small interval) not converging?

Using resnet50 (torchvision.models pretrained=False) with an input of [15, 224,224] which includes 14 heatmaps and a level set map, per datapoint. The goal is to predict a cutoff value for each datapoint, this value is used for heatmaps…
topcat
  • 11
  • 2
1
vote
1 answer

Is there an appropriate hypothesis test: Two samples, one only with one datapoint, non-normal?

I have two 'samples'. The first consists of approx. 400 physical measurements of a quantity (taken over one hour, and the real situation is not a steady state). They show a very skewed distribution for which the theoretical model is not known, and…
Mars
  • 11
  • 1
1
vote
2 answers

Is there a measure to compare features on the basis of normality

I have a dataset of cars and it has many features including 'acceleration’, ‘horsepower’, and ‘mpg'. I am supposed to check which of these features is the most similar to a normal distribution, so I made histograms of each feature, acceleration was…