Most Popular
1500 questions
8
votes
2 answers
Can a linear regression model without polynomial features overfit?
I've read in some articles on the internet that linear regression can overfit. However is that possible when we are not using polynomial features? We are just plotting a line trough the data points when we have one feature or a plane when we have…
Tim von Känel
- 381
- 2
- 12
8
votes
4 answers
Job title similarity
I'm trying to define a metric between job titles in IT field. For this I need some metric between words of job titles that are not appearing together in the same job title, e.g. metric between the words
senior, primary, lead, head, vp, director,…
Mher
- 181
- 5
8
votes
1 answer
Anybody know what this type of visualisation is called?
I think this is a pretty cool way to visualise changes in values but I can’t find any name for this type of visualisation.
I
Source: https://www.economist.com/graphic-detail/2020/07/28/americans-are-getting-more-nervous-about-what-they-say-in-public
K G
- 183
- 3
8
votes
3 answers
Modality of data
Can anyone please explain in clear words what is generally meant by "modality of data"?
I know what modality means with respect to distributions.
Julia
- 81
- 1
- 2
8
votes
3 answers
Bert-Transformer : Why Bert transformer uses [CLS] token for classification instead of average over all tokens?
I am doing experiments on bert architecture and found out that most of the fine-tuning task takes the final hidden layer as text representation and later they pass it to other models for the further downstream task.
Bert's last layer looks like this…
Aaditya ura
- 425
- 1
- 5
- 16
8
votes
2 answers
Do I need validation data if my train and test accuracy/loss is consistent?
I am trying to understand the purpose of a 3rd split in the form of a validation dataset. I am not necessarily talking about cross-validation here.
In the scenario below, it would appear that the model is overfit to the training dataset.
Train…
LayneSadler
- 549
- 6
- 17
8
votes
2 answers
Why Scikit and statsmodel provide different Coefficient of determination?
First of all, I know there is a similar question, however, I didn't find it so much helpful.
My issue is concerning simple Linear regression and the outcome of R-Squared. I founded that results can be quite different if I use statsmodels and…
Luckasino
- 183
- 1
- 4
8
votes
1 answer
Which ML approach to choose for the game AI when rewards are delayed?
Question: Which Machine Learning approach should I choose for the AI of my computer game, where the actions of the AI do not lead to immediate rewards, but delayed rewards instead?
About me:
I am a complete beginner in the area of machine learning.…
Logende
- 61
- 4
8
votes
1 answer
Keras Early Stopping: Monitor 'loss' or 'val_loss'?
I often use "early stopping" when I train neural nets, e.g. in Keras:
from keras.callbacks import EarlyStopping
# Define early stopping as callback
early_stopping = EarlyStopping(monitor='loss', patience=5, mode='auto',…
Peter
- 7,896
- 5
- 23
- 50
8
votes
2 answers
How important is advanced SQL for data science?
Is advanced level SQL required to be competitive as a data scientist? Is it more important for a data analyst to be good in SQL? Is it enough to be able to extract data using simple SQL queries?
I know it is faster to manipulate data in SQL than to…
user62049
8
votes
1 answer
FP16, FP32 - what is it all about? or is it just Bitsize for Float-Values (Python)
What is it all about FP16, FP32 in Python? My potential Business Partner and I are building a Deep Learning Setup for working with time series. He came up with "FP16 and FP32" while finding a GPU. It looks like he's talking about Floating Point…
Ishmael89
- 91
- 1
- 1
- 3
8
votes
1 answer
Encoding with OrdinalEncoder : how to give levels as user input?
I am trying to do ordinal encoding using:
from sklearn.preprocessing import OrdinalEncoder
I will try to explain my problem with a simple dataset.
X = pd.DataFrame({'animals':['low','med','low','high','low','high']})
enc =…
Ayush Ranjan
- 411
- 1
- 4
- 15
8
votes
2 answers
Joining tables from different locations in Bigquery
I have been trying to join two tables from different datasets that are in different locations but in the same project. However, I keep getting the error:
dataset not found in US location.
The datasets' locations are US and us-east1
Here is what I…
shivanshu dhawan
- 188
- 1
- 2
- 9
8
votes
2 answers
Why does vanilla transformer has fixed-length input?
I know that in the math on which the transformer is based there is no restriction on the length of input. But I still can’t understand why we should fix it in the frameworks (PyTorch). Because of this problem Transformer-XL has been created.
Can you…
Ann
- 133
- 1
- 7
8
votes
2 answers
AUC-ROC for Multi-Label Classification
Hey guys I'm currently reading about AUC-ROC and I have understood the binary case and I think that I understand the multi-classification case. Now I'm a bit confused on how to generalize it to the multi-label case, and I can't find any intuitive…
NotoriousFunk
- 81
- 1
- 1
- 3