Questions tagged [training]

Training is the part of machine learning whereby a model is "trained" on a defined portion of a dataset to learn attributes and statistical features of the data. Its counterparts are called Testing and Validation. After training, a model is tested and validated on another portion of the dataset.
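A minimal sketch of that workflow, assuming scikit-learn, a synthetic dataset, and arbitrary split proportions and estimator:

```python
# Sketch: train / validation / test workflow (names, sizes and estimator are illustrative).
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)

# Hold out 20% for the final test, then carve a validation set out of the rest.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=0)

model = RandomForestClassifier(random_state=0)
model.fit(X_train, y_train)                                # training
print("validation accuracy:", model.score(X_val, y_val))  # validation
print("test accuracy:", model.score(X_test, y_test))      # final test
```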


690 questions
125 votes • 2 answers

Training an RNN with examples of different lengths in Keras

I am trying to get started learning about RNNs and I'm using Keras. I understand the basic premise of vanilla RNN and LSTM layers, but I'm having trouble understanding a certain technical point for training. In the keras documentation, it says the…
Tac-Tics • 1,370 • 2 • 9 • 6
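A common way to handle the different lengths this question is about is to pad the sequences and mask the padding; a toy sketch assuming TensorFlow's Keras API, with made-up integer sequences and layer sizes:

```python
# Sketch: pad variable-length sequences and mask the padding in Keras.
import numpy as np
import tensorflow as tf
from tensorflow.keras.preprocessing.sequence import pad_sequences

sequences = [[3, 7, 2], [5, 1], [8, 4, 9, 6]]        # examples of different lengths
padded = pad_sequences(sequences, padding="post")     # zero-padded to a common length

model = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=10, output_dim=8, mask_zero=True),  # masks the 0 padding
    tf.keras.layers.LSTM(16),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy")
model.fit(padded, np.array([0, 1, 0]), epochs=1, verbose=0)
```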
63 votes • 5 answers

Is it always better to use the whole dataset to train the final model?

A common technique after training, validating and testing the Machine Learning model of preference is to use the complete dataset, including the testing subset, to train a final model to deploy in, e.g., a product. My question is: Is it always…
pcko1 • 4,030 • 2 • 17 • 30
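One pattern behind this question, sketched with scikit-learn: estimate generalization first (here with cross-validation), then refit the same configuration on all available data for deployment. The estimator and data are placeholders:

```python
# Sketch: estimate performance first, then refit on the full dataset for deployment.
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, random_state=0)
estimator = LogisticRegression(max_iter=1000)

# The performance estimate comes from cross-validation on the available data.
print("estimated accuracy:", cross_val_score(estimator, X, y, cv=5).mean())

# The deployed model is a fresh copy fitted on everything.
final_model = clone(estimator).fit(X, y)
```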
63 votes • 6 answers

Should a model be re-trained if new observations are available?

So, I have not been able to find any literature on this subject, but it seems like something worth giving some thought: What are the best practices in model training and optimization if new observations are available? Is there any way to determine the…
neural-nut • 1,803 • 3 • 18 • 28
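One of the options usually discussed for this situation is incremental (online) learning rather than a full retrain; a minimal sketch assuming scikit-learn's partial_fit interface, with made-up "old" and "new" batches:

```python
# Sketch: update an already-trained model with new observations via partial_fit.
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)
X_old, y_old = rng.normal(size=(200, 5)), rng.integers(0, 2, 200)
X_new, y_new = rng.normal(size=(50, 5)), rng.integers(0, 2, 50)

model = SGDClassifier(random_state=0)
model.partial_fit(X_old, y_old, classes=np.array([0, 1]))  # initial training
model.partial_fit(X_new, y_new)                            # update when new data arrives
```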
62 votes • 4 answers

What is the advantage of keeping batch size a power of 2?

While training models in machine learning, why is it sometimes advantageous to keep the batch size to a power of 2? I thought it would be best to use the largest size that fits in your GPU memory / RAM. This answer claims that for some packages,…
James Bond • 1,265 • 2 • 12 • 13
48 votes • 5 answers

In the context of Deep Learning, what are training warmup steps?

I found the term "training warmup steps" in some of the papers. What exactly does this term mean? Has it got anything to do with "learning rate"? If so, how does it affect it?
Ashwin Geet D'Sa • 1,217 • 2 • 11 • 20
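Warmup steps generally refer to ramping the learning rate up from a small value over the first N optimizer steps before the main schedule takes over; a sketch of a plain linear warmup, with the base rate and step counts chosen arbitrarily:

```python
# Sketch: linear learning-rate warmup followed by a constant rate.
base_lr = 1e-3
warmup_steps = 1000

def learning_rate(step):
    # Ramp linearly from ~0 up to base_lr over warmup_steps, then stay constant.
    if step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    return base_lr

print([round(learning_rate(s), 6) for s in (0, 499, 999, 5000)])
```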
46 votes • 8 answers

Which would I prefer: an over-fitted model or a less accurate model?

Let's say we have trained two models, and let's say we are looking for good accuracy. The first has an accuracy of 100% on the training set and 84% on the test set. Clearly over-fitted. The second has an accuracy of 83% on the training set and 83% on the test set.…
35 votes • 10 answers

Why is it wrong to train and test a model on the same dataset?

What are the pitfalls of doing so and why is it a bad practice? Is it possible that the model starts to learn the images "by heart" instead of understanding the underlying logic?
karalis1 • 461 • 1 • 5 • 8
22 votes • 4 answers

Train/test split for unbalanced dataset classification

I have a model that does binary classification. My dataset is highly unbalanced, so I thought that I should balance it by undersampling before I train the model. So balance the dataset and then split it randomly. Is this the right way? Or should…
lads • 423 • 1 • 5 • 8
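The usual recommendation for this question is to split first (stratified, so the test set keeps the original imbalance) and balance only the training portion; a sketch assuming the imbalanced-learn package and synthetic data:

```python
# Sketch: stratified split first, then undersample only the training portion.
from imblearn.under_sampling import RandomUnderSampler
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

# The test set keeps the real-world class distribution.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)

# Balancing touches the training data only.
X_train_bal, y_train_bal = RandomUnderSampler(random_state=0).fit_resample(X_train, y_train)
```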
21 votes • 6 answers

Tool to label images for classification

Can anyone recommend a tool to quickly label several hundred images as an input for classification? I have ~500 microscopy images of cells. I would like to assign categories such as 'healthy', 'dead', 'sick' manually for a training set and save…
jlarsch • 401 • 1 • 3 • 8
15 votes • 1 answer

Is stratified sampling necessary (random forest, Python)?

I use Python to run a random forest model on my imbalanced dataset (the target variable is a binary class). When splitting into training and testing datasets, I struggled with whether to use stratified sampling (like the code shown) or not. So far, I…
LUSAQX • 783 • 3 • 10 • 24
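Stratified splitting in scikit-learn amounts to passing the label vector to the stratify argument so that both splits keep the class ratio; a small sketch with a random forest and synthetic imbalanced data:

```python
# Sketch: stratified train/test split feeding a random forest.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, weights=[0.85, 0.15], random_state=0)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)   # stratify keeps the class ratio

print("train positive rate:", np.mean(y_train), "test positive rate:", np.mean(y_test))
clf = RandomForestClassifier(random_state=0).fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```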
13 votes • 2 answers

Oversampling/Undersampling only the train set, or both train and validation sets?

I am working on a dataset with a class imbalance problem. Now, I know one needs to oversample or undersample only the train set and not the test set. But my issue is whether to oversample the train set and then split it into train and validation sets, or…
yamini goel • 761 • 3 • 7 • 14
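A sketch of the "split first, then oversample only the training part" option, assuming imbalanced-learn's SMOTE and synthetic data; the validation set is left untouched so its metrics reflect the original distribution:

```python
# Sketch: split into train/validation first, then oversample the training part only.
from imblearn.over_sampling import SMOTE
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)

X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)

# SMOTE sees only the training rows; the validation set stays untouched.
X_train_os, y_train_os = SMOTE(random_state=0).fit_resample(X_train, y_train)
```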
10 votes • 3 answers

How to split train/test datasets with equal class proportions

I would like to know how I can split the following target counts (0: 1586, 1: 318) in order to have the same proportion of 0 and 1 classes in the dataset I train on, if my dataset is called df and includes 10 columns, both numerical and…
user105599 • 155 • 1 • 1 • 5
10 votes • 2 answers

Train object detection without annotated data/bounding boxes

From what I can see most object detection NNs (Fast(er) R-CNN, YOLO etc) are trained on data including bounding boxes indicating where in the picture the objects are localised. Are there algos that simply take the full picture + label annotations,…
10 votes • 4 answers

Does training a neural network on a combined dataset outperform sequential training on individual datasets?

I have a neural network with a fixed architecture (let's call it Architecture A). I also have two datasets, Dataset 1 and Dataset 2, both of which are independently and identically distributed (i.i.d.). I’m exploring how training strategies affect…
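The two strategies the question contrasts can be written down directly; a toy Keras sketch with made-up data and an arbitrary architecture, not a claim about which one wins:

```python
# Sketch: training on a combined dataset vs. sequentially on each dataset.
import numpy as np
import tensorflow as tf

def make_model():
    m = tf.keras.Sequential([tf.keras.layers.Dense(16, activation="relu"),
                             tf.keras.layers.Dense(1, activation="sigmoid")])
    m.compile(optimizer="adam", loss="binary_crossentropy")
    return m

rng = np.random.default_rng(0)
X1, y1 = rng.normal(size=(500, 10)), rng.integers(0, 2, 500)
X2, y2 = rng.normal(size=(500, 10)), rng.integers(0, 2, 500)

# Strategy A: one training run on the concatenated data.
combined = make_model()
combined.fit(np.concatenate([X1, X2]), np.concatenate([y1, y2]), epochs=5, verbose=0)

# Strategy B: train on Dataset 1, then continue training on Dataset 2.
sequential = make_model()
sequential.fit(X1, y1, epochs=5, verbose=0)
sequential.fit(X2, y2, epochs=5, verbose=0)
```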
9 votes • 4 answers

Understanding how convolutional layers work

After working with a CNN using Keras and the MNIST dataset for the well-known handwritten digit recognition problem, I came up with some questions about how convolutional layers work. I can understand what the convolution process is. My first…
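One way to see the mechanics asked about here is to check the output shape and parameter count of a single Conv2D layer on MNIST-sized input; a small sketch assuming TensorFlow's Keras API:

```python
# Sketch: shape and parameter count of one convolutional layer on MNIST-sized input.
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    tf.keras.layers.Conv2D(filters=32, kernel_size=3, activation="relu"),
])
model.summary()
# Output feature map: (26, 26, 32) with 'valid' padding.
# Parameters: 32 filters * (3*3*1 weights + 1 bias) = 320.
```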