Highest Voted 'generalization' Questions - Data Science Stack Exchange

12

votes

2 answers

The differences between SVM and Logistic Regression

I am reading about SVM and I've faced to the point that non-kernelized SVMs are nothing more than linear separators. Therefore, is the only difference between an SVM and logistic regression the criterium to choose the boundary? Apparently, SVM…

asked May 09 '18 at 15:34

David Masip

6,136
2
28
62

9

votes

2 answers

High accuracy on test-set, what could go wrong?

You are given a pre-trained binary ML classification model with 99% accuracy on the test-set (assume the customer required 95% and that the test-set is balanced). We would like to deploy our model in production. What could go wrong? How would you…

machine-learning generalization

asked Dec 17 '20 at 11:20

CodeHoarder

193
1
4

7

votes

2 answers

Multilabel classification: Choosing threshold

I'm creating a multilabel classification approach based on sentence embeddings applied to text taken from a chatbot. We have the following: a training dataset of 2,500 lines, where each line is a sentence associated with a particular label (the…

machine-learning dataset multilabel-classification generalization

asked Jul 11 '23 at 14:20

LasiusMind

71
3

6

votes

1 answer

Training on accurate data versus noisy data

I have data currently available that is very accurate and I would like to train my classification methods on this set of clean data to learn the important markers for distinguishing between classes. But in the future, my trained classifiers will not…

machine-learning error-handling noise generalization

asked Sep 12 '18 at 20:05

Mathews24

195
9

5

votes

1 answer

On design of the training set: conceptual question

I am curious to know how training data should be constructed so that it scales to examples that are not a part of the training data. For example, the problem that I am facing right now is in the application of identifying or distinguishing the…

machine-learning training generalization

asked Aug 26 '19 at 01:27

Sm1

541
5
19

5

votes

1 answer

The connection between optimization and generalization

Optimization algorithms such as gradient descent or particle swarm can find a minima in a function. On the other hand, learning methods such as back-prop define learning as an optimization problem and are used to learn weights in deep neural…

machine-learning neural-network deep-learning optimization generalization

asked Mar 23 '18 at 12:38

PickleRick

151
4

3

votes

1 answer

What is NLP technique to generalize manually created rules in text?

Let's say we have a free text containing key-value entities. Example: "... patient's tumour has width 6 cm and height 5 cm" Then an expert comes, marks it as important, thus we do have the rule for finding the same entity in new, different…

nlp text-mining information-retrieval named-entity-recognition generalization

asked Oct 31 '18 at 10:09

ludgo

133
3

2

votes

1 answer

Does convergence equal learning in Deep Q-learning?

In my current research project I'm using the Deep Q-learning algorithm. The setup is as follows: I'm training the model (using Deep Q-learning) on a static dataset made up of experiences extracted from N levels of a given game. Then, I want to use…

deep-learning reinforcement-learning q-learning convergence generalization

asked Feb 21 '21 at 16:00

Aeryan

81
1
6

2

votes

1 answer

Multilabel Classification - Overfitting?

My task is the following: To input drug combinations and output renal failure-related symptoms from the drug combinations. Both the drug combinations and renal-failure related symptoms are represented as one-hot encoded (for example, someone getting…

neural-network deep-learning multilabel-classification overfitting generalization

asked Sep 08 '20 at 12:20

shepan6

1,486
7
14

2

votes

1 answer

Why does this paper say that 0-1 loss is insensitive to scaling of weights in a neural network?

When discussing capacity control using norms of weights in a neural network,this paper says the following(see P4): Capacity control in terms of norm, when using a zero/one loss (i.e. counting errors) requires us in addition to account for scaling…

neural-network deep-learning loss-function generalization

asked Mar 02 '20 at 09:22

j200932

121
2

2

votes

1 answer

XGBoost: what to do when Kfold is not enough?

I have a dataset made of roughly 100 time-series and my final goal is to obtain a classification of each point (detection problem). To do so I have labels so I decided to use an XGB model to perform the detection over some features that I have…

time-series xgboost cross-validation training generalization

asked Mar 06 '19 at 17:50

FrankBool

23
4

2

votes

1 answer

How to train a neural network to generalize out-of-distribution (e.g., sin function extrapolation) without data leakage?

I am trying to train a neural network to approximate the sin(x) function, but I want it to generalize outside the range of the training data. Specifically, I train the network on x values within [-π, π] and test it on a disjoint range, such as [π,…

neural-network generalization grokking

asked Feb 10 '25 at 00:02

Amirhossein Rezaei

623
3
14

2

votes

1 answer

Which type of models generalize better, generative or discriminative models?

In NLP, which type of models (generative or discriminative) is more sensitive to the amount of data to generalize better? references? This is related to the way those two types capture the data probability (join-prob. vs conditional prob.)?

nlp generative-models generalization

asked Nov 15 '21 at 17:03

Minions

262
2
15

1

vote

2 answers

Graph Neural Network fails at generalizing on unseen graph topologies

I'm using PytorchGeometric to train a graph convolutional network for regression over nodes problem (the graph models physical phenomena in the network of sensors; the network of sensors is actually the network of measurements distributed across the…

machine-learning generalization graph-neural-network

asked Jun 10 '21 at 20:25

sesli

11
3

1

vote

0 answers

Generalization error problem on training set

Training data: $\mathcal {T} =\{(2,1),(3,2),(4,6),(0,0),(1,1)\}$ you already computed a predictor for the output using linear regression by least squares, where you used the first 3 samples as training samples: $f(X) = -4.5  +  2.5X$ Approximate the…

machine-learning variance bias generalization

asked May 20 '21 at 20:41

todapod264

11
1

Questions tagged [generalization]