Questions tagged [gradient-boosting-decision-trees]

54 questions
6 votes, 1 answer

What is Pruning & Truncation in Decision Trees?

Pruning & Truncation. As per my understanding, truncation means stopping the tree while it is still growing so that it does not end up with leaves containing very few data points. One way to do this is to set a minimum number of training inputs to use on each…
Pluviophile
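For readers comparing the two, a minimal sketch in scikit-learn (dataset and parameter values are illustrative, not from the question): truncation-style pre-pruning stops growth early, while cost-complexity pruning cuts back a fully grown tree.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)

# Truncation (pre-pruning): stop growing so every leaf keeps enough samples.
truncated = DecisionTreeClassifier(min_samples_leaf=20, max_depth=5).fit(X, y)

# Pruning (post-pruning): grow fully, then cut back with cost-complexity pruning.
pruned = DecisionTreeClassifier(ccp_alpha=0.01).fit(X, y)

print(truncated.get_n_leaves(), pruned.get_n_leaves())
```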
5 votes, 1 answer

Multi-target regression tree with additional constraint

I have a regression problem where I need to predict three dependent variables ($y$) based on a set of independent variables ($x$): $$ (y_1,y_2,y_3) = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \dots + \beta_n x_n +u. $$ To solve this problem, I would…
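A minimal sketch of the usual baseline, assuming scikit-learn and synthetic data: MultiOutputRegressor fits one boosted model per target, so it does not enforce any cross-target constraint on its own.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.multioutput import MultiOutputRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 10))                                            # independent variables x
Y = X @ rng.normal(size=(10, 3)) + rng.normal(scale=0.1, size=(500, 3))   # targets y1..y3

model = MultiOutputRegressor(GradientBoostingRegressor()).fit(X, Y)
print(model.predict(X[:5]).shape)  # (5, 3): one column per target
```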
4 votes, 1 answer

XGBoost - Imputing vs keeping NaN

What is the benefit of imputing numerical or categorical features when using DT methods such as XGBoost that can handle missing values? This question is mainly for when the values are missing not at random. An example of missing not at random…
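A minimal sketch of the two options being compared, on synthetic data with injected missingness; values and parameters are illustrative.

```python
import numpy as np
import xgboost as xgb
from sklearn.impute import SimpleImputer

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
y = X[:, 0] + rng.normal(scale=0.1, size=1000)
X[rng.random(X.shape) < 0.2] = np.nan  # inject missing values

# Option 1: keep NaN; XGBoost learns a default direction for missing values per split.
native = xgb.XGBRegressor(n_estimators=100).fit(X, y)

# Option 2: impute first; the "is missing" signal is lost unless added as an indicator.
X_imputed = SimpleImputer(strategy="median").fit_transform(X)
imputed = xgb.XGBRegressor(n_estimators=100).fit(X_imputed, y)
```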
4 votes, 3 answers

Am I building a good or bad prediction model using the Gradient Boosting Classifier algorithm?

I am building a binary classification model using GB Classifier for imbalanced data with an event rate of 0.11% and a sample size of 350,000 records (split into 70% training & 30% testing). I have successfully tuned hyperparameters using GridSearchCV, and…
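With a 0.11% event rate, accuracy is uninformative, so whatever the tuning outcome, it helps to score on an imbalance-aware metric. A minimal sketch (X_train/y_train stand in for the question's 70% split):

```python
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold

param_grid = {"learning_rate": [0.05, 0.1], "max_depth": [3, 5]}
search = GridSearchCV(
    GradientBoostingClassifier(),
    param_grid,
    scoring="average_precision",       # PR-AUC: far more informative than accuracy here
    cv=StratifiedKFold(n_splits=5),    # preserves the 0.11% event rate in each fold
)
# search.fit(X_train, y_train)
```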
3 votes, 3 answers

Example for Boosting

Can someone tell me exactly how boosting as implemented by LightGBM or XGBoost works in a real-world scenario? I know it splits the tree leaf-wise instead of level-wise, which contributes to the global average, not just the loss of the branch which…
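For reference, a minimal sketch of the two growth policies the excerpt contrasts: LightGBM grows leaf-wise (best-first) by default, while XGBoost grows depth-wise by default but can be switched to leaf-wise. Parameter values are illustrative.

```python
import lightgbm as lgb
import xgboost as xgb

leafwise = lgb.LGBMClassifier(num_leaves=31)               # leaf-wise: cap leaves, not depth
levelwise = xgb.XGBClassifier(max_depth=6)                 # depth-wise (XGBoost default)
leafwise_xgb = xgb.XGBClassifier(tree_method="hist",
                                 grow_policy="lossguide",  # leaf-wise, like LightGBM
                                 max_leaves=31)
```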
2 votes, 1 answer

What if the root of such a tree is pruned in XGBoost?

Extreme Gradient Boosting stops growing a tree if $\gamma$ is greater than the impurity reduction given in eq. (7) (see below). What happens if the tree's root has a negative impurity reduction? I think there is no way for boosting to go on because the next…
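For reference, eq. (7) of the XGBoost paper is the loss reduction of a candidate split, with $G_L, H_L$ and $G_R, H_R$ the sums of first- and second-order gradients in the left and right children:

$$ \mathcal{L}_{split} = \frac{1}{2}\left[\frac{G_L^2}{H_L+\lambda} + \frac{G_R^2}{H_R+\lambda} - \frac{(G_L+G_R)^2}{H_L+H_R+\lambda}\right] - \gamma. $$

Note that even if the root split is pruned, the tree is still a single leaf with weight $-G/(H+\lambda)$, which is generally nonzero, so the predictions (and hence the next round's gradients) still change and boosting can continue.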
2 votes, 1 answer

My tree-based models keep overfitting

This is a multi-class classification project. Each model severely overfits: Decision Tree, Random Forest, and especially XGBoost, and the classification report reflects that. where the csv…
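Without the code it is hard to diagnose, but a minimal sketch of the usual overfitting levers for XGBoost (values are illustrative starting points, not tuned recommendations; X_train/X_val are hypothetical):

```python
import xgboost as xgb

model = xgb.XGBClassifier(
    max_depth=4,               # shallower trees generalize better
    min_child_weight=10,       # require more evidence before splitting
    subsample=0.8,             # row subsampling per tree
    colsample_bytree=0.8,      # feature subsampling per tree
    reg_lambda=1.0,            # L2 penalty on leaf weights
    learning_rate=0.05,
    n_estimators=2000,
    early_stopping_rounds=50,  # stop when validation loss stops improving
)
# model.fit(X_train, y_train, eval_set=[(X_val, y_val)])
```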
2 votes, 1 answer

How to determine the feasible domain of a trained tree model?

As far as I know, tree models (such as those trained using xgboost/lightgbm) make reasonable predictions only if the input feature vector is similar to the training set data. If the feature vector looks like an outlier, then the prediction result is…
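One coarse check in the spirit of the question, sketched below: flag inputs that fall outside the per-feature range seen in training. This ignores correlations between features, so it is only a first filter; the function name is hypothetical.

```python
import numpy as np

def in_training_range(X_train, x_new):
    """True if every feature of x_new lies within the range seen in training."""
    lo, hi = X_train.min(axis=0), X_train.max(axis=0)
    return bool(np.all((x_new >= lo) & (x_new <= hi)))
```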
2 votes, 0 answers

Handling Missing Values in Predictor Variables for Gradient Boosting Models (gbm()) in R

I am currently working on a predictive modeling project using the gbm package in R and have encountered a challenge regarding missing values in one of my predictor variables. I would appreciate your insights and recommendations on the best practices…
Anso
2 votes, 2 answers

Why does the regression model produced by XGBoost depend on the order of the training data when more than 8194 data points are used?

When I use XGBRegressor to construct a boosted tree model from 8194 or fewer data points (i.e., n_train $\leq$ 8194, where n_train is defined in the code below) and randomly shuffle the data points before training, the fit method is order…
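A minimal sketch of the comparison the question describes, on synthetic data: fit the same points in two orders and measure the prediction gap. One explanation that has been offered is that the hist method's quantile sketch becomes order-dependent above a data-size threshold, but treat that as an assumption here.

```python
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
n = 10_000                       # above the 8194 threshold the question reports
X = rng.normal(size=(n, 4))
y = X[:, 0] + rng.normal(scale=0.1, size=n)

perm = rng.permutation(n)
m1 = xgb.XGBRegressor(tree_method="hist").fit(X, y)
m2 = xgb.XGBRegressor(tree_method="hist").fit(X[perm], y[perm])
print(np.max(np.abs(m1.predict(X) - m2.predict(X))))  # nonzero => order-dependent fit
```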
2 votes, 0 answers

Transfer learning for tabular data

I wonder if transfer learning can be used with tabular data similarly to how it's used in neural networks for image recognition. My idea would be to train a "general" model and then "localize" it using a specific dataset. I have a problem akin to this…
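One way to sketch the "train general, then localize" idea with boosted trees is to continue boosting from an existing model, which XGBoost supports via the xgb_model argument. X_general/X_local below are hypothetical datasets:

```python
import xgboost as xgb

general = xgb.XGBRegressor(n_estimators=200)
# general.fit(X_general, y_general)           # "general" model on the large dataset

local = xgb.XGBRegressor(n_estimators=50, learning_rate=0.01)
# local.fit(X_local, y_local,                 # "localize": keep boosting on the new data
#           xgb_model=general.get_booster())
```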
2 votes, 0 answers

Tuning the learning rate parameter for GBDT models

I've always been taught that decreasing the learning rate parameter in GBDT models such as XGBoost, LightGBM and CatBoost will improve the out-of-sample performance, assuming the number of iterations is increased accordingly and all else…
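A minimal sketch of the trade-off described: lower the learning rate, raise the iteration budget, and let early stopping pick the actual number of trees on a validation set. The values and the X_train/X_val names are illustrative.

```python
import lightgbm as lgb

fast = lgb.LGBMRegressor(learning_rate=0.1, n_estimators=500)
slow = lgb.LGBMRegressor(learning_rate=0.01, n_estimators=5000)  # 10x smaller step, 10x budget
# for m in (fast, slow):
#     m.fit(X_train, y_train, eval_set=[(X_val, y_val)],
#           callbacks=[lgb.early_stopping(stopping_rounds=100)])
```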
2 votes, 1 answer

Model performance impact on social discrimination?

I am currently working on a project where the data concerns people, and the dataset contains personal data with sensitive attributes (typically: age, sex, handicap, race). Now it seems there are mainly three options for modelling: not take the…
Lucas Morin
2 votes, 1 answer

Random LightGBM Forest

I'm not completely sure about the bias/variance properties of boosted decision trees (LightGBM especially), so I wonder whether we would generally expect a performance boost from an ensemble of multiple LightGBM models, just as with Random Forest?
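Two routes worth distinguishing, sketched below with illustrative parameters: LightGBM's built-in random-forest mode (bagging instead of boosting), and a hand-rolled average of boosted models trained with different seeds.

```python
import numpy as np
import lightgbm as lgb

# Route 1: LightGBM as a random forest; rf mode requires row subsampling.
rf = lgb.LGBMRegressor(boosting_type="rf", subsample=0.8, subsample_freq=1,
                       colsample_bytree=0.8)

# Route 2: average several boosted models that differ only by seed.
models = [lgb.LGBMRegressor(random_state=s) for s in range(5)]
# preds = np.mean([m.fit(X_train, y_train).predict(X_test) for m in models], axis=0)
```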
1 vote, 0 answers

Why is the average prediction moving away from the average response for a reg:gamma model?

I'm predicting a response that I would typically model under a gamma distribution, with relatively simple parameters. I'm just using the defaults other than these: learning_rate = 0.01, max_depth = 6, base_score = the average of y. Since my base_score…
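A minimal sketch of the diagnostic the question implies, on synthetic gamma-distributed data: set base_score to the mean of y and compare the average prediction with the average response after fitting.

```python
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 5))
y = rng.gamma(shape=2.0, scale=np.exp(0.3 * X[:, 0]))  # positive response

model = xgb.XGBRegressor(objective="reg:gamma", learning_rate=0.01,
                         max_depth=6, base_score=float(y.mean()))
model.fit(X, y)
print(y.mean(), model.predict(X).mean())  # watch whether the means drift apart
```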