XG Boost Algorithm

XG Boost Algorithm

Basically, XGBoost is an algorithm. Also, it has recently been dominating applied machine learning. XGBoost is an implementation of gradient boosted decision trees. Although, it was designed for speed and performance. Basically, it is a type of software library. That you can download and install on your machine. Then have to access it from a variety of interfaces.

Model Features

XGBoost model implementation supports the features of the scikit-learn and R implementations. Three main forms of gradient boosting are supported:

Gradient Boosting

This is also called as gradient boosting machine including the learning rate.

Stochastic Gradient Boosting

This is the boosting with sub-sampling at the row, column, and column per split levels.

Regularized Gradient Boosting

It includes boosting with both L1 and L2 regularization.

Algorithm Features

For efficiency of computing time and memory resources, we use XGBoost algorithm. Also, this was designed to make use of available resources to train the model.

Some key algorithm implementation features include:

  • Sparse aware implementation with automatic handling of missing data values.


  • Block structure to support the parallelization of tree construction.


  • Continued training so that you can further boost an already fitted model on new data.


  • XGBoost is free open source software. That is available for use under the permissive Apache-2 license.

Why XGBoosting is good?

a. Flexibility

XGBoosting supports user-defined objective functions with classification, regression and ranking problems. We use an objective function to measure the performance of the model. That is given a certain set of parameters. Furthermore, it supports user-defined evaluation metrics as well.

b. Availability

As it is available for programming languages such as R, Python, Java, Julia, and Scala.

c. Save and Reload

We can easily save our data matrix and model and reload it later. Let suppose, if we have a large dataset, we can simply save the model. Further, we use it in future instead of wasting time redoing the computation.

 Xgboost Algorithm – Parameters

a. General Parameters

Following are the General parameters used in Xgboost Algorithm:

  • silent: The default value is 0. You need to specify 0 for printing running messages, 1 for silent mode.


  • booster: The default value is gbtree. You need to specify the booster to use: gbtree (tree based) or gblinear (linear function).


  • num_pbuffer: This is set automatically by xgboost Algorithm, no need to be set by a user. Read the documentation of xgboost for more details.


  • num_feature: This is set automatically by xgboost Algorithm, no need to be set by a user.

b. Booster Parameters

Below we discussed tree-specific parameters in Xgboost Algorithm:

  • eta: The default value is set to 0.3. You need to specify step size shrinkage used in an update to prevents overfitting. After each boosting step, we can directly get the weights of new features. eta actually shrinks the feature weights to make the boosting process more conservative. The range is 0 to 1. Low eta value means the model is more robust to overfitting.


  • gamma: The default value is set to 0. You need to specify minimum loss reduction required to make a further partition on a leaf node of the tree. The larger, the more conservative the algorithm will be. The range is 0 to ∞. Larger the gamma more conservative the algorithm is.


  • max_depth: The default value is set to 6. You need to specify the maximum depth of a tree. The range is 1 to ∞.


  • min_child_weight: The default value is set to 1. You need to specify the minimum sum of instance weight(hessian) needed in a child. If the tree partition step results in a leaf node. Then with the sum of instance weight less than min_child_weight. Then the building process will give up further partitioning. In linear regression mode, corresponds to a minimum number of instances needed to be in each node. The larger, the more conservative the algorithm will be. The range is 0 to ∞.


  • max_delta_step: The default value is set to 0. Maximum delta step we allow each tree’s weight estimation to be. If the value is set to 0, it means there is no constraint. If it is set to a positive value, it can help make the update step more conservative. Usually, this parameter is not needed, but it might help in logistic regression. Especially, when a class is extremely imbalanced. Set it to a value of 1-10 might help control the update.The range is 0 to ∞.


  • subsample: The default value is set to 1. You need to specify the subsample ratio of the training instance. Setting it to 0.5 means that XGBoost randomly collected half of the data instances. That needs to grow trees and this will prevent overfitting. The range is 0 to 1.


  • colsample_bytree: The default value is set to 1. You need to specify the subsample ratio of columns when constructing each tree. The range is 0 to 1.


c. Linear Booster Specific Parameters

These are Linear Booster Specific Parameters in XGBoost Algorithm.

  • lambda and alpha: These are regularization term on weights. Lambda default value assumed is 1 and alpha are 0.
  • lambda_bias: L2 regularization term on bias and has a default value of 0.


d. Learning Task Parameters

Following are the Learning Task Parameters in XGBoost Algorithm

  • base_score: The default value is set to 0.5. You need to specify the initial prediction score of all instances, global bias.


  • objective: The default value is set to reg:linear. You need to specify the type of learner you want. That includes linear regression, Poisson regression etc.


  • eval_metric: You need to specify the evaluation metrics for validation data. And a default metric will be assigned according to the objective.


  • seed: As always here you specify the seed to reproduce the same set of outputs.

Conclusion

As a result, we have studied Introduction to XGBoost. learned XGBoosting features, Why we use XGBoost Algorithms, and Why XGBoosting is good. General Parameters, Booster Parameters, Linear Booster Specific Parameters, Learning Task Parameters.






To view or add a comment, sign in

More articles by Revanth Yadama

  • Encode-Categorical-Features

    Encode-Categorical-Features

    Handling Categorical/Qualitative variables is an important step in data pre-processing. Many Machine learning…

    1 Comment
  • Bias and Variance in Machine Learning

    Bias and Variance in Machine Learning

    Overview Learn to interpret Bias and Variance in a given model. What is the difference between Bias and Variance? How…

Insights from the community

Others also viewed

Explore topics