LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed and efficient with the following advantages:
- Faster training speed and higher efficiency
- Lower memory usage
- Better accuracy
- Parallel learning supported
- Capable of handling large-scale data
For more details, please refer to Features.
Experiments on public datasets show that LightGBM can outperform other existing boosting framework on both efficiency and accuracy, with significant lower memory consumption. What's more, the experiments show that LightGBM can achieve a linear speed-up by using multiple machines for training in specific settings.
12/05/2016 : Categorical Features as input directly(without one-hot coding). Experiment on Expo data shows about 8x speed-up with same accuracy compared with one-hot coding (refer to categorical log and one-hot log). For the setting details, please refer to IO Parameters.
12/02/2016 : Release python-package beta version, welcome to have a try and provide issues and feedback.
To get started, please follow the Installation Guide and Quick Start.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.