bigfootproject/spark-dectree

Classification and Regression Tree algorithms for Apache Spark

This repository focuses on a possible implementation of decision trees for Apache Spark.

It contains packages that help build two kinds of tree models, Classification Trees and Regression Trees, in a scalable environment on the Spark platform.

In particular, the Regression Tree is implemented with the CART algorithm, introduced by Breiman et al. in 1984. The Classification Tree comes in two types: a Binary Classification Tree (implemented with CART) and a Multi-way Classification Tree (built with ID3).
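
To make the difference between the two algorithms concrete, here is a minimal single-machine sketch in plain Python. It is not this repository's code or API: the function names (`sse`, `best_binary_split`, `entropy`, `information_gain`) are hypothetical, and the sketch assumes one numeric feature for the CART case and one categorical feature for the ID3 case. CART regression picks the binary threshold that minimizes the summed squared error of the two children, while ID3 scores a multi-way split (one branch per distinct value) by its information gain.

```python
from collections import Counter
from math import log2


def sse(values):
    """Sum of squared errors around the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def best_binary_split(xs, ys):
    """CART-style search on one numeric feature.

    Candidate thresholds are midpoints between consecutive distinct x values.
    Returns (threshold, sse_after_split), or (None, sse(ys)) if no split helps.
    """
    pairs = sorted(zip(xs, ys))
    best_threshold, best_cost = None, sse(ys)
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # identical x values cannot be separated by a threshold
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        cost = sse(left) + sse(right)
        if cost < best_cost:
            best_threshold, best_cost = threshold, cost
    return best_threshold, best_cost


def entropy(labels):
    """Shannon entropy (in bits) of a non-empty list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """ID3-style gain of a multi-way split: one branch per distinct value."""
    branches = {}
    for value, label in zip(feature_values, labels):
        branches.setdefault(value, []).append(label)
    remainder = sum(len(b) / len(labels) * entropy(b) for b in branches.values())
    return entropy(labels) - remainder
```

A tree builder would apply these scores recursively: choose the best feature and split at each node, partition the rows, and repeat on each child until a stopping rule is met. On Spark, the per-candidate statistics would typically be computed in a distributed way rather than with in-memory lists as here.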

You can also use cross-validation to prune tree models, or use Random Forest to aggregate several trees and improve prediction accuracy.
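
The aggregation step of a Random Forest can be sketched as follows. This is an illustrative sketch in plain Python, not this repository's API: the helper names (`bootstrap_sample`, `aggregate_regression`, `aggregate_classification`) are hypothetical, and tree training itself is left out. Each tree would be trained on a bootstrap sample of the rows; regression forests average the per-tree predictions, and classification forests take a majority vote.

```python
import random
from collections import Counter
from statistics import mean


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement; each tree trains on one such sample."""
    return [rng.choice(rows) for _ in rows]


def aggregate_regression(predictions):
    """Combine per-tree numeric predictions by averaging them."""
    return mean(predictions)


def aggregate_classification(predictions):
    """Combine per-tree class labels by majority vote.

    On a tie, Counter.most_common keeps the label that was seen first.
    """
    return Counter(predictions).most_common(1)[0][0]
```

For example, three trees predicting `2.0`, `4.0`, and `6.0` would be averaged to `4.0`, and votes of `"yes"`, `"no"`, `"yes"` would resolve to `"yes"`.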

You can find some information about our work here:

  1. Overview about decision tree

  2. Regression Tree with CART algorithm

  3. How to use Regression Tree in this repository
