Skip to content

A Python implementation of the Hoeffding Tree algorithm.

License

Notifications You must be signed in to change notification settings

Instance-contrib/HoeffdingTree

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HoeffdingTree

A Python implementation of the Hoeffding Tree algorithm, also known as Very Fast Decision Tree (VFDT).

The Hoeffding Tree is a decision tree for classification tasks in data streams.

Pedro Domingos and Geoff Hulten. 2000. 
Mining high-speed data streams.
In Proceedings of the sixth ACM SIGKDD international conference on
  Knowledge discovery and data mining (KDD '00). 
ACM, New York, NY, USA, 71-80.

This implementation was initially based on Weka's Hoeffding Tree and the original work by Geoff Hulten and Pedro Domingos, VFML. Although it is based on these I cannot guarantee that the algorithm will work exactly, or even produce the same output, as any of these implementations. Most of the class and variable names, for example, follow Weka's implementation in order to ease the use of the algorithm for some of my peers who are used to Weka when performing data mining tasks, but there may be significant changes in the code behind it.

If you use this in your paper, please cite:

Vitor da Silva and Ana Trindade Winck. 2017.
Video popularity prediction in data streams based on context-independent features. 
In Proceedings of the Symposium on Applied Computing (SAC '17). 
ACM, New York, NY, USA, 95-100. 
DOI: https://doi.org/10.1145/3019612.3019638

About

A Python implementation of the Hoeffding Tree algorithm.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%