A Python implementation of the Hoeffding Tree algorithm, also known as Very Fast Decision Tree (VFDT).
The Hoeffding Tree is a decision tree for classification tasks in data streams.
Pedro Domingos and Geoff Hulten. 2000.
Mining high-speed data streams.
In Proceedings of the sixth ACM SIGKDD international conference on
Knowledge discovery and data mining (KDD '00).
ACM, New York, NY, USA, 71-80.
This implementation was initially based on Weka's Hoeffding Tree and the original work by Geoff Hulten and Pedro Domingos, VFML. Although it is based on these I cannot guarantee that the algorithm will work exactly, or even produce the same output, as any of these implementations. Most of the class and variable names, for example, follow Weka's implementation in order to ease the use of the algorithm for some of my peers who are used to Weka when performing data mining tasks, but there may be significant changes in the code behind it.
If you use this in your paper, please cite:
Vitor da Silva and Ana Trindade Winck. 2017.
Video popularity prediction in data streams based on context-independent features.
In Proceedings of the Symposium on Applied Computing (SAC '17).
ACM, New York, NY, USA, 95-100.
DOI: https://doi.org/10.1145/3019612.3019638