- Run the net over the notes first.
- There is more error as we go further, should we implement something for this? Perhaps this will guide to a better idea: https://machinelearningmastery.com/what-is-imbalanced-classification/
- Make tests with the optimizator to how iw behaves respect the number of steps that it makes.
- Read why to use the Adamoptimizer: https://machinelearningmastery.com/ adam-optimization-algorithm-for-deep-learning/
- Use a mapping from the distributions withouth giving the songs the same weight.
- Sum something additional to the cost function.
- Change interval_len, batch len variables to have a dependency with the number of notes that can be reolved or at least give better names.
- See the meaning of the conservativity.
- Review mask in training.