The repo plots the scalled softmax and sigmoid functions on the same underlying dataset for better understanding of how the two behave. By plotting the two in a similar setting in order to understand the difference between softmax and sigmoid.
Notice that the activation function softmax is scalled by a factor lambda=10 for better visualization.