Reimplementation of Learning Simple Algorithms From Examples paper on Julia
- Revisit grid tasks data generation
Make supervised grid tasks workingGRU controllerModel saveVisualization/demo(Have some harmless bugs. Still wip)- Q-learning (work in progress)
implement make_batches functionimplement loss/train functiondebug run_episodes! and other things- correct the objective and training