Tentative code to train an offline RL algorithm on healthcare data. At the time of writing, this code samples heavily from Ilya Kostrikov's IQL Github repo.