The model used for the system dynamics is relatively simple for now. If you start with a complex model with so few data, you run into overfitting easily. If it is modeled by a neural network, it is usually small. What technology to use is not very important as long as it can do the work.
There are papers on how to start from a model-based training and then evolve to model-free training later.