Training the Neural Networks#
After the preprocessing is finished, we can start training the different taggers which are supported by the Umami Framework. Theses taggers are DIPS, DIPS Attention, CADS, DL1, different versions of DL1* (like DL1r, DL1d, etc.), Umami, Umami Conditional Attention.
All taggers in the Umami framework are trained using tensorflow
with the Keras
backend. The different models are all able to utilise GPU resources which will drastically shorten the time needed for training. In the following sections, the different steps and the config file are explained in detail which are needed to successfully train one of the taggers.
In general, the train config file consists of 4 parts:
- Global Settings (Explained in Start Training your Model)
- Network Settings (Explained in Start Training your Model)
- Validation Settings (Explained in Validate your Training)
- Evaluation Settings (Explained in Evaluate your Training)
Example train config files for the different taggers can be found here. Using DIPS as an example in the following sections, the different options of the train config are explained. While the basic options needed/provided inside the config files are the same for all taggers, some options are only available for some other.