Furthermore, quick development and short training times allow us to treat some of the component choices as hyper-parameters. Such configuration choices can then be tuned like any other hyper-parameter to optimize the final retrieval performance.
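
As a minimal sketch of this idea, the snippet below puts categorical component choices in the same search space as a numeric hyper-parameter and selects the best configuration by exhaustive grid search. The component names (`encoder`, `pooling`), their options, and the `train_and_evaluate` placeholder are hypothetical and serve only as illustration. In practice the placeholder would be replaced by a short training run that returns a validation retrieval metric.

```python
import itertools

# Hypothetical search space: component choices sit next to an ordinary
# numeric hyper-parameter. None of these names come from the original text.
SEARCH_SPACE = {
    "encoder": ["bow", "cnn"],       # assumed component options
    "pooling": ["mean", "max"],      # assumed component options
    "learning_rate": [1e-3, 1e-4],   # ordinary hyper-parameter
}


def train_and_evaluate(config):
    """Placeholder for a short training run.

    Returns a made-up validation score so that the sketch is runnable;
    a real version would train the model and measure retrieval quality.
    """
    score = 0.5
    score += 0.10 if config["encoder"] == "cnn" else 0.0
    score += 0.05 if config["pooling"] == "max" else 0.0
    score += 0.02 if config["learning_rate"] == 1e-3 else 0.0
    return score


def grid_search(space, evaluate):
    """Evaluate every combination in the space and keep the best one."""
    keys = list(space)
    best_config, best_score = None, float("-inf")
    for values in itertools.product(*(space[k] for k in keys)):
        config = dict(zip(keys, values))
        score = evaluate(config)
        if score > best_score:
            best_config, best_score = config, score
    return best_config, best_score


best_config, best_score = grid_search(SEARCH_SPACE, train_and_evaluate)
print(best_config, best_score)
```

Exhaustive search is only practical because each run is assumed to be short; with a larger space, random search or another tuning strategy could be substituted without changing how the component choices are represented.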