My PhD focuses on Continual Learning with Transformer Architectures for magnetic resonance images (MRIs) and computer tomography (CT) scans. Changing patient populations over time as well as different acquisition techniques across and within medical institutions lead to shifts in the data domain. Networks only trained on a single domain inevitably create unreliable predictions for out-of-distribution images. Transformer Architectures help to contain this restriction, however they are not perfectly suited for a direct application on segmentation tasks. My goal is to use Deep Learning based Transformer registration models for atlas-based segmentation in clinical multi-institutional settings to fully leverage the potential of Transformers known from NLP and Machine Translation.